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Abstract 

The companionate'theory of marriage suggests that egalitarianism in practice and belief 
leads to higher marital quality for wives and higher levels of positive emotion work on the 
part of husbands. Our analysis of women’s marital quality and men’s marital emotion 
work provides little evidence in support of this theory. Rather, in examining women’s 
marital quality and men’s emotional investments in marriage, we find that dyadic 
commitment to institutional ideals about marriage and women’s contentment with the 
division of household tasks are more critical. We also show that men’s marital emotion 
work is a very important determinant of women’s marital quality. We conclude by noting 
that “her” marriage is happiest when it combines elements of the new and old: that is, 
gender equity and normative commitment to the institution of marriage. 


The last century has witnessed profound changes in the functions, character and stability of 
marriage. In particular, the concomitant rise in women’s social and economic status, the relative 
decline in social and economic functions once associated with the family, and the increased 
cultural power of expressive individualism have all conspired to heighten the importance of the 
emotional life of marriage. The emotional functions and character of marriage have become 
particularly crucial for contemporary marital happiness and marital stability as other sources of 
satisfaction and/or stability — home production, childrearing, the gendered division of labor and 
religious authority — have migrated to other sectors or weakened (Bumpass 1990; Cherlin 2004). 
Sentiment is increasingly the tie that binds together contemporary marriages. 

The literature on marriage suggests that the emotional character of marriage is an 
especially salient determinant of women’s marital quality (Erickson 1993; Wilkie, Ferree and 
Ratcliff 1998). Women are particularly vested in the emotional quality of their marriages 
because they have long borne the primary emotional burdens of family life. Their stake in the 
emotional character of their marriages is also rooted in gendered patterns of childhood 
socialization that encourage female proficiency in and sensitivity towards emotional dynamics 
in relationships (England and Farkas 1986; Maccoby 1998; Thompson and Walker 1989). 
Furthermore, because other sources of marital satisfaction have declined, women should 
now place a premium on the emotional quality of their marriages that outweighs other 
potential sources of marital satisfaction. Indeed, recent evidence indicates that the emotional 
quality of marriage is a better predictor of divorce for wives than husbands (Nock 2001; Sayer 
and Bianchi 2000). For all of these reasons, the “emotion work” (Hochschild 1979: 561) that 
men do in marriage — and the assessments their wives make of this work —is, in all likelihood, 
a crucial determinant of women’s marital quality. 


We thank Paula England, Laura Sanchez and anonymous reviewers for Social Forces for their helpful 
comments. David Franz provided editorial assistance. This research was funded by grants to the first author 
from the U.S. Department of Health and Human Services (Grant 90XP0048 ) and the Lilly Endowment 
(Grant # 2002 2301-000). Direct correspondence to W. Bradford Wilcox, Department of Sociology, 
University of Virginia, P.O. Box 400766, Charlottesville, VA 22904. E-mail: wbwilcox@virginia.edu. 
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Accordingly, using data from the second wave of the National Survey of Families and 
Households (NSFH2 [1992-1994]), this paper tests the relative importance of men’s emotion 
work compared to a range of cultural and sociodemographic factors in determining women’s 
marital quality. Drawing on Hochschild’s work (1979, 1989), we define husbands’ marital 
emotion work as any effort to express positive emotion to their wives, to be attentive to the 
dynamics of their relationship and the needs of their wives, or to set aside time for activities 
focused specifically on their relationship. After demonstrating the crucial import of men’s 
marital emotion work, we then set out to determine the key determinants of men’s emotion 
work in marriage. Specifically, we elaborate four theoretical models, which test the relative 
contributions that gender egalitarianism, normative commitment to marriage, gender equity 
and gender traditionalism make to men’s marital emotion work (and women’s marital quality). 


Theoretical Perspectives on Marriage and Men’s Emotion Work 


A Companionate Model of Marriage 


Many contemporary family scholars argue that egalitarian marriages are characterized by 
the kind of emotion work — affection, empathy, quality time devoted to intimacy — that 
makes for high-quality, stable marriages (Amato, Johnson, Booth and Rogers 2003; 
Blumstein and Schwartz 1983; Burgess, Locke, and Thomes 1963; England and Farkas 
1986; Goldscheider and Waite 1991). What we call the companionate theory of marriage 
is predicated on three assumptions about the links between egalitarianism and marital 
emotion work.' 

First, spouses share similar work and family responsibilities. Such role sharing is supposed 
to increase the quality of emotion work in marriage by providing husbands and wives with 
common experiences and interests around which they can build conversations, empathetic 
regard, mutual understanding and the like. The companionate marriage stands in clear 
contrast to an older model of marriage where women specialize in expressive, private 
functions and mén specialize in instrumental, public functions. The blurring or elimination of 
such gender roles, advocates of companionate marriage suggest, will result in a richer 
emotional life where, among other things, men do more marital emotion work. 

Second, the elimination of patriarchal authority and power is seen as a key mechanism 
for promoting marital intimacy. Classical social theory has long noted the tensions between 
authority or power and intimacy (Weber 1978). The exercise of authority and power is usually 
associated with social distance, and marital theorists have argued that one of the reasons that 
men are less expressive in marriage is that they seek to protect their traditional dominance by 
limiting their expression of affect. Likewise, women’s financial dependence on marriage has 
led them to cater to the emotional needs of their mates and to the emotional dynamics of the 
marital relationship in an effort to maintain the security of their marriages and to elevate their 
status within marriage. Women also have been socialized to minimize the expression of their 
own thoughts, desires and feelings — especially negative ones — for fear of jeopardizing their 
marriages (Blood and Wolfe 1960; Blumberg and Coleman 1989; Thompson and Walker 
1989). By contrast, the companionate theory of marriage predicts that marriages 
characterized by an ethic. of equal regard, as well as equal access to the labor force, will have 
higher levels of male emotion work and interpersonal honesty (Gottman 1994). In such 
marriages, women should feel like they have the power to speak their minds and men should 
feel a greater responsibility to shoulder their share of the emotion work associated with 
marriage. 
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Third, egalitarian-minded men are supposed to be more open to a “counter-stereotypical” 
masculinity conducive to marital emotion work (McQuillan and Ferree 1998). Traditionally, 
masculinity has been defined in opposition to all things feminine — including the ready and 
frequent expression of emotion, affection and vulnerability, as well as attentiveness to 
relationship dynamics (Gilmore 1990). By contrast, men who identify with the ethos of 
egalitarianism should embrace a counter-stereotypical masculinity, that is, “a style of 
manliness that is not afraid to accept influence from women, to recognize and express 
emotion, and to give cognitive room to the marriage relation as such.” (McQuillan and Ferree 
1998: 223) For all these reasons, the companionate theory of marriage would predict that 
egalitarian relationships are characterized by more “interpersonal closeness, trust, 
communication and mutuality” that generate the kinds of experiences and emotional skills: 
that foster marital emotion work on the part of men (Goldscheider and Waite 1991: 4). Thus, 
the companionate model of marriage suggests the following hypotheses: 


Hypothesis 1a: Wives in egalitarian marriages will be more satisfied 
with their marriages. é 


Hypothesis 1b: Husbands in egalitarian marriages will do more positive 
marital emotion work. 


Given the strong hypothesized link between women’s marital quality and men’s marital 
emotion work, the companionate theory of marriage would predict — ceteris paribus — that 
more equal marriages are happier. Nonetheless, recent research on the link between marital 
equality and marital quality is mixed, with a number of studies finding that more traditional 
women have happier marriages (e.g., Amato and Booth 1995; Gager and Sanchez 1998; 
Sanchez, Wright, Wilson and Nock 2003; Wilkie et al. 1998; but see Amato et al. 2003). Of 
course, the lack of a clear connection between marital equality and women’s marital 
happiness may be because other institutional and cultural factors confound the association 
between egalitarianism and marital happiness for women. 

Specifically, what we call the /nstitutiona/ and equity models of marriage (see below) may 
be required to understand the companionate theory properly. These models explore the 
possibility that conservative women enjoy higher normative and communal support for their 
marriages or maintain lower expectations of marital equality, both of which may affect the 
relationship between marital equality and marital quality. Alternatively, the companionate 
model of marriage may fail to accurately describe the dynamics of contemporary marriages; 
if this is the case, another perspective (the gender model of marriage described below) may 
offer a better account of women’s marital quality and men’s emotion work. Thus, we turn to 
three additional theoretical models in an effort to explain women’s marital happiness, and to 
explore the emotion work that men devote to marriage. 


An Institutional Model of Marriage 


One reason that the companionate model may have difficulty predicting women’s marital 
happiness is that gender egalitarianism tends to be associated with lower levels of normative 
support for the institution of marriage, as well as lower levels of participation in institutions 
that provide social support to marriage, such as religious organizations (Chafetz 1995; Wilcox 
2004). Traditional women may be happier because they are more likely to endow the 
institution of marriage with high moral significance and/or be affiliated with religious 
institutions that provide social support for marriage. Such women may be better able to 
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negotiate the contemporary challenges of marriage and to retain their happiness because of 
these social and normative supports. 

Specifically, the institutional model of marriage predicts that women are happier in their 
marriages if they are strongly committed to the institution of marriage, if they are involved 
with institutions that lend social support to marriage, and if they share the normative and 
social sources of their commitment to marriage with their husbands. First, normative and 
social support for the institution of marriage may be associated with strong legitimation 
pressures. Wives who have a strong normative commitment to the institution of marriage 
may feel greater internal pressure to construct a “family myth” that they are happy 
(Hochschild 1989: 43). Such a myth would legitimize their investments in their own marriages 
and their self-imposed dependency upon their husbands. Likewise, women who are actively 
involved in a religious organization may feel greater social pressure to remain committed to 
their marriages regardless of the behavior of their husbands. Accordingly, these women may 
also more be inclined to view their marriage in a positive light to legitimize their investment 
in married life. 

A second, related point is that a high level of normative and social support for the institution 
of marriage may also promote women's marital happiness by fostering an altruistic mindset 
that makes wives less likely to continuously monitor the relationship to see if it is serving their 
individual interests. Although a growing number of Americans, influenced by the cultural logic 
of “expressive individualism” (Bellah, Madsen, Sullivan, Swidler and Tipton 1985), act as self- 
interested agents who bargain over their marital roles and interests in an effort to maximize 
their personal fulfillment (Bumpass 1990; Cherlin 2000), other Americans conceptualize their 
marriages along more institutional lines (Wilcox 2004). These Americans see marriage as a 
sacred institution in the Durkheimian sense that the relationship is accorded extraordinary 
value. Hence, the marital relationship is supposed to trump the individual interests of partners, 
calling forth virtues such as fidelity, sacrifice and mutual support (Bahr and Bahr 2001). In this 
setting, exchanges between marital partners are often conducted according to an “enchanted” 
cultural logic of gift exchange where spouses give one another gifts that vary in value, may or 
may not be reciprocated, and often have some kind of symbolic value above and beyond their 
immediate instrumental value (Bourdieu 1990: 126; Bahr and Bahr 2001; Wilcox 2004). 
Women who are deeply committed to the institution of marriage, and who identify with this 
enchanted view of marriage, are probably less likely than more individualistic women to keep 
an ongoing account of how the relationship is or is not serving their own interests. This 
willingness to avoid looking at the marriage in a self-interested fashion is probably associated 
with fewer critical evaluations of the marital relationship. This should lead to higher levels of 
marital quality for women (Brines and Joyner 1999; Wilcox 2004). 

Third, wives who share high levels of normative and social support for the institution of 
marriage with their husbands should enjoy a higher degree of marital well-being — apart from 
whether or not this shared commitment actually leads to distinctive marital behavior on the 
part of their husbands. This shared dyadic commitment appears to engender a sense of 
marital well-being in wives (and husbands). Specifically, wives who believe that their 
husbands are committed to the marriage respond with greater personal commitment to their 
marriage and express greater marital happiness (Nock 1995). This may be, in part, because 
shared commitment engenders a sense of relational security and a long-term view of the 
marital relationship, which allows women to face the ordinary stresses, challenges and 
conflicts associated with married life without worrying that these challenges pose a threat to 
the survival of their relationship (Brines and Joyner 1999; Nock 2000). In other words, a shared 
commitment to marriage fosters a spirit of trust that, in turn, makes women happier about the 
current state of and future prospects for their marriages. In sum, the institutional theory of 
marriage predicts that women will be happier in their marriages when they have higher levels 
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of normative and social support for the institution of marriage and when they share their 
commitment to marriage with their husbands. 

For similar reasons, we predict that men who adopt a more institutional approach to 
marriage will do more positive marital emotion work than those who do not. First, one of the 
central norms of modern marriage is that married people provide emotional support to one 
another. Marriage, now more than ever, is about meaningful conversation, empathy, affection 
and spending leisure time together. Nevertheless, such emotion work is often particularly 
hard for men, who have been traditionally socialized to be less expressive (Maccoby 1998; 
Thompson and Walker 1989). However, men who are committed to the institution of marriage 
should feel a greater moral responsibility than other men to signal their belief in the sanctity 
of marriage by investing emotionally in their marriages (Wilcox 2004). 

Second, men who are strongly committed to marriage are likely to have a third party with 
a vested interest in the success of their marriage. Scholars have noted that marital 
commitment is often linked to cultural norms fostered and reinforced through communal ties, 
religious participation, and extended family networks (Stolzenberg, Blair-Loy and Waite 19985). 
Specifically, men who are actively involved in religious congregations — which generally offer 
social support and control on behalf of the institution of marriage — probably face informal and 
occasionally formal pressure to make greater investments in their marriage. 

Finally, men who share a high level of normative and social commitment to the institution 
of marriage with their wives are likely to enjoy high levels of marital trust, which in turn makes 
them more likely to devote themselves to their marriage. Men who share a commitment to 
the institution of marriage with their wives are more likely to trust their spouses and to 
engender greater trust in them, in large part because of their commitment to the ideals of 
marital fidelity and permanence. This marital trust, in turn, enables men to make investments 
in their marriages without worrying about maintaining an ongoing account of goods and 
services exchanged in the marriage (Amato and Rogers 1999; Blumstein and Schwartz 1983; 
Brines and Joyner 1999). Thus, they are able to conduct their marriages according to the 
“enchanted” logic of gift-exchange in which gifts can be given even when there is no 
immediate expectation of reciprocity and, indeed, where one seeks not to keep an explicit 
accounting of the marital pattern of exchange for fear of dissipating the spirit of enchantment 
that permeates the intimate relationship (Bourdieu 1990: 126; Bahr and Bahr 2001; Wilcox 
2004). Thus, the institutional theory of marriage would predict the following: 


Hypothesis 2a: Wives who possess high levels of social and normative 
commitment to marriage will be more satisfied with their marnages, 
especially if they share this commitment with their husbands. 


Hypothesis 2b: Husbands are more likely to do positive marital emotion 
work if they have high levels of social and normative commitment to 
marriage, especially if they share this commitment with their wives. 


An Equity Model of Marriage 


With the dramatic changes in women’s labor force participation and cultural norms surrounding 
gender roles since the 1960s, along with men’s failure to take up an equal share of household 
labor, it is not surprising that the division of household labor has emerged as a crucial source of 
conflict for many contemporary marriages (Blair and Johnson 1992; Greenstein 1996; 
Hochschild 1989). Indeed, women who perceive housework arrangements as unfair are more 
likely to report lower levels of marital happiness (Blair and Johnson 1992; Greenstein 1996). 
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But objective inequality in the division of household labor does not always lead to 
perceptions of inequity, and consequently feelings of marital unhappiness, on the part of 
women. Here, equity theory notes that perceptions of justice in the division of family 
responsibilities are shaped — among other things — by the ideological commitments of women 
(DeMaris and Longmore 1996; Sanchez 1994; Sanchez and Kane 1996; Thompson 1991). 
Traditional women adhere to a gender ideology that suggests that women have a natural or 
innate orientation towards care giving and domestic labor. Thus, they are more likely to see 
housework as a feminine task, which makes them less likely to view inequalities in the 
division of household labor as unfair. This willingness to accept continuing inequalities in the 
division of household labor makes traditional women less likely to suffer marital conflict and 
lower levels of marital happiness (Blair and Johnson 1992; Greenstein 1996; Hochschild 
1989). For such wives, standards of equity are more complex than simple equality. 

Equity theory may also help us understand men’s marital emotion work and women’s 
assessments of that work. Traditional women may encourage higher levels of emotion work 
from their husbands, albeit indirectly, by making fewer demands upon them precisely 
because they do not equate equality and equity in a one-to-one fashion (Nock 2000). Chafetz 
(1995) argues that women committed to a liberal gender role ideology are more likely to be 
angered by marital inequality and to initiate marital conflict as a consequence. This conflict, in 
turn, may discourage positive emotion work on the part of men. She also argues that more 
progressive women have higher expectations of intimacy than their peers. Equity theory 
would thus predict that gender role traditionalism leads to lower female expectations of 
men's marital emotion work and to higher levels of such work. Thus, equity theory leads to 
the following hypotheses: 


Hypothesis 3a: Traditional wives have lower expectations of marital 
equality in the division of household labor and emotional work; 
consequently, they will be happier with their marriages and the marital 
emotion work they receive because they do not associate equity with 
equality. 


Hypothesis 3b: Husbands married to traditional wives will do more 
emotion work because they experience less spousal unhappiness with 
the division of domestic labor. 


The institutional and equity models of marriage may also help us better understand the 
relationship between egalitarianism and marital happiness. One of the reasons that the 
companionate model may have difficulty predicting marital happiness is that women who 
identify with egalitarian ideals of marriage may have higher expectations of equality in the 
division of household labor and marital emotion work that are not easily met. Another reason 
that the companionate model may have difficulty predicting marital happiness is that women 
who identify with egalitarian ideals of marriage may have lower levels of normative and 
communal support for their marriages. By adding factors associated with the equity and 
institutional models of marriage, we will see if the companionate theory is vindicated after 
controlling for other factors that may confound the relationship between companionate 
factors and marital quality, as well as the relationship between companionate factors and 
men’s emotion work. Specifically, we hypothesize the following: 


Hypothesis 4a: After controlling for factors associated with the 
institutional and equity models of marriage, indicators of a companionate 
marriage will be positively related to wives’ marital quality. 
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Hypothesis 4b: After controlling for factors associated with the 
institutional and equity models of marriage, marital egalitarianism will be 
positively related to husbands’ marital emotion work. 


The Gender Model of Marriage 


Finally, it is possible that the institutional and equity models of marriage do not account for the 
association between women's traditionalism and their marital happiness. It may be that the 
companionate model of marriage fails to take into account the ways in which contemporary 
women and men remain committed, even if only tacitly, to gendered marriages. 

Specifically, the gender theory of marriage suggests that men and women are considerably 
invested in “doing gender” even when they embrace an egalitarian gender role ideology (West and 
Zimmerman 1987). A range of sociocultural factors account for the plausibility of the gender 
perspective. Specifically, over the life course, women and men are socialized to embrace gender- 
typical patterns of behavior (Maccoby 1998; Thompson and Walker 1989). In turn, the dispositions 
acquired over the life course are reinforced by a range of ongoing cultural and social pressures — 
e.g., cultural conventions, gendered inequalities in the labor force, etc. (Coltrane 1989; Ferree 
1991). For these reasons, women and men face strong internal and external pressures to produce 
gender in their marriages (Atkinson and Boles 1984; Berk 1985; Greenstein 2000). 

Thus, women may be happier in marriages where they are able to successfully produce 
gender. Likewise, men who are married to more traditional wives may be happier and, 
accordingly, more likely to give their wives positive emotion work (Amato and Booth 1995; 
Nock 1998; Wilkie et al. 1998). Contrary to the expectations of the companionate model of 
marriage, the gender model of marriage would predict that marriages that are strongly 
gendered make women happier and make men more likely to engage in marital emotion 
work. If controls for institutional and equity factors do not vindicate the companionate model 
of marriage, we would have some evidence for the gender theory of marriage. Thus, our 
hypotheses derived from the gender model run as follows: 


Hypothesis 5a: Wives will be happier in marriages characterized by 
gender-typical practices. 


Hypothesis 5b: Husbands will be more inclined to do positive 
emotion work in marriages characterized by gender-typical practices. 


Data and Methods 


We analyze data drawn from the second wave of the National Survey of Families and 
Households (NSFH2 [1992-1994]), a nationally representative sample of adults age 23 and 
older. We use data from NSFH2 because the first wave of the survey (NSFH1 [1987-1988]) 
does not have as many measures of positive marital emotion work. NSFH2 re-interviewed 
10,008 of NSFH1's 13,007 original respondents for a response rate of approximately 77 
percent. NSFH2 also interviewed current spouses of its original respondents who were 
married in the second wave. Response rates averaged slightly more than 80 percent for the 
spouses of married respondents (Bumpass and Sweet 1995; Sweet, Bumpass and Call 
1988).? For our analyses, we rely on a subsample of 5,010 couples drawn from NSFH2. The 
statistics and analyses used for this study are based on weighted data, adjusted for 
oversamples of African-Americans and Hispanics and for attrition from NSFH1 to NSFH2. 


1328 © Social Forces Volume 84, Number 3 * March 2006 


Table 1: Four Theories of Marital Functioning 


eee 
A Companionate Model An Equity Model 


Egalitarian patterns of work, earning, and 
housework in marriages foster greater 
emotional intimacy and more emotion work 
on the part of husbands. They do so by 
creating marital role homogamy, and by 
eliminating patriarchal patterns of power and 
authority, thereby allowing husbands to take 
a progressive approach to their marriage that 
includes more emotion work on their part. 


Hypothesis 1a: Wives in egalitarian 
marriages will be more satisfied with their 
marriages. 


Hypothesis 1b: Husbands in egalitarian 
marriages will do more positive marital 
emotion work. 


Hypothesis 4a: After controlling for factors 
associated with the institutional and equity 
models of marriage, indicators of a 
companionate marriage will be positively 
related to wives’ marital quality. 


Hypothesis 4b: After controlling for factors 
associated with the institutional and equity 
models of marriage, marital egalitarianism will 
be positively related to husbands’ marital 
emotion work. 


An Institutional Model 


Husbands and wives who are integrated into 
institutions that endorse marria eee 
churches) and who share a high commitment 
to the institution will construct a “family myth” 
that they are happy with their marriage. This 
commitment will also make spouses more 
likely to trust one another and to adopt a 
long-term view of their relationship; 
accordingly, they are more likely to adopt a 
logic of gift exchange, rather than market 
exchange, in their marriages and to 
experience the benefits this logic accords 
marriage. 


Hypothesis 2a: Wives who possess high 
levels of social and normative commitment to 
marriage will be more satisfied with their 
marriages, especially if they share this 
commitment with their husbands. 


Hypothesis 2b: Husbands are more likely to 
do positive marital emotion work if they have 
high levels of social and normative 
commitment to marriage, especially if they 
share this commitment with their wives. 


Wives who perceive housework 
arrangements as unfair are more likely to 
report lower levels of marital happiness. But 
wives who hold traditional gender attitudes 
expect less equality in the division of 
housework and less emotion work from their 
husbands. Consequently, traditional wives 
are less likely than their more progressive 
eers to view the division of household 
abor as unfair, or their husband’s emotion 
work as inadequate. 


Hypothesis 3a: Traditional wives have lower 
expectations of marital equality in the 
division of household labor and emotional 
work; consequently, they will be happier 
with their marriages, and the marital 
emotion work they receive, because they do 
not associate equity with equality. 


Hypothesis 3b: Husbands married to 
traditional wives will do more emotion work 
because they experience less conflict 
related to spousal unhappiness with the 
division of domestic labor. 


A Gender Model 


Because wives — even wives with 
egalitarian attitudes — have been socialized 
to value gender-typical patterns of behavior, 
wives will be happier in marriages with 
ender-typical practices in the division of 
ousehold labor, work outside the home, 
and earnings. Because husbands — even 
husbands with egalitarian attitudes — have 
been socialized to value gender-typical 
atterns of behavior, husbands will be 
appier in marriages that produce gender- 
typical patterns and will be more inclined to 
invest themselves emotionally in their 
marriages than husbands organized along 
more egalitarian lines. 


Hypothesis 5a: Wives will be happier in 
marriages characterized by gender-typical 
practices. 


Hypothesis 5b: Husbands will be more 
inclined to do positive emotion work in 
marriages characterized by gender-typical 
practices. 
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Dependent Variables 


We focus on three different dependent variables: women’s marital happiness, women’s 
satisfaction with the love/affection and understanding they receive from their husbands, and 
the quality time men devote to intimate interaction with their wives. To measure marital 
happiness, the NSFH2 asked married respondents: “Taking things all together, how would 
you describe your marriage?” Responses ranged from 1 “very unhappy” to 7 “very happy.” 
This serves as our measure of women’s marital happiness. 

We rely on one indirect measure and one direct measure to tap the positive emotion work 
that men do in their marriages. The first dependent variable that taps men’s positive emotion 
work — husbands’ love/affection and understanding — is taken from wives’ reports of their 
happiness with such emotion work. Specifically, wives were asked how happy they were with 
the “love and affection you get from your spouse” and with the “understanding you receive from 
your spouse.” Responses ranged from 1 “unhappy” to 7 “very happy” (we use the average of 
the two responses, Cronbach's alpha = .898). Here, we rely upon spousal reports because the 
NSFH2 did not ask respondents directly about the marital emotion work they performed. There 
are advantages and disadvantages to this empirical strategy. On the one hand, these measures 
do not suffer from self-report bias since they are derived from the partners of the men under 
study. On the other hand, wives’ happiness with different dimensions of their husbands’ 
emotion work may have more to do with their own standards for spousal emotion work than 
with the actual amount of marital emotion work that their husbands are doing. This is an 
especially important limitation in light of equity theory's prediction that ideological commitments 
may cause wives to lower their emotional standards. Accordingly, the results from these indirect 
measures of men’s marital emotion work must be viewed in a more tentative light than results 
derived from direct reports of marital emotion work. 

The third dependent variable of interest - a measure of the quality time a husband spends 
with his wife — is provided by husbands who were asked, in the past month, “about how often 
did you and your wife spend time alone with each other, talking, or sharing an activity.” 
Responses ranged from 1 “never” to § “almost every day.” This measure taps the extent to 
which a husband devotes himself to spending time on emotion work associated with the 
marital relationship. We consider this a measure of re/ationship quality time. 


Independent Variables 


To test the companionate model of marriage, we rely on eight variables that measure marital 
egalitarianism in belief and practice. First, we constructed a husbands’ gender egalitarianism 
scale using four attitudinal measures (Cronbach's alpha = .575). Specifically, respondents 
were asked if they approved or disapproved of “mothers who work fulltime when their 
youngest child is under age 5.” Respondents were also asked their opinion of the following 
three items: “Preschool children are likely to suffer if their mother is employed;” “It is much 
better for everyone if the man earns the main living and the woman takes care of the home 
and family;” and, “If a husband and wife both work fulltime, they should share housework 
tasks equally.” Responses to these items were averaged and coded from 1 to 5, with higher 
scores indicating higher levels of gender role liberalism. We also constructed a measure of 
wives’ gender egalitarianism using the same measures and techniques (Cronbach's alpha = 
600). Third, we created two dummy variables to measure wives’ labor force participation. 
Female respondents working 40 hours a week or more were coded as wives working fulltime. 
Women working 1 to 39 hours a week were coded as wives working parttime (women who 
do not work for pay are the reference category). Fourth, we measured equality in 
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breadwinning by creating two dummy variables. The first is for wives who earn between 33 
percent and 65 percent of household income, and the second for wives who earn 66 percent 
or more of household income (the deleted reference group is wives who produce less than 
a third of household income). Finally, we measured equality in the division of household labor 
by creating one dummy variable for husbands who do between 33 percent and 65 percent 
of household labor, and another for husbands who do 66 percent or more of household labor. 
We also use the measures of gendered marital practice to assess the utility of the gender 
model of marriage. 

To test the component of the institutional mode! focusing on shared normative 
commitments to marriage, we created two scales of marital commitment based on the 
respondent's agreement (from 1 “strongly disagree” to 5 “strongly agree”) with the following 
five items, which emphasize that marriage is the ideal site for sexual activity and childrearing, 
and the importance of marital fidelity: “It is all right for an unmarried couple to live together 
even if they have no interest in marriage;” “It is all right for unmarried 18 year olds to have 
sexual relations if they have strong affection for each other;” “It is all right for a couple with an 
unhappy marriage to get a divorce when their youngest child is under age 5;" “When a 
marriage is troubled and unhappy, it is generally better for the children if the couple stays 
together;” and, “Marriage is a lifetime relationship and should never be ended except under 
extreme circumstances.” Where necessary, items were reverse coded so that higher scores 
reflected greater normative commitment to marriage. The five-item scale for men had a 
Cronbach's alpha of .654 and the five-item scale for women had a Cronbach's alpha of .659. 

Respondents who scored in the top quartile of this scale were coded as having a high level 
of commitment to the institution of marriage. Respondents were then split into four groups: 
couples who shared a high level of commitment to the institution of marriage, only the 
husband was highly committed to marriage, only the wife was highly committed to marriage, 
or neither spouse was highly committed to marriage (the comparison category). 

Another dimension of the institutional model of marriage is the extent to which individuals 
are integrated into social institutions that accord marriage normative importance, which we 
tap through three measures related to church attendance. Respondents were asked how 
often they attended church, from never to more than once a week (coded 0 to 8). 
Respondents were split into four groups: couples who attended church weekly together, only 
the husband attended weekly, only the wife attended weekly, or neither spouse attended 
weekly (the comparison category). 

To test the equity model of marriage, we created three dummy variables that measure 
wives’ sense of fairness with the division of household labor. Wives were asked: “How do you 
feel about the fairness in your relationship in [household chores]?” Responses ranged from 1 
“very unfair to me” to 5 “very unfair to him.” Women who reported any unfairness in 
housework to them were given a dummy value. Likewise, women who reported any 
unfairness in housework to their husbands were also given a dummy value. We also included 
a dummy variable for wives who refused to answer the question. Though clearly an imperfect 
measure, this variable is intended to tap the tendency to avoid “keeping the books” on 
matters of who does how much in the home. The omitted category is wives who reported 
that housework was “fair to both.” 

We also controlled for a number of different variables that might otherwise confound the 
relationship between our independent variables of interest and women’s marital quality as 
well as men’s emotion work. Specifically, we control for the following variables that are known 
to influence marital quality and dyadic relations: age of respondent and his wife, /ength of 
respondent's marriage (in years), the number of preschool and school-age children in the 
home, the emp/oyment status of the respondent, whether the husband or his wife have ever 
been divorced, the couple’s income ($10,000s, logged), the education of wife and husband 
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(from high school dropout to graduate school, coded from 1 to 6), race/ethnicity of the 
respondent (separate dummy variables for black and Hispanic with White/Anglo/Other the 
reference category), and the region of the couple (separate dummy variables for South, 
Northeast and North Central with West the reference category). 

For our statistical techniques, we rely upon a series of ordinary least square (OLS) 
regression models. The first model we test in each set of regressions incorporates measures 
designed to test the validity of the companionate model of marriage. The second model in our 
regressions adds measures designed to test the validity of the institutional model of marriage. 
The third model incorporates variables that assess the utility of the equity model of marriage. 
The third model also allows us to test the gender model of marriage by seeing if gender- 
typical patterns of marital practice predict higher levels of women’s marital quality and men’s 
emotion work even after controlling for institutional and equity factors. Table 3 incorporates 
a fourth model that includes measures of men’s emotion work, which is designed to test the 
relative predictive power of men’s marital emotion work, and women’s assessments of that 
work, compared to other sociocultural factors. All equations were estimated with robust 
(Huber/White sandwich) standard errors that relax the formal assumption of independence of 
observations. Standard errors calculated in OLS will otherwise tend to be underestimated. 
Robust standard errors are suitable for the NSFH given that it relied on a cluster sampling and 
requires weighting. 


Results 


Table 2 reports means and standard deviations for women’s marital quality, men’s marital 
emotion work, and a range of independent variables. The data are weighted after applying 
sampling weights supplied by NSFH. These weights were normalized (making them sum to 
the original sample). 

To test the five hypotheses regarding women’s marital quality, we ran four regression 
models for the subset of married couples in NSFH2. Table 3, which focuses on wives’ marital 
happiness, provides support for Hypotheses 2a, 3a and 5a, but no support for 1a or 4a. Model 1 
indicates that wives who hold egalitarian gender attitudes, who work parttime, and who take a 
larger share of the family breadwinning responsibilities are less happy. Thus, none of the effects 
in Model 1 are in keeping with the companionate model's expectation that egalitarianism in 
practice and belief is associated with higher levels of women’s marital quality. 

Model 2, which adds measures of cultural and social support for the institution of marriage, 
reveals evidence in favor of the institutional model of marriage. Women who share high levels 
of church attendance and normative commitment with their husbands are happier than their 
peers. Wives’ greater happiness when their husbands share their own commitment to marriage 
may indicate that shared commitment promotes trust and a sense of relational security. 

Model 3 provides support for the equity model of marriage insofar as women who report 
that the division of household labor is unfair are significantly less happy in their marriages than 
women who report that the division of household labor is fair. Moreover, Model 3 further 
reduces the effect of gender role ideology; it also reduces the effect of part-time work to 
insignificance. However, it increases the negative effect of male household labor. This 
suggests that women who are not concerned about the fairness of household labor are less 
happy when their husbands do more housework. 

Taken together, Models 2 and 3 provide some support for the notion that one reason that 
the companionate model has not gathered much empirical support is that marital 
egalitarianism is also associated with lower levels of institutional commitment to marriage 
and with higher standards of equality, both of which seem to diminish women's chances of 
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marital happiness. Nevertheless, even after controlling for institutional and equity factors, we 
still find no positive evidence for the companionate theory of marriage. 

Indeed, Models 3 and 4 provide some support for the gender model of marriage insofar as 
women who earn a greater-than-average percentage of couple income — (potentially a marker 
of a husband's lack of success as a breadwinner) — and whose husbands take up a greater 
share of household labor report greater unhappiness. In other words, consistent with 
Hypothesis 5a, women who live in marriages characterized by less gendered patterns of 
earning and housework are less happy in their marriages. 

Model 4 indicates that men’s positive marital emotion work, and women’s satisfaction with 
that work, are significant predictors of women’s marital quality. In fact, model fit increases 
notably with the addition of the two measures of emotion work, from an adjusted R-squared of 
078 in Model 3 to an adjusted R-squared of .533 in Model 4. Wives’ marital happiness and their 
happiness with the affection and understanding they receive from their husbands are very 
strongly related, with one sharing half the variance of the other: r = .731. The correlation 
between wives’ marital happiness and men’s quality time is weaker: r = .212. This means that 
women’s assessments of men’s love, affection and understanding (and to a lesser degree, 
men’s quality time) is by far the most powerful predictor of women’s marital quality in our 
models, which include a range of potential factors that might influence women’s marital quality.“ 
The emotion work of husbands outweighs any effect of gender traditionalism, household labor 
and commitment to marriage. Note also that this emotion work appears to mediate the effects 
of many of the variables associated with the gender, institutional, and equity models of marriage. 

The finding that men’s marital emotion work (and/or wives perceptions of it) plays a crucial 
role in influencing women’s marital quality, and may mediate some of the effects associated 
with the gender, institutional and equity variables, motivates our interest in two measures of 
men's positive emotion work in marriage: wives’ happiness with the love, affection and 
understanding they receive from their husbands, and the one-on-one quality time that men 
devote to their wives. 

Table 4, which explores women’s happiness with the love, affection and understanding they 
receive from their husbands, provides strong support for Hypotheses 2b and 3b, mixed support 
for Hypotheses 1b and 5b, but no support for Hypothesis 4b. Contrary to the expectations of the 
companionate theory, Model 1 of Table 4 indicates that women’s gender role liberalism and 
women's labor force participation are associated with lower levels of women’s happiness with 
the affection and understanding they receive from their husbands. Only husbands’ contributions 
to household labor are positively associated with wife's happiness in this marital domain. 

Here again, however, we are interested in testing how the /nstitutional and equity models of 
marriage may account for the surprising failure of factors associated with the companionate 
marriage to predict higher wife reports of men's marital emotion work. Models 2 and 3 in Table 
4 are consistent with the /nstitutional model, suggesting that wives who share church 
attendance with their husbands and a personal commitment to the institution of marriage are 
more likely to report greater happiness with their husband's affection and understanding. The 
institutional effects reduce the effect of wives’ gender egalitarianism to non-significance. This 
means that one of the reasons that traditional women are happier with their husbands’ emotion 
work is that they tend to share with them higher levels of commitment to the institution of 
marriage. Moreover, the positive effect of men’s household labor is reduced to insignificance; 
suggesting that some wives report greater happiness with their husbands’ affection and 
understanding when their husbands do more household labor because this may signal some 
degree of commitment to marriage. 

Model 3, which adds measures of wife reports of the fairness of the division of household 
labor, indicates that perceptions of fairness in the division of household labor also play a key 
role in influencing wives’ reports of happiness with their husband's affection and 
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Table 2: Descriptive Statistics for Married Men (1992-1994) 





Mean S.D 
Sociodemographic Factors 
Wife's age 46.08 14.39 
Husband's age 49.04 15.07 
Length of marriage 23,54 15.29 
Preschool children 23 55 
School-age children 73 1.06 
Husband works fulltime 67 AT 
Husband works parttime .06 24 
Husband ever divorced AT 38 
Wife ever divorced 16 36 
Couple income (LN) 10.48 1.51 
Wife's education 2.61 2.76 
Husband's education 2.74 3.29 
Black 07 .26 
Hispanic .07 4) 
South 30 48 
Northeast .20 40 
North-central 24 43 
West 21 At 
Companionate/Gender Factors 
Husband's gender role egalitarianism 1.90 10 
Wife's gender role egalitarianism 2.10 14 
Wife works fulltime oe AT 
Wife works parttime 21 At 
Wife earns 33-65% of couple income 36 48 
Wife earns 66%+ of couple income .08 27 
Husband does 33-65% of housework 44 50 
Husband does 66%+ of housework 10 31 
Institutional Factors 
Spouses share weekly attendance .26 44 
Only husband attends weekly .06 24 
Only wife attends weekly 11 32 
Spouses share normative commitment to marriage 15 139 
Only husband has high commitment to marriage p12 30 
Only wife has high commitment to marriage ae) 31 
Equity Factors 
Wife doesn't report household labor fairness .03 16 
Wife considers household labor unfair to her 30 46 
Wife considers household labor unfair to husband .05 22 
Dependent Variables 
Wifes' marital happiness 5.97 133 
Wifes' happiness w/ affection & understanding 5.55 1.52 
Husbands’ quality time 4.78 1.38 


N (Minimum, Maximum Unweighted N) 4603 5010 
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Table 4: OLS Coefficients for Regression of Wives’ Happiness with Husbands’ Affection & 
Understanding (1992-1994) 


1 2 3 
Sociodemographic Factors 
Wife's age .00 -.01 .00 
Husband's age 01 01 01 
Length of marriage = Vane -.01 * “tite 
Preschool children =10' * =12)* -.09 * 
School-age children ao" 14" “Agee 
Husband works fulltime .03 -.05 .00 
Husband works parttime .08 .07 4 
Husband ever divorced Ree a tne ea oo 
Wife ever divorced 01 .03 .00 
Couple income (LN) -.01 -.01 -.02 
Wife's education .02 01 01 
Husband's education O6es 05°" OCee- 
Black 00 8 “357 20, 
Hispanic -.19 -.16 -.13 
South -.13 = 15°." -.10 
Northeast Sob -.18 * -.08 
North-central -.11 -.14 -.06 
Companionate/Gender Factors 
Husband's gender role egalitarianism .00 -.05 -.03 
Wife's gender role egalitarianism = Oui -.06 -.04 
Wife works fulltime aah ven fee G -.16 * 
Wife works parttime eo «21 -19 ™ 
Wife earns 33-65% of couple income -.02 -.03 .05 
Wife earns 66%+ of couple income -.14 -.14 -.11 
Husband does 33-65% of housework .05 .03 -.08 
Husband does 66%+ of housework a 9 .03 
Institutional Factors 
Spouses share weekly attendance 14. ane 
Only husband attends weekly -.12 -.11 
Only wife attends weekly -.01 .02 
Spouses share normative commitment to marriage 0 aa See 
Only husband has high commitment to marriage 10 .07 
Only wife has high commitment to marriage .08 05 
Equity Factors 
Wife doesn't report household labor fairness 236" 
Wife considers household labor unfair to her 403.1." 
Wife considers household labor unfair to husband a2e 0 
(Constant) 6.71 6.05 6.33 
Adjusted R? 03 .04 “15 
N (unweighted) 5010 5010 5010 


p<.05 *p<.0l1 **p<.001 
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understanding. Women who report any unfairness, or especially who fail to respond to the 
question regarding housework fairness, are significantly more likely to be unhappy with the 
emotion work they receive from their husbands. Model 3 also shows that wives’ labor force 
participation continues to predict lower levels of happiness in this domain, even after 
controlling for a range of sociocultural factors. This provides evidence for the gender 
perspective insofar as it shows that increasing commitments to paid labor (for wives) are likely 
to result in more critical assessments of husband's emotion work. In sum, Model 3 shows 
that variables associated with the institutional, equity, and gender models have a robust effect 
on women’s happiness with the love, affection and understanding they receive from their 
husbands. But we find little evidence in support of the companionate theory of marriage. 

These findings documented in Table 3 could mean one of two things. Women’s happiness 
in this domain may represent actual behavior on the part of men. In this case, that would 
mean that men who share their wives’ commitment to the institution of marriage, whose 
wives perceive the division of household labor as fair, and whose wives do not work are more 
likely to be affectionate and empathetic towards their wives. But women’s happiness with this 
type of positive marital emotion work may also-be an artifact of lower expectations of such 
emotion work. In that case, women who share a high level of normative commitment to the 
institution of marriage with their husbands, who are more likely to see the division of 
household labor as fair, and who do not work outside the home have lower standards of 
emotional intimacy, which makes them happier with the love and affection they receive from 
their husbands even if their husbands do not do more of this emotion work than other men. 

Table 5, which focuses on men’s reports of sharing quality time with their wives, provides 
support for Hypotheses 3b and 5b, and no support for Hypotheses 1b, 2b, and 4b. Though we 
find no support for the companionate model, Models 1 through 3 provide evidence for the 
gender theory of marriage. Men who are married to more traditional-minded women and to 
homemakers (the reference group) are more likely to devote themselves to spending quality 
time with their wives (the coefficient for full-time work is negative, but significant at only .10). 

Model 2 provides no support for the institutional model of marriage insofar as it relates to 
religious expression. Men who are not weekly attenders, but whose wives are, do less of this 
type of marital emotion work. Nor is there evidence that normative commitment to the 
institution of marriage is important. This suggests that Model 2 is tapping an absence of an 
important type of marital homogamy, which is known to be associated with higher levels of 
intimacy, more than it is tapping marital commitment. 

But the most striking finding in this table is found in Model 3, where women's perceptions 
of equity in the division of household labor are strongly related to men’s reports of quality time. 
This finding is striking because we would expect to find that the equity model helps to account 
for women’s assessments of men’s emotion work, as it does in Table 3, but not the actual 
emotion work that men do. Specifically, men who are married to women who report unfairness 
in the division of household labor spend less quality time with their wives than men whose 
wives report that the housework is “fair to both” husband and wife. This finding suggests one 
of two conclusions: first, women who are not happy with the division of household labor — and 
probably the level of equality in their marriages more generally — may enter into conflict with 
their husbands or disengage emotionally from their marriages in ways that suppress their 
husbands’ emotion work; alternatively, women who are not happy with the division of 
household labor may have higher expectations of marital equality and male emotion work that 
lead them to enter into conflict with their husbands or to disengage emotionally from their 
marriages in ways that cause husbands to decrease the time they spend with their wives. In 
any case, the consistent predictive power of equity across both measures of men’s marital 
emotion work strongly suggests that perceptions of housework fairness, and marital equality 
in general, are intimately bound up with the health of “her” marriage. 
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Table 5: OLS Coefficients for Regression of Husbands’ Quality Time (1992-1994) 


1 2 3 
Sociodemographic Factors 
Wife's age .00 01 01 
Husband's age Te ie Dio Ot 
Length of marriage = 7 taree a ae Ore 
Preschool children =34 2 woe “oo 
School-age children 276 1S ooo ae ae tare 
Husband works fulltime mie 1 ee Beit “20077 
Husband works parttime .06 .06 07 
Husb. ever divorced ate em ie -.16 * 
Wife ever divorced .08 .09 .09 
Couple income (LN) -.03 -.03 -.03 
Wife's education 01 01 .02 
Husband's education ou" 090 09 28S 
Black oie 4a “ad Tut 
Hispanic eae Outs Dee 
South -.08 -.09 -.07 
Northeast =4155* “15 \* -.13 
North-central -.01 -.01 01 
Companionate/Gender Factors 
Husband's gender role egalitarianism .00 -.01 -.01 
Wife's gender role egalitarianism -.07 * -.08 * -.07 * 
Wife works fulltime -.10 -.10 -.09 
Wife works parttime AD mel as -.11 
Wife earns 33-65% of couple income .03 .04 .05 
Wife earns 66%+ of couple income -.03 -.02 -.01 
Husband does 33-65% of housework .04 .04 01 
Husband does 66%+ of housework 01 01 -.02 
Institutional Factors 
Spouses share weekly attendance 01 .00 
Only husband attends weekly -.08 -.08 
Only wife attends weekly lO SFO 
Spouses share normative commitment to marriage -.04 -.03 
Only husband has high commitment to marriage 11 le 
Only wife has high commitment to marriage .08 .08 
Equity Factors 
Wife doesn't report household labor fairness -.19 
Wife considers household labor unfair to her ee 
Wife considers household labor unfair to husband ee 
(Constant) 4.48 4.48 4.48 
Adjusted R? 15 15 16 
N (unweighted 5010 5010 5010 


‘pe. 0S p< Ol pie ON 
Note: Models estimated with robust standard errors. 
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Table 5 is also significant because it provides additional evidence for the gender model of 
marriage and against the companionate model of marriage. Models 1 through 3 indicate that 
no measure of egalitarianism in practice or belief is associated with higher levels of men 
spending quality time with their wives. Indeed, in keeping with the gender model of marriage, 
wives’ gender egalitarianism and work outside of the home leads to less positive emotion 
work on the part of husbands. 

Overall, then, Tables 4 and 5 indicate that companionate marriages are not characterized 
by higher levels of men’s positive emotion work nor by higher levels of women’s happiness 
with the affection and understanding they receive from their husbands. Even though the 
institutional and equity models do reduce the negative associations between companionate 
factors and husbands’ emotion work, they do not entirely eliminate these associations. Thus, 
consistent with the gender model of marriage, it would appear that women who are in 
marriages that are characterized by more traditional gender beliefs and practices are happier 
with the emotion work they receive and do receive more such emotion work from their 
husbands. Table 4 also indicates that women who share a high commitment to marriage with 
their partners are more inclined to view their husbands’ emotion work through a rose-colored 
lens. Finally, consistent with the equity model of marriage, Tables 4 and 5 suggest that 
women who are not concerned with the equality of the division of tasks or who think they are 
fair are happier with the positive emotion work they receive and get more of that emotion 
work from their husbands. 


Discussion and Conclusion 


Recent work indicates that marital quality declined over much of the past four decades, 
although the rate of decline seems to have leveled off in the 1990s (Glenn 1991; Rogers and 
Amato 1997; Popenoe and Whitehead 2004). The research to date suggests that this 
development may be, in part, a product of the fact that women with increasingly egalitarian 
gender role attitudes are married to men who have not adopted a sufficiently egalitarian 
approach to marriage (Amato and Booth 1995; Chafetz 1995). Insofar as we find strong 
support for the equity model, this study suggests that part of the decline in marital quality is 
indeed related to the continuing mismatch between women’s attitudes and marital equality. 
Moreover, we saw evidence that women who are more egalitarian-minded and more upset 
with the division of household labor receive lower levels of positive emotion work from their 
husbands, perhaps because they are more likely to initiate conflict with their husbands. Thus, 
rising expectations among women for marital equality may also have the unintended effect of 
lowering investments in marital emotion work on the part of men; this, in turn, may be 
associated with declines in marital quality for American women. 

But the findings in this study also lend support to institutional and gender accounts of this 
decline in marital quality. From an institutional perspective, we find that shared church 
attendance and normative support for the institution of marriage are associated with higher 
levels of women’s marital happiness. Thus, declines in religious attendance over the past four 
decades (Steensland, Park, Regnerus, Robinson, Wilcox and Woodberry 2000), along with the 
liberalization of attitudes to divorce and extramarital sex (Thornton and Young-DeMarco 2001), 
may also account for recent shifts in marital quality insofar as they reduce the social and 
normative supports that foster higher investments in marriage. From a gender perspective, our 
findings suggest that increased departures from a male-breadwinning/female-homemaking 
model may also account for declines in marital quality, insofar as men and women continue 
to tacitly value gendered patterns of behavior in marriage. Specifically, we find that the 
gendered character of marriage seems to remain sufficiently powerful as a tacit ideal among 
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women to impact women's marital quality even apart from the effects of the continuing 
mismatch between female gender role attitudes and male practices. Of course, this is also 
indicative of the fact that we find virtually no evidence for the companionate model, since 
women are not happier in marriages marked by egalitarian practices and beliefs. (For a 
summary of our findings, see Table 6.) 

Our findings also speak to the role of emotion work in women’s global marital quality. First, 
it is important to highlight our finding, judging by the dramatic increase in model fit, that men’s 
emotion work (and women's assessments of that work) is the most crucial determinant of 
women's marital quality. It is more important than patterns of household labor, perceptions of 
housework equity, female labor force participation, childbearing, education and a host of other 
traditional predictors of global marital quality. This finding suggests that the functions, character, 
and stability of contemporary marriages are intimately tied to their emotional well-being. 

We find little evidence in support of the companionate model of marriage when it comes 
to men’s emotion work. For the most part, marriages that are more egalitarian in belief and 
practice are not marked by higher levels of men’s positive emotion work or by women’s 
happiness with such emotion work. Instead, we find modest evidence that wives’ gender 
traditionalism is independently related to higher levels of men's positive emotion work in 
marriage. We also find evidence that homemaking wives report greater happiness with their 
husband's emotion work, and may be more likely to receive such work from their husbands. 
In other words, adherence to traditional beliefs and practices regarding gender seems to be 
tied not only to global marital happiness but also — surprisingly enough — to expressive 
patterns of marriage. 

We also find evidence for the institutional model of marriage, which stresses the 
importance of social and normative support for marriage. Wives who share high levels of 
church attendance are more likely to report happiness with their husband's emotion work in 
marriage. Moreover, wives who share a normative commitment to marriage with their 
husbands are more likely to report happiness with the emotion work done by their husbands, 
probably because they seek to legitimate their own investment in married life. Thus, socially- 
conservative practices and (possibly) beliefs appear to be linked to lower expectations of 
marital emotion work on the part of women. But it is also possible that they are associated 
with more expressive marriages. In any case, women who share a commitment to the 
institution of marriage with their husbands express greater happiness with the expressive 
state of their relationship. 

This study also demonstrates that women who are not happy with the fairness of the 
division of household labor are less satisfied with their husband's positive emotion work and 
less likely to receive such emotion work. We suspect that higher expectations of intimacy and 
equality among women, especially more egalitarian-minded women, have led them to view 
their husbands’ emotion work more critically; we also suspect that these expectations have 
increased marital conflict and — in turn — dampened men’s marital emotion work (Chafetz 
1995; Hochschild 1989). To repeat Chafetz (1995: 78): “The typical marriage has been 
increasingly reconceptualized by many women as short on intimacy and equality and 
therefore unacceptable.” Thus, the irony here is that — at least over the short term — the 
increased popularity of companionate ideals of marriage seems to have contributed to a 
decrease in the prevalence of the companionate marriage in practice. 

Future research should seek to reconcile our findings regarding the association between 
gender egalitarianism and women’s marital quality with other research that comes to different 
conclusions about the nature of that association (Amato et al. 2003). New research should 
also explore whether women’s expectations about marital equality are indeed linked to marital 
conflict and, in turn, lower levels of men’s emotion work. Future research should also do more 
to distinguish different types of married women, insofar as women with distinct orientations 
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Table 6: Summary of Significant Associations Between Independent Variables and Wives’ Marital 
Quality (1992-1994) 





Associations 


Marital Affection/ Quality 
Happiness Understanding Time 


Companionate/Gender Factors 


Husband's gender role egalitarianism ns ns ns 
Wife's gender role egalitarianism - - - 

Wife works fulltime ns - ns 
Wife works parttime - - - 

Wife earns 33-65% of couple income - ns ns 
Wife earns 66%+ of couple income - ns ns 
Husband does 33-65% of housework ns ns ns 
Husband does 66%+ of housework ns rr ns 


Institutional Factors 


Spouses share weekly attendance + + ns 
Only husband attends weekly ns ns ns 
Only wife attends weekly ns ns - 

Spouses share normative commitment to marriage + + ns 
Only husband has high commitment to marriage ns ns ns 
Only wife has high commitment to marriage ns ns ns 
Equity Factors 

Wife doesn't report household labor fairness ns - ns 
Wife considers household labor unfair to her - - - 

Wife considers household labor unfair to husband ns - - 


Emotion Work 
Wives' happiness w/ affection and understanding it 
Husband's quality time + 





to the worlds of work and family may be influenced differently by the factors associated with 
the companionate, institutional, equity, and gender models (Hakim 2001). Finally, future 
research should focus explicitly on the direction of causality. Some of the association 
between women’s marital quality and shared religious attendance we found might reflect the 
fact that happily married women are more likely to attend church (Booth, Johnson, Branaman 
and Sica 1995). Longitudinal data should seek to determine how much of the association 
between women’s marital quality and our institutional, equity and gender factors is 
attributable to these factors and how much of it is attributable to the effect that high-quality 
marriages have on women’s religious attendance, normative commitment to marriage, sense 
of fairness and willingness to engage in traditional gender practices. 

In conclusion, our results suggest that the road to successful “new families” (Goldscheider 
and Waite 1991) is more circuitous and difficult than originally thought. While it is true that 
changes in men’s behavior are required for this transformation (Goldscheider and Waite 
1991), it also appears that contemporary couples could benefit from a heightened 


1342 ¢ Social Forces Volume 84, Number 3 * March 2006 


appreciation of the role that shared religious practice and normative commitments to 
marriage play in supporting women’s marital quality and the expressive dimension of marital 
life. Our results also suggest that more traditional beliefs and practices regarding gender play 
a positive role in the quality and expressive character of many women’s marriages, even apart 
from the dramatic shifts in gender role ideology in the last few decades. At least for many 
American women, this study indicates that “her” marriage is most happy when it combines 
elements of the new and old. 


Notes 


1. Our use of the term companionate is to be distinguished from Burgess’ use of the 
concept in that we link a contemporary companionate theory of marriage to gender role 
egalitarianism (Burgess, Locke and Thomes 1963). 


2. The primary concern with using NSFH2 is that some of the couples that were married in 
NSFH1 are divorced at NSFH2. Since divorce is, in all likelihood, negatively related to 
women’s marital happiness and men’s marital emotion work, we suspect that our sample 
under-represents couples that had lower levels of marital quality and male marital emotion 
work at NSFH1. Accordingly, we may have fewer observations of our dependent variables 
at lower values than we would otherwise observe, and reduced variance. This suggests 
that the size of any sociocultural effects in NSFH2 may be smaller than they would have 
been at NSFH1. Such bias, even with adequate power, would tend to work against 
discovering statistically significant relationships. Following standard practice, we do not 
interpret a lack of statistical significance to imply an absence of an effect. 


3. We include region as a control variable because religious attendance and gender 
egalitarianism are both strongly associated with region. 


4. We also analyzed the same set of factors for men’s marital happiness (tables available 
upon request). We found that a model that includes all these factors, including women's 
emotion work, has an R-squared of .441. This suggests that wives’ emotion work, and 
husbands’ happiness with it, is about 20 percent less important for men’s global marital 
happiness than it is for women’s global marital happiness. 
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Abstract 

The tendency of members of many ethno-religious groups to marry within their group has 
been considered evidence for the persistent role of ascription in modern society. What ts the 
role of the family of origin in this process? To answer this question, we study the marriage 
choices of Jews in the Netherlands, using a unique dataset and a novel analytical approach 
(i.e., multilevel analyses of sibling-data). Our models show that almost a third of the variation 
in Jewish endogamy can be attributed to a common family factor. Measured indicators of 
family background point to two underlying mechanisms: the intergenerational transmission 
of ethnic identities and the intergenerational provision of endogamous meeting and mating 
opportunities. Together, these mechanisms explain 75 percent of the total family influence. 


The degree to which members of ethnic and religious groups marry within rather than outside 
their group is often viewed as an indicator of the strength of ascription in society. Studies on 
marriage choice have shown that most forms of ethnic and religious intermarriage have become 
more common over time (Kalmijn 1991; Lieberson and Waters 1988), in line with the frequently 
made claim that the importance of ascription in society has declined (Blau and Duncan 1967). It 
has also been recognized, however, that this transition is far from complete. Ethno-religious 
boundaries in society may have become weaker, but they have certainly not disappeared. Most 
of the interest of marriage researchers has focused on new immigrant groups and their 
descendants, groups that have increased in size and that have become increasingly important 
in debates about integration and multiculturalism. Older ethno-religious groups, groups that are 
less visible, and groups that have declined in size have received comparatively little attention. 
This is particularly true for the position of Jewish persons in Europe. 

We re-examine the role of ascription in contemporary society by analyzing the marriage 
choices of Jews in the Netherlands after the Second World War. Using a unique data set and a 
novel analytical approach — multilevel analyses of siblings — we examine to what extent family 
background plays a role in explaining Jewish endogamy. Although it is often assumed that 
ethno-religious endogamy is an indicator of ascription, the degree to which this is true depends 
on how important the family of origin is in this respect. In other words, if choices are largely 
made within the group and the family factor is important, ethnicity is largely ascribed. If choices 
are endogamous but the parents play no role, choices are more individual in nature and ethnicity 
becomes ‘achieved’ rather than ‘ascribed.’ Under the assumption that one’s ethnic background 
is not visible in day-to-day encounters, people may choose their ethnic identity out of the ethnic 
options they have (Waters 1990). We argue that the importance of ascription in society should 
not only be assessed by measuring ethno-religious marriage boundaries, but also by assessing 
the extent to which ethno-religious endogamy is dependent on family background influences. 


Direct correspondence to Matthijs Kalmijn, Department of Sociology, Tilburg University, P.O. Box 
90153, 5000 LE Tilburg, Netherlands. E-mail: M.Kalmijn@uvt.nl. 
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There are two ways in which family background will foster endogamy: through the 
intergenerational transmission of preferences and through the intergenerational provision of 
opportunities (Kalmijn 1998; Lieberson and Waters 1988). The first mechanism suggests that 
the more strongly parents are orientated toward their ethno-religious identity, the greater the 
chances that their children will marry endogamously. The underlying reasoning is that children 
from families that stress their ethno-religious roots will attach more importance to their own 
ethno-religious identity, and that this translates into a strong preference for a spouse in one’s 
own group. In addition, if the children have not adopted the preferences of their parents and 
want to marry outside the group, they may receive sanctions from their parents that will 
discourage them from pursuing the particular marriage candidate. 

The second mechanism suggests that the more homogeneous the contexts in which the 
parents are embedded, the higher the chances that their children will marry endogamously. 
Parents are often engaged in contexts that provide opportunities for their children to marry 
endogamously. Important examples are friendship networks, churches and other ethno-religious 
organizations, schools and geographic areas such as cities and neighborhoods. Such settings are 
typically not intended as a meeting place for marriage partners, but they may promote ethno- 
religious endogamy as a by-product by providing a homogeneous poo! of candidates to choose 
from. Such contexts are particularly important for groups that are very small in a society because 
the costs of finding someone within the group rise tremendously with declining relative size. 

Exposure to homogeneous contexts may be associated with endogamy for two reasons. 
There may be a direct effect because involvement in such contexts affects the marriage choices 
of the children by changing the pool of eligibles from which they can choose. The effect may also 
be spurious, however, because engagement in homogeneous contexts may in turn be caused by 
a sense of ethno-religious identity and by norms of endogamy. If the effect is spurious, we should 
find that the effect of involvement in contexts disappears after taking into account indicators of 
ethno-religious identity. If involvement in contexts is a matter of opportunity, we should find that the 
influence of contexts remains significant after taking into account attachments to ethno-religious 
identities. In a similar way, it can be argued that parental attachment to ethnic identities may have 
a direct and an indirect effect on the marriage choices of their children. Such attachments may 
have direct effects through the socialization of ethnic norms, but they may also have indirect 
effects because they translate into greater opportunities for marrying inside the group. 

Earlier studies of intermarriage among Jews in Europe and the United States have mostly 
been descriptive, showing that there is a significant marriage boundary between Jews and 
Gentiles. This has been shown for the prewar period in the Netherlands (Ultee and Luijkx 
1998), for the postwar period in the Netherlands (Van Praag 1971), for France (Bensimon and 
Della Pergola 1986), for Britain (Miller, Schmool and Lerman 1996), and for the United States 
(Lazerwitz 1995; Chiswick. 1997; Goldstein 1992). Intermarriage has also been found to 
increase across cohorts (Lazerwitz 1995). Much less is known, however, about the causes of 
endogamy. An exception is Waite and Friedman (1997), who use logistic regression analyses 
of Jewish endogamy in the United States to assess family influences. Waite and Friedman 
showed that people raised in a reform religious household are more likely to intermarry than 
people raised in a conservative or orthodox household. Mixed evidence was obtained for the 
role of religious rituals (Bar Mitzvah) and for attending Jewish schools. 

Our analyses expands this line of research by analyzing a new country and, more importantly, 
by applying a sibling design. In our survey, information was collected on all the siblings of the 
respondent. Because siblings share their parents and the circumstances under which they are 
brought up, sibling similarities in adult marriage choices reflect the influence of family . 
background (cf., Kuo and Hauser 1995). We estimate multilevel models on the marriage choices 
of multiple siblings; this allows us to decompose the total variance in marriage choices into an 
individual- and a family-level component. This makes it possible to estimate the tota/ influence of 
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family background without including direct indicators of family background. Subsequently, we 
use measured indicators of the family of origin to assess how much each of the identified 
alternative family background mechanisms account for the total family influence. 

Before turning to the data, some information on the situation of Jews in the Netherlands is in 
order (Van Solinge and De Vries 2001; Moore 1997). Compared to other countries that suffered 
from the Nazi occupation, the destructive impact of the Holocaust on the Jewish population has 
been particularly strong in the Netherlands. In the direct postwar years, there was a pronounced 
drive towards emigration among Dutch Jews, in particular to Israel and the United States. 
However, in later years many of these emigrants returned to the Netherlands. As in other countries, 
Jews in the Netherlands are concentrated in the larger cities and particularly in Amsterdam. A 
minority of Jews is formally affiliated to Jewish congregations, and this proportion remained more 
or less stable after the 1980s. Jews tend to form a socio-cultural elite in the sense that they are 
highly educated and occupy high-status occupations. As far as demographic behavior is 
concerned, marriage tends to occur late, cohabitation is common, and divorce frequent. 


Data 


Acquiring a representative sample of Jews in the Netherlands is a major challenge. The very small 
proportion of the Dutch population made up by Jews makes it unfeasible to start from a random 
sample of the population. To solve this problem, we followed the method of sampling Jewish 
names which was used in other countries (Bensimon and Della Pergola 1986; Miller, Schmool 
and Lerman 1996). The first stage of our survey involved a random selection of distinctive Jewish 
names (DJNs) from the Amsterdam phone register by three panels of experts. A name was 
added to the list of DJNs if it was listed by at least two of the three panels. The result was a 
sample of 453 DJNs. The Dutch telephone company then provided names and addresses for all 
telephone numbers linked to these 453 names in the entire country. Subsequently, a random 
sample was drawn of these addresses. Samples were drawn within regions in order to ensure a 
correct regional distribution of the Jewish sample. The final stage involved a screening by 
telephone. Persons who had at least one Jewish parent were asked to participate. This resulted 
in a total of 781 people who took part in the survey. In addition, 88 Jewish partners were 
interviewed. The response rate was 54 percent. Since the sampling procedure could not be 
expected to reach intermarried Jewish women who use their husband's name rather than their 
own name, a snowball sample of 167 such persons was added to the random sample. 
Considerable effort has been put into examining the representativeness of the realized 
sample. It has been compared to the population on five key aspects (age, sex, region, marital 
status and number of Jewish parents). The reference population was constructed using 
estimation procedures based on a variety of vital statistics and other data sources (Van Solinge 
and De Vries 2001). The only emerging difference was that too many single Jewish women and 
too few Jews with a non-Jewish father were interviewed. Both differences stem from the use 
of names to identify Jewish persons (i.e., single women are more likely to have an entry in the 
telephone register than partnered women and Jews with a non-Jewish father usually have a non- 
Jewish name). Weights were used to adjust the realized sample to this reference population. 
We use information about the current partner relationships of both our primary 
respondents and all their living siblings. This analysis is based on current partners of 
respondents and siblings because questions about the sibling's former partners were not 
asked. Current partners include partners with whom the respondent or sibling does not live. 
To avoid complications arising from stepfamily formation, we only included siblings who were 
born to the same parents as the respondents, and therefore, presumably, were raised in the 
same family circumstances as the respondent. The total number of selected families is 849 
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and the total number of respondents and siblings combined is 1,725 (with an average of two 
siblings per family). 


Measures 


The dependent variable is endogamy (inmarriage). We compare Jews who currently have a 
partner with at least one Jewish parent to Jews who have a partner with no Jewish parents. We 
do not differentiate among Jewish partners with different types and numbers of Jewish parents. 

The independent variables are measures of family background. We first consider a set of 
indicators of the intergenerational transmission of /dentities: 


Mixed Parental Marriage 


Three groups are distinguished: both parents are/were Jewish, only the father is Jewish, and 
only the mother is Jewish. The first group serves as the reference category. 


Religious Orthodoxy 


Information on the level of orthodoxy in the parental home is obtained from two questions on 
the way the respondent was brought up. Respondents had to rate two topics (religion and 
holidays/customs) on a three-point scale ranging from “not Jewish” to “very Jewish.” The two 
items correlate highly (.76). A scale score was calculated by taking the mean of the scores 
on the two items. 


Parental Divorce and Non-Jewish Stepparent 


It can be assumed that ethnic identities in the parental home are weaker when children were 
brought up with a non-Jewish stepparent. We include a dummy variable indicating this, as 
well as a control variable indicating whether the parents were divorced. 

We subsequently include family background measures that are related to the 
intergenerational provision of meeting opportunities. It should be noted at the outset, however, 
that such measures are always a combination of preferences and opportunities. Nonetheless, 
these measures are more strongly related to opportunities than the measures just discussed. 


Parental Networks 


In the series of questions about how the respondent was brought up, respondents also had 
to rate the friends and acquaintances of their parents (on a three-point scale ranging from “not 
Jewish" to “very Jewish’), 


Religious Organization Member 


Respondents reported on whether their parents were members of a Jewish religious 
community when the respondent attended elementary school. 
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Jewish Education 


We do not have sibling-specific data on Jewish education, but we assume that siblings will 
mostly have attended the same or similar schools. The variable indicates whether the 
respondent ever attended a Jewish day school or attended Jewish religious lessons. 


Proportion of Jews in Place of Birth 


To measure structural opportunities in geographic terms, we included the proportion of Jews in the 
place of birth of the respondent, using external data (Van Praag 1971). Since the distribution of the 
Jewish population across major regions is fairly stable across the post-war period, we do not use 
year-specific municipality data (Van Solinge and De Vries 2001). Jews who were born abroad were 
assigned the mean value on this variable and a dummy variable is included indicating whether a 
respondent was born abroad. Note that a nontrivial number of persons was born abroad (Table 1), 
but few of these entered the Netherlands with a spouse. In fact, most of them were young, 
unmarried or coming to the Netherlands to marry a Dutch spouse (Van Solinge and De Vries 2001). 


Table 1: Means and Standard Deviations of Independent Variables (Weighted Sibling Sample) 





Standard 
Mean Deviation 
Identity 
Mother non-Jewish (dichotomous) oil 
Father non-Jewish (dichotomous) 24 
Parents religiously orthodox (1-3) 1.56 67 
Parents divorced (dichotomous) “8 
Non-Jewish stepparent(s) (dichotomous) .05 
Contexts and Identity 
Parents Jewish friends (1-3) 1.87 Wee 
Parents members religious community (dichotomous) 36 
Gone to Jewish school or religious education (dichotomous)? 36 
Percentage of Jews in place of birth * 16 18 
Born abroad (dichotomous)? 3 
Controls 
Woman (dichotomous) 42 
Year of birth (-1900) 48 16 
Cohabiting (dichotomous) Po 
Non-resident partner (dichotomous) 07 
Educational level parents (1-6) 4.00 1.64 
Educational level parents missing (dichotomous) .04 
Other 
Average number of siblings 2.03 
Number of cases 1725 


nn 
® Refers to the respondent and not to his/her siblings. 
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All the measures we use apply to the family of origin of the respondent. By limiting the 
sample to siblings born to the same parents as the primary respondent, these measures also 
apply to the family of origin of the siblings. We should keep in mind, however, that in some 
cases, the reports reflect the perceptions of the respondent about what was happening in the 
family of origin and that such perceptions may differ among siblings. 

We include the following control variables: union status, distinguished into married, 
cohabiting, or not living together (this information is available for all siblings), parental 
education (averaged across father and mother), birth cohort and sex. Age at marriage and 
education are only available for the respondent and not for the siblings. Analyses on primary 
respondents show that after controlling for the other variables listed above, no direct effects 
of age at marriage or education remain, so we feel safe in excluding these variables from the 
sibling models. Means and standard deviations are in Table 1. 


Method 


We start examining the role of family background by presenting a cross-tabulation of marriage 
choices of all sibling pairs in our dataset. This cross-tabulation provides a straightforward way 
to get a first impression of how much siblings resemble each other in terms of their marriage 
choices. Next, we present results from a series of multilevel models of marriage choices of all 
siblings in our dataset. In the literature, sibling data are often analyzed through covariance 
models (Kuo and Hauser 1995), but recently multilevel or random-effects models have 
become popular as well (Curtis, Diamond and McDonald 1993; Van Poppel, Schellekens and 
Liefbroer 2002). In our view, there are no compelling reasons for favoring either one method. 

In multilevel models, differences among families and differences among individuals are 
estimated simultaneously (Goldstein 1995). The model provides a way to assess how much 
of the total variation in marriage choices is attributable to the common family environment 
shared by siblings and how much is attributable to factors unique to the individual. The model 
also allows for a correct estimation of the effects of measured family variables and enables us 
to assess to what extent variation among families is explained by these measured 
characteristics. Because the dependent variable is dichotomous rather than continuous, a 
logistic regression model is estimated. To estimate this model for multilevel data, we use a 
technique proposed by Goldstein (1995), i.e., restrictive iterative generalized least squares. 

To assess the role of alternative interpretations of family background effects, we estimate 
a series of models. The first model contains an intercept only. This model allows us to 
estimate the proportion of the total variance that is attributable to the family rather than to the 
individual. Next, we estimate a model containing variables linked to the intergenerational 
transmission of preferences and a model containing variables linked to the intergenerational 
provision of opportunities. In the fourth model, both types of variables are included to assess 
their relative importance. In a fifth model, finally, the control variables are added. 

A convenient measure to summarize the importance of the family factor is the so-called 
‘variance partition coefficient’ (VPC) (Goldstein 1995). This coefficient gives the proportion of the 
total variance attributable to the shared family environment. However, a complicating factor in 
assessing the VPC in our models is that we estimate a logistic regression model. In such a 
model, the individual level variance and the family level variance cannot be easily compared to 
one another, because the first depends on the expected value of the dependent variable and the 
second is measured on a logistic scale. Fortunately, Goldstein, Browne and Rasbash (2002) have - 
developed a method to estimate VPC in such models, by applying what they call a method of 
‘model linearization.’ We use this method to calculate the VPC for our model. 
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Results 


We start with some descriptives, using a sample of all first marriages or cohabiting 
relationships (Table 2). We observe that on average, 54 percent of all first marriages and 58 
percent of all first (married and unmarried) cohabitational unions are exogamous. This 
percentage fluctuates across union cohorts, but there is no clear trend. Overall, women are 
somewhat more likely to marry a partner with a Jewish background than men, but this 
difference has decreased among more recent union cohorts. Because we do not have data 
on the non-Jewish population, odds ratios can only be calculated when additional 
assumptions are made. Assuming that there are equal numbers of Jewish men and women, 
that the marriage behavior of Jewish men and women is the same (taking the overall 
percentage as the starting point), and that the married Jewish population is about .3 percent 
of the total married population, the (log) odds ratio would be 6.2. Compared to other ethnic 
and religious groups in Europe and the United States, this can be interpreted as indicating 
high levels of endogamy (Alba and Golden 1986; Lieberson and Waters 1988). 


Table 2: Percentage of Jews Whose First Cohabitation and Whose First Marriage Was With a Non- 
Jewish Spouse, By Union Cohort and Marriage Cohort, Respectively (Weighted Individual Sample) 


First cohabitation First marriage 

Union/ Marriage Men Women Total N Men Women Total N 
Cohort 

Before 1946 59 36 44 48 59 31 41 46 
1946-1959 68 59 59.179 69 56 60 176 
1960-1969 58 47 Blas 186 56 38 Al PiSi 
1970-1979 69 53 62 174 63 46 BOs, < itZ0 
1980-1989 59 63 Gi 2 9td24 65 57 61 83 
1990-1999 67 61 64.112 54 52 ga. =100 
Total 64 53 58 823 61 48 54. —s«69 


EEE 


To get a first impression of the overall importance of the family of origin, one may cross- 
tabulate partner choices of sibling pairs. We look at all pairs of siblings within a family, 
including pairs containing the respondent. The focus here (and in the regression models) is on 
current relationships, not on first relationships. It turns out that among persons who have a 
Jewish partner, the chance is 71 percent that the (younger) brother or sister also has a Jewish 
partner. If someone does not have a Jewish partner, the chances are only 20 percent that his 
or her sibling has a Jewish partner. The differential between these two chances is the odds 
ratio and this ratio is 3.6 and statistically significant. In other words, siblings are very much 
alike in the type of spouse they choose. 

The results of the multilevel regression models are in Table 3. Model 1 is an empty model 
which allows us to assess the overall family influence. Following Goldstein, Browne and 
Rasbach (2002), we calculate the variance partition coefficient (VPC) as follows: VPC = [0 
aw] [1 + exp(Bo + Bix)}? {L oy? m7] [1 + exp(Bo + Bix)}? + a (1- 7)}' where o,,’ is the family 
variance and wr is the expected mean in the model under consideration, i.e., 7 = exp(Bo + Bix) 
/[1 + exp(Bo + B:x)]. To calculate the proportion of unexplained variance at the family level, we 
start from Model 1 in Table 3. This is an empty model containing an intercept only. In this 
model , 0? =2.22, exp(Bo) = e *' = .403 and m =e *'/(1 + e™) = .287. Substituting these 
values in the formula for the VPC gives a VPC of .312. This implies that almost a third of the 
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variation in marriage choices is due to family differences. In line with what the odds ratios for 
sibling associations told us, this figure shows that family background is an important factor in 
understanding intermarriage choices of Dutch Jews. 

We now turn to the measured family background variables in Model 2, 3 and 4. Model 2 
contains ‘parental identity’ variables, Model 3 contains variables that are a mix of ‘parental 
context’ and ‘parental identity’ characteristics, and Model 4 contains both these sets. By 
comparing the effects of identity variables between Model 2 and Model 4, we get information 
about whether the effects of attachments to ethno-religious identities are indirect, through 
parental contexts. By comparing effects of Model 3 and Model 4, we obtain evidence about 
the degree to which effects of context variables are spurious. If exposure to ethnically 
homogeneous contexts is merely a reflection of attachment to ethno-religious identities and 
such contexts do not by themselves foster endogamy, the effects observed in Model 3 
should become smaller in Model 4. 

The regression results of Model 2 show that children of mixed marriages are much less 
likely to marry within the group than children with two Jewish parents. The effects are 
substantial in size, but depend on whether the father or mother is non-Jewish. Having a non- 
Jewish mother reduces the odds of endogamous marriage more strongly than having a non- 
Jewish father. In other words, the children of mixed marriages intermarry more often, and this 
is even more so when only the father is Jewish and the respondent is consequently not 
considered as Jewish if the Halachic definition is applied. This is in line with our expectations, 
but we note that the effect of having a non-Jewish father is still statistically significant. Hence, 
it is not just the formal definition of a Jewish identity that matters, it is also the social identity 
that counts. These effects are reduced somewhat when including other variables in the 
model (models 4 and 5), suggesting that the effects are partly indirect. The effects remain 
highly significant, however, showing that there is also a direct effect. We interpret this as 
evidence that there is direct transmission of endogamy across generations. 

Religious orthodoxy, as measured by adherence to Jewish religious customs and 
holidays in the parental home, also has a strong effect on endogamy. In families where 
religion plays an important role, the children are much less likely to marry outside their 
group. This effect remains significant in the multivariate models, when context variables 
are included. We do observe, however, that the effect of parental orthodoxy is reduced 
by about one third when including other variables in the model. The two final indicators for 
religious identity, parental divorce and having non-Jewish stepparents did not prove to be 
statistically significant. 

Variables that reflect an attachment to Jewish identity as well as an involvement in social 
settings that may foster endogamy are included in Model 3. The first indicator has a strong 
effect: when parents are a member of a religious community, the children are more likely to 
marry endogamously. This effect becomes insignificant, however, after we take into account 
identity variables. We therefore must conclude that the effect of religious membership is 

spurious: it is due to the intergenerational transmission of the Jewish identity and not to the 
homogeneous Jewish contexts in which the children are embedded. 

For the parental network, we find significant effects as well. When parents had more 
Jewish friends and acquaintances when the children were growing up, the children are more 
likely to marry endogamously. The effect becomes insignificant, however, when identity 
variables are added. In other words, homogeneous parental friendship networks are primarily 
a reflection of a stronger Jewish identity, they do not encourage endogamous marriages of 
the children directly by providing a more selective pool of people to choose from. 

A different pattern is observed for Jewish schools. When parents send their children to a 
Jewish school, the chances are significantly higher that the children will end up with a Jewish 
spouse. When controlling for Jewish identity variables, the effect is halved, but it remains 
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statistically significant. In other words, people who went to a Jewish school when they were 
young are more likely to marry endogamously than other Jewish persons, even if we take into 
account that they are more strongly socialized into a ethno-religious identity than persons 
who did not attend a Jewish school. This result favors the hypothesis about meeting and 
mating opportunities. 

The place of birth also turns out to play a role. People who were born in localities where 
the proportion of Jews is higher, are more likely to marry endogamously. The effect remains 
marginally significant and does not change in magnitude after all other variables are controlled 
for. This is another piece of evidence favoring the role of opportunity in intermarriage. 

We end by discussing the effects of individual characteristics. We observe much higher 
chances of an endogamous relationship when the couple is married rather than cohabiting. 
We also observe lower levels of endogamy when the couple is not living together, but the 
couples who cohabit differ more strongly from the married than couples who are just living 
apart. We also observe that when all variables are included in the model, women are more 
likely to marry within the group than men. This may be due to stronger endogamy norms 
for women. Finally, we find no effect of birth cohort and no effect of the educational level of 
the parents. 

The various models have added measured family background variables to the model. The 
first and empty model showed that 31 percent of the total variance could be attributed to 
family factors. To what extent are the measured family background characteristics able to 
explain the variance among families? We can answer this question by comparing the family 
variance in the various models. Model 2 which only contains family identity variables reduces 
the family variation by 68 percent. The model which contains mixed context and identity 
variables explains 59 percent of the family-level variation. The model containing both sets 
explains 75 percent of the family variance. In other words, about three quarters of the total 
family effect on intermarriage is picked up by our measured family characteristics. This is a 
substantial amount, showing that we succeeded quite well in measuring the ways in which 
parents influence the marriage choices of their children. 

Note that the estimate for the family variance in Model 5 is higher than that for Model 4. 
This may seem puzzling at first sight, but it results from the introduction of individual level 
variables in Model 5. These will lead to a reduction in the individual level variance. As a result, 
the relative amount of unexplained variation at the family level will rise, causing a possible 
increase in the absolute size of the family variation in its wake. However, because we 
introduced only family level variables between Model 1 and Model 4 a comparison of family 
variance estimates between these models was unproblematic. 

The results presented in Table 3 are based on a rather complex sampling design, including 
weighting, snowball sampling, and the inclusion of multiple household members. As a result, 
one may wonder how robust these results are. In order to gain insight into this question, we 
replicated the full model on an unweighted sample, a sample without snowball cases, a 
sample without father-Jews, and a sample without household members (results available on 
request). The conclusion from these sensitivity analyses is that they enhance our confidence 
in the results. For most alternative specifications, the results strongly resemble those from our 
weighted full sample. If results differ, the differences seem to point to biases introduced by 
using a selective sub-sample rather than to biases in our full sample. 


Conclusion 


Our analysis has shown in a novel and more convincing way than has been done before that 
the family of origin plays a crucial role in fostering ethnic endogamy. By analyzing the similarity 


The Family Factor in Jewish-Gentile Intermarriage * 1357 


of siblings, we can for the first time assess the total influence of the family of origin on 
marriage choices along lines of ascription. Our results show that about a third of the variation 
in these choices is attributable to the family of origin. The figure can be compared to figures 
obtained in stratification research. Sibling analyses of educational attainment in the United 
States, for example, show that about half the variance in educational attainment can be 
attributed to family background (Kuo and Hauser 1995). Our analyses suggest that family 
background is less important for intermarriage than it is for stratification outcomes. 

We subsequently examined two sociological mechanisms that may account for this 
variation: the mechanism of intergenerational transmission of preferences and the 
mechanism of the intergenerational provision of opportunities. Indeed, our measures of 
these two mechanisms were able to account for three quarters of the family variation in 
marriage choices. This is a sizeable proportion, suggesting that these mechanism are 
indeed influential. The mechanism of the intergenerational transmission of preferences 
turned out to be by far the most important of the two, once again suggesting that a 
socialization that puts an emphasis on the importance of one’s ethno-religious identity 
leaves a strong mark on its recipients. ’ 

The results also lead to a more general conclusion. In the past, ethnic and religious 
endogamy has often been considered as evidence for the importance of ascription in 
society. Similarly, that most forms of ethnic and religious intermarriage have increased, has 
been regarded as evidence for a decline in the importance of ascription. There have been 
exceptions to this general pattern, however, and we present the case of Jews in postwar 
Holland as another exception. The level of Jewish endogamy is high and has not declined 
over time, suggesting that ascribed group boundaries can still be important in modern 
society. 

Ethnic intermarriage is not necessarily evidence of ascription, however, because people 
may also choose their ethnic and religious identities. If people marry within the ethnic group 
because they themselves have chosen that as an important personal identity, endogamy may 
also be regarded as ‘achieved’ rather than ‘ascribed’ (Waters 1990). To obtain a stronger case 
for the role of ascription, it is therefore important to assess the influence of the family of origin 
on the type of marriage choices that people make. This paper has done that and has shown 
that about a third of the variance in ethnic endogamy is due to family background. In other 
words, we have double evidence on the importance of ascription: The marriage choices of 
Jews and Gentiles are often made within the ethnic group and these choices are to a large part 
governed by the family of origin. 
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Abstract 

Using data from the National Education Longitudinal Study of 1988, we identify parental age 
as influential in the parental provision of economic resources, social capital and cultural 
capital to adolescents, as well as in parental educational expectations for their children. At the 
bivariate level, the relationship is curvilinear, suggesting that having comparatively young or 
old parents is disadvantageous to teenagers, at least with regard to resource allocation. With 
controls for socioeconomic background and family structure, however, the pattern typically 
becomes positive and linear: as the age of the parent rises, so too does the transmission of 
resources to adolescent offspring. These patterns hold for most economic, social and cultural 
resources, although the pattern is strongest for economic ones and weakest — albeit still 
significant — for more interactional ones. Although maternal age is the primary focus of this 
article, supplementary analyses also confirm a generally positive relationship between 
paternal age and parental resources. These results suggest that parental age may warrant 
attention similar to that given to family structure, race and gender. 


Introduction 


Sociologists, especially those coming from human capital, rational choice and status 
attainment traditions, increasingly direct their attention to understanding the transmission of 
resources from parents to children (Becker 1981; Coleman 1988; Schneider and Coleman 
1993: Steelman et al. 2002). Meanwhile, sociologists from an aging and life course 
perspective persuasively demonstrate that age has profound social and psychological 
implications (Elder 1994; Mirowsky 1995; Riley 1987). Unfortunately, few scholars merge 
these two areas of inquiry to consider the relationship between parental age and resource 
allocation to progeny. Among the exceptions, Mare and Tzeng (1989) initially observe a 
curvilinear relationship between father’s age and son's educational attainment that flattens to 
a linear and positive relationship when other socioeconomic factors are taken into account. 
They opine that — but could not test whether — these patterns owe in large measure to 
children's differential access to resources corresponding with parental age. Contending that 
parental age remains “a neglected aspect of the process by which parents affect their 
offspring” (Mare and Tzeng 1989:108), they call upon social scientists to scrutinize parental 
age more closely. 
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McLoughlin and Judith Ouimet for their input. Authors contributions to this article are equal. Direct 
correspondence to Brian Powell, Department of Sociology, Indiana University, 744 Ballantine Hall, 
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Yet, more than 15 years later, this recommendation remains largely ignored. (For 
exceptions, see Conley 2001; Kalmijn and Kraaykamp 2005; and McLanahan 2004.) Instead, 
social scientists seem to ignore parental age, relegating it to the periphery or assuming that 
whatever impact it has is accounted for by other factors such as socioeconomic status. At 
best, the subject is relegated to such broad categories as to effectively preclude meaningful 
comparisons (other than between teenage and non-teenage parents). If it can be shown that 
parental age is related to resource allocation, especially if its linkage is net of other typically 
studied features, then parental age should not be overlooked. Indeed, parental age may 
warrant attention similar to that given to family structure, race and gender. In this article, we 
explore this possibility. Analyzing data from the National Education Longitudinal Study of 
1988, we examine whether maternal age is related to the allocation of cultural, economic and 
social capital to adolescents, as well as to parental expectations for their children’s education. 


Background 


As the family has undergone swift and dramatic changes, so too have the ages at which 
people mark major milestones, such as when they have their first or last child (Hamilton, 
Martin and Sutton 2004; Walter 1986).' Declining age at menarche, among other factors, 
increases the risk of adolescent pregnancy (Frisch 1975; Maynard 1997), while rapid 
technological developments allow women to become mothers long past the age once 
thought possible (Woliver 1995). Abortion and contraception, postponement of marriage and 
an increase of women in the work force, have led many women to delay childbearing until 
their late 30s or early 40s (Baldwin and Nord 1984; Hamilton et al. 2004). High divorce and 
remarriage rates further complicate fertility, as in the case of a twice-married woman with two 
widely-spaced sets of progeny born to her at different life stages. Thus, while people still 
visualize phases in their careers or the number of children they will have, there is now much 
more latitude and less urgency about the timing of such events (Buchmann 1989; Settersten 
and Hagestad 1996). Evidence of this may be found in the fact that more children are born to 
women aged 35 and older than to teenagers, and the number of births for women in their 30s 
and 40s is the highest in decades (Baldwin 1976; Hamilton et al. 2004; Martin 2002). 

Demographers acknowledge the role that age plays in determining structural parameters 
of the family such as family size, while life course scholars underscore how age shapes the 
privileges and expectations that people experience over the life course (Bumpass, Rindfuss 
and Janosik 1978; Hazelrigg 1997; Mirowsky and Ross 1999). Elder (1995) cogently argues 
that the life course of parents, which to a great degree corresponds to their ages, alters the 
experiences of their offspring. 

The benefits and costs of having younger or older parents elicit lively discussion, but little 
empirical research (Baldwin and Nord 1984; Maynard 1997). The sparse work that has been 
done typically is limited to comparisons of dichotomized age categories — mostly between 
children of teenage and non-teenage parents (Geronimus, Korenman and Hillemeir 1994: 
Moore, Morrison and Greene 1997) — rather than reflecting the full range of childbearing 
years. (For an exception, see Wakschlag et al. 2000.) These studies may perpetuate an 
assumption of homogeneity in the experiences of children whose mothers gave birth in their 
20s or later. Moreover, although sociologists have demonstrated the highly influential role that 
resources play in children’s daily lives and future prospects (Downey 1995; Schneider and 
Coleman 1993; Steelman et al. 2002), studies on parental age characteristically focus on 
outcomes (e.g., birth weight, children’s physical and mental health) other than resource 
outlays to children (Wakschlag et al. 2000; Rosenzweig and Wolpin 1995). 
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Hypothesized Linkages between Age and Resource Allocation 


How then might parental age figure into the transmission of resources to children? Five 
competing expectations can be derived from existing literature. 


Parental Age Is Positively Linked to Resource Allocation 


This proposition is simple: having an older parent is beneficial, at least with respect to 
resource allocation. This position is clearly articulated by McLanahan (2004), who sees 
advancing maternal age as a proxy for greater parenting quality and, correspondingly, parental 
resources. One of the most salient mechanisms by which older parents afford advantages 
to children is via the economically more favorable position that older parents typically enjoy 
over their younger counterparts. This position does not necessarily deny the influence of 
selectivity factors (i.e., the possibility that adults with higher income are more likely to delay 
having children — a possibility that we discuss later), but points to the literature that 
consistently shows that adults’ socioeconomic situation usually improves over time, in part 
because parental income grows and then reaches a plateau over the life cycle (Featherman 
and Spenner 1988; Ross and Mirowsky 1999). Accordingly, the greater resource allocation 
by older parents may be a function of their ability, regardless of such factors as parental 
predilections, to invest in their children. Perhaps the best evidence to date on this topic is 
provided by Eppenshade (1984), who finds a tendency for older parents to spend more on 
their children than do younger parents. This evidence, however, is incomplete because of the 
limited nature of resources studied. 

Financial wherewithal, however, may not be the sole or even primary reason that older 
parents provide more for their children. One prominent view of aging, labeled “age as 
maturity,” highlights the emotional, intellectual and social maturity that accompanies aging 
(Mirowsky and Ross 1992). Advancing age may be linked to readiness for parenthood, 
preferences and tastes, energy, networks, self-efficacy and competence, and life 
experiences, which in turn boost investments in children (Heuvel 1988; Walter 1986). 

Younger parents may be less equipped than their older counterparts to assume parental 
responsibilities. Some studies report that age is positively associated with parental 
satisfaction, and others find that younger mothers are more stressed, depressed and anxious 
than older ones (Hofferth 1987; McLanahan and Adams 1987). Competing demands that 
younger parents encounter also may impinge on both monetary and non-monetary resources 
(Mare and Tzeng 1989). 

In contrast, older parents may be more prepared for parenthood because they have had 
the chance to experience young adulthood without making the sacrifices of parenthood. 
Moreover, older parents may have had more time and prospects to construct valuable 
networks, may have more leisure time to spend with their children, or may channel more of 
their energy toward their children. Parental cultural tastes also may shift with age, resulting 
in greater participation and investments in cultural capital. Similarly, religious activity may be 
positively linked to parental age (Stolzenberg, Blair-Loy and Waite 1995), as might other 
activities that increase the extent to which parents and children are embedded in the 
community. Finally, if priorities and goals change and awareness of one’s own mortality 
becomes increasingly pronounced as one ages, then older parents may be more concerned 
than younger ones with passing on a legacy to children. lf so, older parents may be more 
willing to invest in and sacrifice for their children, net of their socioeconomic advantage over 


younger parents.” 
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Parental Age Is Negatively Linked to Resource Allocation 


Alternatively, later parenthood may have drawbacks. Functional impairment usually occurs in 
later life (Maddox and Clark 1992; Mirowsky 1995, 2002), and the chance of further physical 
difficulties increases with age. Consequently, older parents may be less adept than their 
younger counterparts at socializing their children because of the energy, time and strength 
required for the rigors of this demanding role (Juster 1985; Walter 1986). The large age gap 
between children and older parents also may be problematic; older parents may be less able 
to “take the role of the other” (i.e., of their child) and, therefore, may not be able to relate to 
children, especially adolescents, as effectively as younger parents (Heuvel 1988; Rossi 1980). 
Even if older parents are more connected to social networks that could benefit their children, 
this advantage does not necessarily mean that they are better able to capitalize on such ties. 
Younger parents may be better situated to impart whatever advantages they possess 
precisely because they are closer in age to children and, thus, may more easily communicate 
with them. 

Even if older parents have greater resources than do younger parents, older parents may 
find themselves caught in a “caring squeeze” in which they devote economic, time and other 
resources to both aging parents and children (Brody 1981; Pavalko and Artis 1997). The reality 
of approaching retirement also may constrain parental investments in children. Consequently, 
older parents may be less willing to assume new fiduciary responsibilities, such as paying for 
a child's college education. In contrast, younger parents, despite their lower earnings, may be 
more willing to invest in their children and even go into debt, because they can anticipate 
more years in which losses can be recovered and debt reduced. 


The Relationship of Parental Age to Resource Allocation is Curvilinear 


The two sets of expectations discussed above clearly diverge regarding the direction of the 
relationship between parental age and resource allocation, but they do share an assumption 
of linearity. An alternative position is one of nonlinearity — specifically, curvilinearity. That is, the 
disadvantages of having a much younger or older parent may co-exist. For example, if 
younger parents earn less money while older parents have greater financial obligations, then 
parents who fall in neither category (e.g., those who were in their late 20s at the birth of their 
children) may invest more in their children than will younger or older parents. Conversely, the 
advantages of being a younger or older parent may occur simultaneously. If so, the optimal 
timing for parenthood may be at both age extremes, with those in the middle age groups 
investing less. 


The Relationship between Parental Age and Resource Allocation Is Contingent 
on the Type of Resource 


As the interest in studying parental investments has grown, sociologists also have recognized 
that resources should not be viewed as a uni-dimensional concept. Resources vary in their 
liquidity, recoverability, divisibility and convertibility (Anheier, Gerhards and Romo 1995). 
Moreover, scholars differentiate several clusters of capital, including the following: (1) 
economic resources, i.e., the amount of money that parents spend on children or save for 
their future plans and security (Eppenshade 1984; Haveman and Wolfe 1994: Steelman and 
Powell 1991); (2) social capital, which, as described by Coleman (1988), consists of parental 
time and energy spent in interaction with children and the extent to which parents build 
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network ties that widen children’s opportunities (Portes 1998); and (3) cultural capital, which 
refers to culturally beneficial activities, such as music lessons or trips to museums (Bourdieu 
1977; DiMaggio 1982; DiMaggio and Mohr 1985). 

Considering the vast scope of resources, we should not automatically assume that all 
resources are similarly affected by parental age. In certain domains children quite possibly are 
advantaged by having younger parents, whereas in other domains they are advantaged by 
having older ones. Given the competing explanations offered above, one also might anticipate 
that certain resources will be more strongly influenced by parental age and others less so. 
Given the firmly established link between parental age and income, we might expect parental 
age to be more strongly associated, at least at the bivariate level, with economic resources 
than with those reflecting social or cultural capital. However, even among economic resources, 
we may need to differentiate between what parents have accumulated for their children (which 
may be positively linked to parental age) and, say, what parents are willing to financially 
sacrifice in the future (which may benefit children of younger parents). By the same token, 
parental age may be more positively linked to certain elements of social capital (e.g., parents’ 
network ties) but negatively associated with others-(e.g., parental interaction with children). 


The Relationship between Parental Age and Resource Allocation Is Due to 
Other Sociodemographic Factors 


Social scientists have identified different demographic profiles of individuals and families with 
older (or younger) parents (Martin 2002; Rindfuss, Swicegood and Rosenfeld 1987; Walter 
1986). Therefore, the seeming relationship between age and resources — whether positive or 
negative, linear or curvilinear, direct or indirect — may, upon closer inspection, be because of 
or via these sociodemographic or other factors. This possibility need not be interpreted as 
denying the significance of parental age per se, but rather as offering alternative explanations 
behind age differences than those posited above. 

As an illustration, parental education has been linked to both parental age and allocation of 
resources (Becker 1981; Mare and Tzeng 1989; Menaghan and Parcel 1995; Muller and 
Kerbow 1993; Schneider and Coleman 1993). If parental education influences age of 
parenthood (not only for one’s first child, but for all parities), then any ostensible effect of 
parental age on resources may be instead a function (either completely or partially) of parental 
education. Some, however, have argued the reverse: that age of parenthood also influences 
parents’ educational attainment (Hogan 1981). If so, then the effect of parental age on 
resource allocation may not be artifactual but rather indirect, i.e., via the effect of education. 

It therefore is possible that any observed effects of maternal age will lessen or disappear 
once selectivity processes linked to the timing of birth are taken into account. Scholars 
studying the effects of teenage parenting have made great, although not entirely successful, 
efforts to include appropriate controls, especially family background and maternal 
characteristics that affect both birth timing and various outcomes for children and their 
mothers. Others, most notably Geronimus and associates (Geronimus and Korenman, 1993; 
Geronimus et al. 1994), contend the deleterious effects of teenage motherhood that are 
found in cross-sectional models disappear in fixed-effects models (using sibling/cousin data), 
thereby demonstrating that these effects are due to selectivity. This contention, however, has 
been critiqued on methodological grounds and is inconsistent with or even diametric to other 
studies that also use fixed-effects models (Hoffman, Foster and Furstenberg 1993; Hotz, 
McElroy and Sanders 1997; Kalmijn and Kraaykamp 2005; Levine, Pollack and Comfort 2001).* 

Our data cannot definitively settle the issue of causality, but we can take into account a 
broader array of potentially confounding variables and include a more comprehensive set of 
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sensitivity analyses than typically have been used. In particular, if parental age varies by racial 
groups and racial groups vary in their ability or willingness to invest in their children (Powell 
and Steelman 1993; Roscigno 1998; Roscigno and Ainsworth-Darnell 1999), then we need 
to test whether the inclusion of race in models reduces the effects of parental age on 
resources. Similarly, family structure may come into play. Marital status, the number of 
children in the home, and birth order have been linked to both parental allocation of resources 
and parental age (Blake 1989; Downey 1995; McLanahan and Bumpass 1988; Steelman and 
Mercy 1980). Not taking into account these factors could overestimate the genuine effect of 
parental age. 

How ignoring potentially confounding factors obscures the relationship between parental 
age and resource allocation, however, may not be that simple. If the relationships between 
parental age, resources and such factors as parental education, race and family structure are 
not all in the same direction, it is possible that the effect of parental age emerges, strengthens 
or even shifts from curvilinear to linear (or vice versa) only upon controls for these variables. 


Research Questions 
Taking into account the aforementioned competing possibilities, we ask two questions: 
1. Is parental age linked to the allocation of resources to children? If so, then how? 


2. How does the inclusion of sociodemographic and familial factors alter the relationship 
between parental age and the allocation of resources to children? 


Data, Measures and Methods 


Data 


Our analysis relies on the National Education Longitudinal Survey of 1988 (hereafter 
referred to as NELS) - a nationally representative survey of eighth graders designed to 
provide information on an array of factors thought to influence children’s development, 
educational performance and life outcomes. NELS is distinctive from most other national 
data sets on adolescents because it also surveyed one parent/guardian in 1988 (and in a 
follow-up in 1992). A core goal of NELS is to provide data on the role that parents play in 
the educational development of their children. Consequently, both parents and children 
were asked an extensive number and range of items about parental behaviors and 
attitudes on investments in children. To date, this is the most comprehensive data set on 
parental investments linked to adolescents’ education, superseding other surveys (e.g., 
High School and Beyond) as the most frequently used data set addressing adolescents, 
parental resources and education. 

NELS is especially well-suited for this analysis. First, as noted above, its detailed measures 
of cultural, economic and social resources are unparalleled. Second, its inclusion of extensive 
information on background factors that may be confounded with parental age allows an 
assessment of the possibility that parental age effects are artifactual — particularly, a result of 
the link between parental age and these socioeconomic factors. Third, it includes measures 
of family/sib structure (in particular, number of siblings, ordinal position and marital status of 
parents); these are factors that if excluded from models might exacerbate or, in some cases, 
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mask potential effects of parental age. Fourth, its large sample size facilitates the application 
of multivariate analyses and enables us to test whether the effects of parental age vary by 
family structure (e.g., two-parent vs. one-parent households) and race/ethnicity, Fifth, its focus 
on adolescents allows us to examine parental flow of resources at a critical juncture in 
childhood and at a period when parental investments may be especially intensive and 
consequential. 


Measures of Independent and Dependent Variables 


Maternal Age 

Because mothers comprise the vast majority (approximately 80 percent) of parental 
respondents in NELS, our primary analysis examines the effects of maternal age and, 
consequently, is based on those cases in which the biological or adoptive mother is the 
respondent. Parents’ year of birth and children’s year of birth were provided by responding 
parents and students, respectively. From this information, we calculate maternal age at child's 
birth. Average maternal age at the birth of the focal child in is 25.7 years. Although most 
women were between the ages of 20 and 29 when this child was born (20-24: 31.8 percent; 
25-29: 34.2 percent), 23.2 percent were 30 years of age or older, and 6.9 percent were 35 
years of age or older. Summary statistics and descriptions of maternal age and other variables 
from NELS used in our study are displayed in Table 1. 


Parental Resources 
NELS has an impressive array of items that fall under the rubric of “parental resources.” To 
provide as comprehensive a test as possible, we analyze the effect of maternal age on 17 
resources. Our selection of these variables is prompted by three considerations. First, 
because we are interested in whether parental age wields similar effects on cultural, 
economic and social capital, we want to ensure that these forms of parental resources are 
reflected by our choice of items. These categories should not be seen as mutually exclusive, 
for there surely is overlap among them. For example, we classify cultural classes (i.e., 
attending classes outside school in art, music, dance, language and computers) as cultural 
capital, although one could argue that, because of the costs incurred, it should be seen at 
least partly as a type of economic capital. Parental educational expectations are the most 
difficult to classify; some scholars categorize expectations primarily as an indicator of social 
capital, while others see expectations as a measure of cultural capital, while still others 
question whether these should be viewed as a resource per se (Downey 1995; Roscigno and 
Ainsworth-Darnell 1999). As a result, educational expectations are treated separately from the 
three clusters of resources. Second, to facilitate comparisons to previous analyses of parental 
investments, we include variables that typically are incorporated in such studies (Downey 
1995: Freese and Powell 1999; McNeal 1999; Roscigno and Ainsworth-Darnell 1999; 
Teachman, Paasch and Carver 1996). Third, we select items that have been posited to be 
linked to later status outcomes such as educational progress or occupational success 
(Dimaggio 1982; Downey 1995; Roscigno and Ainsworth-Darnell 1999; Schneider and 
Coleman 1993; Steelman et al. 2002).° 

Our operationalization of cultural capital includes the aforementioned cultural classes, as well 
as children’s attendance at art, science and history museums. NELS covers a broader set of 
economic resources than cultural ones. We focus on seven indicators of parental expenditures 
on children’s education: whether parents have saved for their children’s post-secondary 
education and, if so, how much they saved and when they first began saving,° maternal 
willingness to go into debt for their child's college education, whether the child attends a private 
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school, the number of educational items (e.g., newspapers and encyclopedias) families possess, 
and whether the family has a computer that is used for educational purposes by the child. 

Social capital also is measured by seven items: the number of the child's friends and their 
parents that the mother knows,’ participation in parent-teacher organizations, volunteering at 
the child’s school, other forms of involvement with the school (i.e., attendance at school events 
and school meetings, visiting child’s classes, and speaking with the child's teacher/counselor), 
the frequency that parents help with children’s homework, the frequency that parents “do 
things” with child, and the frequency that children and parents talk about school (e.g., school 
activities, course selection, things studied in class, and plans for high school). 


Analytic Strategy 


We begin with simple bivariate models in which resources are regressed on maternal age. 
The specific statistical technique chosen is contingent on the type of resource analyzed: OLS 
regression is used for most items (e.g., number of educational objects), but others require 
logistic regression (e.g., availability of a computer) or tobit models (e.g., amount of money 
saved for the child’s post-secondary education). Because the relationship between maternal 
age and resources may not be linear, we also test for non-linear transformations of the 
parental age variable, focusing specifically on curvilinearity (i.e., the inclusion of a parabolic 
function). These analyses are useful in assessing whether the effects of age are consistent 
regardless of the type of resource and whether the magnitude and direction of the age effect 
vary by resource. 

We then investigate the extent to which the relationship between age and resources shifts 
after taking into account other factors: family income (the natural log) from the previous year, 
educational level of the most highly educated parent of the child, race (measured in dummy 
variables which differentiate among African American, Latina/o, Asian/Pacific Islander, 
American Indian/Alaskan Native, and White), child’s gender, number of siblings, ordinal 
position of the child, and the marital status of the mother.® *: '° As discussed earlier, not taking 
into account these factors may obscure or magnify the effect of parental age on these 
resources. For example, income may influence the relationship between parental age and 
resource allocation in two ways. Parental income may be the mechanism by which the 
relationship occurs. That is, the effect of parental age on resources may be via the economic 
advantage that older parents typically have over their younger counterparts. Alternatively, if 
higher income adults delay parenthood more often than those with lower income, then the 
relationship between parental age and resources may be largely artifactual. While we cannot 
arbitrate between these two possibilities, we can determine if the link between maternal age 
and resource allocation persists even with the addition of income to the multivariate models. 
We also can test whether any curvilinear patterns detected in the bivariate model remain so 
or become linear with the inclusion of the control variables. 


Results 


Maternal Age and Cultural Classes: An Illustration 


Table 2 illustrates the relationship between maternal age and resources, in this case, cultural 
classes — 1.e., the extent to which children have attended classes outside the school in art, 
music, dance, language and computers. The regression in Model 1 reveals a significant 
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bivariate relationship between maternal age and cultural classes (b = .218, p < .001)." In 
other words, children whose mothers are older attend more cultural classes on average than 
do their counterparts with younger mothers. This relationship is due in part to controls such 
as income and education that are added in Model 2; however, the relationship remains 
significant (b = .146, p < .001). An inspection of the standardized effects (not shown here) 
suggests that although the influence of maternal age is less than that of parental education, 
income, or the child's sex (with girls participating in more classes than boys do), maternal 
age appears to be more important than the race categories, marital status of the parent, 
sibship size and birth’ order. 


Table 2: Unstandardized OLS Regression Coefficients for Cultural Classes on Maternal Age and 
Control Variables for Maternal Respondents (N = 15,401)# 


Independent 
Variable Model 1 Model 2 Model 3 Model 4 
Age of mother 21ers .146*** 1.234*** ion 
(decades) (.013) (.014) (.087) (.085) 
Age of mother BAGO -.023 
(squared) (.016) (.015) 
Oot .080*** 
Family income (.007) (.007) 
.241*** .240"*"* 
Parents’ education (.007) (.007) 
-.030 -.026 
African American? (.023) (.023) 
OUT .149*** 
Asian American? (.038) (.038) 
034 -.034 
Latina? (.024) (.024) 
-.067 -.065 
Native American? (.073) (.073) 
fou al es 
Female child (.014) (.014) 
-.006 -.005 
Sibship size (.006) (.006) 
-.037*** -.038*** 
Birth order (.008) (.008) 
-.074*** -.075*** 
Married (.018) (.018) 
Intercept .085 -1.332 -1.241 -1.490 
R2 : 018 .180 027 .180 


i 
* Robust standard errors are in parentheses. 


> White is the referent category. 
*p <.05 **p<.01 *** p< .001 (two-tailed) 
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The effect of birth order, which often is confounded with maternal age (i.e., later-born 
children are more likely than first-born children to have older mothers), is notable.'* In Model 2, 
birth order and cultural classes are significantly related: later-born children attend fewer cultural 
classes. This effect would not have been detected in models that exclude maternal age (these 
models are available upon request). In this case, then, the effect of birth order is underestimated 
until maternal age is added."* As will be seen later in this article, the exclusion of maternal age 
may also overestimate the direct effect of birth order on other dependent variables. 

As hypothesized earlier, the effect of parental age may not be strictly linear. Models 3 and 
4 explore whether maternal age has a curvilinear (i.e., parabolic) relationship with cultural 
classes. These models differ from Models 1 and 2 by adding a quadratic (i.e., squared) term 
for maternal age. In Model 3, maternal age has a significant curvilinear effect on cultural 
classes before controls are incorporated. Upon calculating the inflection point, we see that 
age has a positive (but diminishing) effect on cultural classes until mothers are 33.4 years old 
at the child’s birth, after which maternal age has a negative impact.” Once controls, however, 
are added in Model 4, the curvilinear relationship vanishes: the squared term is no longer 
significant.", ” Instead, the relationship between maternal age and cultural classes becomes 
significant, positive, and linear (Model 2)."° This shift from a curvilinear effect of age in bivariate 
models to a linear one in multivariate models mirrors the patterns identified by Mare and 
Tzeng (1989). 

To visually depict the aforementioned patterns, Figure 1 displays the relationship between 
maternal age and cultural classes. Maternal age is measured as a series of dummy variables 
representing intervals of maternal birth years, with mothers who were teenagers at their 
child's birth as the reference interval. A curvilinear pattern is seen in Panel A of Figure 1, which 
shows the unadjusted (i.e., no controls), unstandardized regression coefficients for each 
interval. However, Panel B of Figure 1, which displays the adjusted (i.e., including controls) 
regression coefficients, shows that the impact of maternal age (although less in magnitude 
than in the bivariate model) increases in approximately equal increments for each additional 
age interval. 


Maternal Age and Parental Resources 


Table 2 and Figure 1 present evidence of an advantageous effect of having older mothers, at 
least with respect to attendance in cultural classes. Of course, maternal age may not operate 
in a similar fashion — in either direction or magnitude of its effect — for other resources. Table 
3 summarizes the effects of age on 17 different types of resources, reflecting the three 
general categories of resources discussed earlier — cultural, economic and social — as well as 
parental educational expectations. 

Patterns presented in Table 3 are remarkably consistent. At the bivariate level (Model 1), 
maternal age Is significantly related to all 17 resources. With the exception of two variables 
(i.e., willingness to go into debt for children’s education and helping with homework), the 
relationship is positive: maternal age is linked to greater investment in these resources. Even 
with controls added in Model 2, the effects of maternal age, which typically are reduced 
somewhat, remain significant and positive for these 15 resources. In addition, without 
controls (Model 3), significant curvilinear effects are found for 16 of the 17 items (the 
exception being willingness to incur debt); however, when controls are included in Model 4, 
the significant curvilinear effect disappears for all but two variables (i.e., educational objects 
in the home and educational expectations).'"® The overall pattern, notwithstanding the few 
exceptions, is a positive and linear relationship, strongly and consistently indicating an 
advantage to youths with older mothers. 
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Figure 1. Unstandardized Coefficients from Regressions of Cultural Classes on Maternal Age 


Panel A: Bivariate Regression 
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Panel B: Multivariate Regression with Control Variables 
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Table 3: Unstandardized Coefficients from Regressions of Parental Resources on Maternal Age 
and Control Variables for Maternal Respondents» 





Model 1 Model 2 Model 3 Model 4 
Linear Age  LinearAge Curvilinear Age — Curvilinear Age 
(decades) (decades) (decades) (decades) 
Without With Without With 
Parental Resource Controls Controls° Controls Controls** 
Cultural Resources 
T.204"" 
(.087) 
218"** .146*** a1 GO 
Cultural classes (.013) (.014) (.016) NS® 
1.546*** 
(.117) 
rea .149*** mor 
Cultural activities (.017) (.020) (.021) NS 
Economic Resources 
2.099*** 
(.201) 
26h oo me S ple 
Saved for college’ (.029) (.038) (.036) NS 
2,1590~" 
(.247) 
mow Ni -.429*** 
When started saving (.036) (.041) (.044) NS 
13.524*** 
(.940) 
2.020" S12 -2.078*** 
Amount saved? (.134) (.144) (.168) NS 
mgood) -.300*** 
Willing to incur debt' (.040) (.048) NS NS 
3.855""* 
(.264) 
‘Ori See ool “000 
Private school! (.032) (.045) (.045) NS 
wale = 1.060*** 
(.149) (.142) 
Og h mia O02 -.134*** 
Educational objects (.022) (.024) (.027) (.025) 
3.439"** 
(.231) 
443*** ‘oot aoe 


Computer in home’ .030 041 041 NS 
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Table 3: Unstandardized Coefficients from Regressions of Parental Resources on Maternal Age 
and Control Variables for Maternal Respondents *” (continued) 


Model 1 Model 2 Model 3 Model 4 
LinearAge §_LinearAge  Curvilinear Age = Curvilinear Age 
(decades) (decades) (decades) (decades) 
Without With Without With 
Parental Resource Controls Controls° Controls* Controls 
Social Resources 
4,043*** 
(.282) 
Know friends and their S35e0 ons -.639*** 
parents (.041) (.046) (.051) NS 
12267" 
(.104) 
Parent-teacher .249*** 220" -.178"** 
organizations (.015) (.018) (.019) NS 
2.218 
(.242) 
418*** 347" -.319*** 
Volunteer at school’ (.032) (.042) (.042) NS 
.939"** 
(.126) 
Involvement with OTe .109*** ri fay abe 
school (.018) (.021) (.023) NS 
.386*** 
(.110) 
.043** .060** -.062*** 
Do things with child (.016) (.020) (.020) NS 
.209* 
(.090) 
-.028* -.034* -.043** 
Help with homework (.013) (.016) (.016) NS 
13707 
(.173) 
.148*** .098*** eta 
Talk about school (.025) (.030) (.031) NS 
1.478*** ao 
(.085) (.080) 
‘Educational i ae st AO en -.230°™* -.035* 
Expectations (.012) (.013) (.015) (.014) 


*Ns range from 8,876 to 16,746. 

Robust standard errors are in parentheses. 

“Controls include logged family income and marital status (from 1988 for resources from 1988 
or 1990 Survey and from 1992 for resources from 1992 Survey), parents’ education, race of 
mother, sex of child, sibship size, birth order and dummy variables for all controls to indicate 
missing values. ; ; 

‘The first and second numbers are the coefficient and standard error, ae for the linear 
component of the curvilinear term; the third and fourth numbers are the coe cient and 
standard error for the squared component of the curvilinear term. 

‘NS indicates that the squared term for age (curvilinear) is not significant at a = .05 (two-tailed). 

ogistic regression of dichotomous dependent variable; OLS regression used unless specified 
otherwise. ! 

’Tobit regression of censored dependent variable. 


*p<.05 **p<.01 **p < .001 (two-tailed) 
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Comparing different categories of resources, we see that although its effect on the 
variables comprising cultural capital investments (cultural class attendance and participation 
in cultural activities) is strong, maternal age exhibits similarly strong or even stronger effects 
on economic resources. The older the mother, the more likely and the earlier parents started 
to save for college, the more they actually saved for college, the more likely the child attended 
a private high school, and the more likely the child used a computer in the home for 
educational purposes. ' 

Yet, there are two exceptions to this significant, positive and linear pattern for 
economic resources. The first, educational objects at home, has a curvilinear relationship 
with maternal age, even in multivariate models. Closer inspection of Model 4 suggests 
that the inflection point when controls are added is approximately 40 (39.6). Thus, for all 
but the oldest mothers (i.e., those 40 or above at the birth of the child), the impact of 
increasing age is beneficial.” 

The second exception is maternal willingness to incur debt for their children’s education. 
Its relationship to maternal age is negative in both the bivariate (b = -.263, p < .001) and 
multivariate (b = -.300, p < .001) models; moreover, as seen in Models 3 and 4, there is no 
evidence of a curvilinear relationship for this item.*' These patterns suggest that older 
mothers are more debt aversive than younger mothers. Interestingly, the relative 
(standardized) effect of maternal age on this item is stronger than those of education or 
income (available upon request). Several possibilities explain this pattern. Because they 
already have saved for their children’s education and/or their current financial situation 
appears secure, older mothers might have responded negatively to the question because 
they view the need to go into debt as implausible; however, in auxiliary analyses in which we 
included parental savings behavior (e.g., the likelihood of, timing of and the amount of 
parental saving) and other controls such as familial income, the effect of maternal age 
remains significantly negative. We believe a more parsimonious explanation is that the older 
mothers’ greater hesitancy to go into debt is a function of the increased salience of other 
future financial obligations, such as the need to secure sufficient funds for retirement and the 
relatively short window of time to recoup the debt incurred. Although the pattern for parental 
willingness to go into debt runs counter to that for other economic resources, it provides 
strong evidence for the argument that parental age is a key and distinct factor in shaping 
parental views toward investments in children.” 

Investments in social capital also appear to be greater among older mothers, although the 
effect is somewhat weaker than those for economic or cultural capital resources. Older 
mothers know a greater number of their children’s friends and their children’s friends’ parents, 
are more active in parent-teacher organizations, are more likely to volunteer in school, are 
generally more involved in schools, “do things” more frequently with their children, and 
discuss school more frequently with their children than do younger mothers. These patterns 
hold in both bivariate (Model 1) and multivariate (Model 2) models; moreover, as in the case 
of most of the cultural capital and economic resources, what appears to be a curvilinear 
relationship at the bivariate level (Model 3) transforms into a linear, but significantly positive 
relationship with the addition of controls (Model 4). We do, however, find that maternal age 
is significantly (albeit at the .05 level) but negatively linked to helping with homework, a 
pattern that holds in both bivariate and multivariate models. One possibility for this anomaly 
is that children of older parents, in part because of their greater investments in cultural, 
economic and social resources, have fewer academic problems and may need less help with 
their homework. This explanation is borne out in supplementary analyses: grades and 
standardized test scores are negatively associated with parents helping with homework and 
upon including grades and/or standardized tests scores in the models, the effect of maternal 
age is no longer significant. 
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As seen in Models 1 and 2, maternal age also is strongly linked to parental educational 
expectations (b = .212, p < .001, Model 1; b = .110, p < .001, Model 2). The coefficients in 
Models 3 and 4 suggest curvilinearity; however, as in the aforementioned case of educational 
objects in the home, the point of inflection is high (43.6 years). 

A few comments regarding the control variables used in Models 2 and 4 are warranted. 
Consistent with previous literature, the effects of maternal education and familial income on 
parental resources generally are quite pronounced (Downey 1995; Freese and Powell 1999). 
An inspection of the standardized coefficients (not shown here) indicates that the effect of 
maternal education is stronger than that of maternal age on all but one resource (i.e., 
willingness to go into debt). Similarly, familial income’s effect is greater than that of maternal 
age on 13 of the 17 resources. But what we find even more striking, however, is that the 
influence of maternal age is generally stronger than any of the other controls — variables that 
have received far more empirical scrutiny than has parental age. All told, its effect is greater 
than that of the race categories on 13, sex on 12, marital status on 14, sibship size on 14, and 
birth order on 15 of the 17 resources. Of note, the effect of birth order generally is reduced 
when parental age is included in the models. For example, although auxiliary analyses 
suggest later-born children have access to more educational objects than do first-born 
children, this advantage appears to be primarily a function of maternal age. That is, when later- 
born children are the beneficiaries of greater resources, it is precisely because they also are 
more likely to have older mothers. 


Supplemental Analyses Using NELS 


To check if our findings are robust across alternative model specifications, we performed 
several additional analyses. We first tested whether the observed effects of maternal age vary 
across different family configurations. Few differences emerged when comparing single- 
parent versus two-parent families (the one exception being the amount of money saved for 
college) or biological/adoptive parents vs. stepparents (the exception being willingness to 
incur debt).” 

In addition, we checked whether the effects of maternal age differ for first-born children 
and later-born children. The effect of maternal age is significantly greater for first-born than for 
latter-born children for only four resources, but in each case, the effect of maternal age is 
significant for both. We also tested whether maternal age operates differently depending on 
race/ethnicity. Again, of the few differences that materialized, these effects were generally 
stronger for whites than for blacks and, especially, Asian Americans and Latinas. 

Finally, as shown in Appendix B, we compared the effects of maternal age to those of 
paternal age.” We ran models separately for maternal and paternal age and then included 
both maternal and paternal ages in the models simultaneously and performed Wald tests to 
determine if the effects of maternal and paternal age differed significantly. 

With few exceptions, these auxiliary analyses show that paternal age and maternal age 
operate similarly. In the multivariate models, the effects of paternal and maternal age 
significantly differ for only four items. The only item that is more strongly linked to paternal 
age than to maternal age is an economic resource: the amount of money saved for college. 
Of the other three items, which fall under the rubric of social resources, volunteering in school 
and knowing child's friends and their parents are more positively linked to maternal age, while 
helping with homework continues to be more negatively linked to paternal age. 

These supplementary analyses indicate that the patterns observed in Table 3 are largely 
consistent across different family configurations, ordinal position and racial groups and hold 
for both maternal and paternal age.” 
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Discussion 


Intuitively, age should be an appealing sociological variable; indeed, this is the claim of many 
life course scholars. Age is an ascribed, visible status that has been related to other social 
phenomena, such as depression, experiences in work and empathy. Especially during a 
period of time when delayed childbirth by women has been characterized by the media and 
by some academicians as both selfish and self-destructive (Hewlett 2002), it is 
counterintuitive that sociologists have not been more vigilant in examining how parental age 
is linked to the conferral of advantages and disadvantages to children. In this article, we 
explicitly examine how parental age is linked to the intergenerational flow of resources to 
adolescent offspring. Our broader goal, however, is to encourage greater attention to age as 
a component of stratification that affects the life experiences and prospects not only of the 
incumbent but also of family members. 

What we find throughout this research is a remarkably consistent pattern that suggests 
that advancing age of mothers — and, as suggested by supplementary analyses, fathers — 
provides an advantage to youths. With few exceptions, this relationship persists across the 
different forms of capital studied here. That the general pattern holds across so many 
measures of resources and across multiple model specifications increases our confidence 
that the impact of parental age is genuine. 

The nexus among resources, parental age and socioeconomic status, as well as other 
sociodemographic factors, deserves further comment. Paralleling Mare and Tzeng’s (1989) 
analysis of men’s educational attainment, we observe a curvilinear relationship between 
parental age and resource allocation at the bivariate level that transforms into a linear one 
upon controlling for parental education, income, gender, race and family and sibling structure. 
Moreover, part of the bivariate relationship between parental age and resources can be 
attributed to these factors. As noted earlier, two distinct possibilities arise: a selection effect 
in which those who are socioeconomically advantaged have children at a later age and an 
indirect effect in which the impact of age on resources occurs via these socioeconomic 
variables. Regardless of the interpretation one endorses (and we believe that it is likely that 
both explanations apply to some degree), it is noteworthy that a substantial portion of the link 
between parental age and resources is independent of these factors. Unfortunately, data 
restrictions in NELS precluded us from also considering the implications of familial wealth in 
our models. It has been well documented that wealth is positively linked to the flow of 
resources, especially economic ones, and to age (Conley 1999, 2001). Therefore, part of the 
remaining effect of parental age on resources in the multivariate models may be via 
differences in wealth.” 

Implications of the components of sibling structure are more irregular than those of 
parental education or income. After one takes into account parental age, the effects of family 
size and birth order veer off in different directions for different outcomes. But one implication 
is clear: studies of family structure that have ignored or downplayed the role of parental age 
unwittingly may have over- or under-estimated the effects of these family structure features. 
In light of our findings, results generated by such studies need to be reconsidered. 

Perhaps even more striking is that the effects of parental age, although not as powerful 
as those of parental income or education, are more consistent and at least as strong if not 
stronger than those of other variables that have received a goodly share of empirical attention 
from social scientists. Indeed, scholars have argued, often persuasively, that if we want to 
understand how families impart resources to their children, we need to examine the influence | 
of gender, race and family structure. To this list, we add parental age. Taken in its totality, this 
impressive body of findings hints that the significance of parental age in children’s lives may 
be more consequential than has been commonly assumed. 
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That said, we remain cautious about overreaching here. Despite our attempt to explore 
as Comprehensive a set of resources as possible, older parental age may not be a consistent 
advantage for other resources or parental traits, such as the degree of intergenerational 
conflict and parental rigidity/flexibility in interacting with their children. In addition, because we 
investigate parental transfers of resources to adolescents only, further research is needed to 
see whether this pattern is also found in early childhood. Younger children may require a 
different set or form of parental investments that younger parents may be more able to 
provide. To illustrate, one could make the argument that having younger parents is 
advantageous to young children in areas such as play, sports and other activities that demand 
considerable physical energy or stamina. Even if older parents also prove advantageous for 
both young children and adolescents, these advantages might be offset when children reach 
adulthood and face greater obligations to provide financial and other support for their aging 
parents. 

In addition, it would be instructive to determine whether, and if so how, parental age’s link 
to the distribution of resources translates into such outcomes as physical and emotional well- 
being and educational success. As mentioned earlier, our focus on parental resources is 
based in part on the increasing body of sociological evidence that these resources do affect 
children’s educational performance and other status outcomes. However, we contend that 
these resources are important not only because of their potential influence on later status 
outcomes, but also because they are indicators of the quality of life experienced by children 
at this juncture in their life course.” 

We also should be cautious in overgeneralizing beyond — and even within — contemporary 
life in the United States. The robust patterns presented here beg the question of whether the 
advantage of having an older parent is applicable across subcultures, societies and history. 
Surely, the social context in which families are embedded may reduce or exacerbate the 
extent to which parental age wields influence. As an illustration, if certain groups in the United 
States (e.g., Mormons and Asian immigrants) or in other countries (e.g., Arabs in Israel) rely 
heavily on extended kin and community networks, these alternative strategies to distribute 
resources to children may counterbalance any disadvantage/advantage that results from the 
parents’ age or other structural features of the family (Powell, Werum and Steelman 2004; 
Shavit and Pierce 1991). Moreover, state theorists have documented the wide diversity in the 
extent to which governmental efforts are directed to children (Esping-Andersen 1999). For 
example, in social democratic (e.g., Norway) and corporatist (e.g., Germany) countries that 
offer more generous social welfare than in the United States, parental age may be less 
consequential in the amount of parental resources allotted to children as well as the 
implications of these resources for children’s life chances. Other governmental programs, 
such as affirmative action policies in India or Malaysia, also may weaken the influence of 
parents and parental age for certain groups — in particular, those to whom the policies are 
directed (Pong 1997). Whereas these suggest scenarios in which the effects of parental age 
are lessened, it also is possible that under certain conditions, especially in developing 
countries or historical periods with scarcer resources and lower life expectancies, the 
relationship between parental age and resources (and children’s life chances) is negative. That 
is, the edge may go to children of younger women because they are better suited to 
successfully bear children (especially if obstetrical services are not widely available), to 
withstand the rigor of work required to obtain resources, and to survive through the child's 
formative years, and because the socioeconomic advantages that typically increase with age 
in the United States may not be applicable. 

Certainly, our greatest concern is that of causality. Sociologists have struggled with the 
issue of causality and this case is no exception. We cannot with certainty confirm that the 
relationships found here are causal or due to age per se. If most parents deliberately plan the 
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timing of their children’s birth, then there still may be some unmeasured factors 
systematically related to parental age that make its relationship to resource transmission 
partially spurious. Geronimus and Korenman (1993) and Geronimus et al. (1994) go even 
further and assert that maternal age effects — defined in their work as the distinction between 
teenage mothers and mothers who gave birth in their early 20s — are entirely spurious. 
Although challenged for methodological reasons and for being inconsistent with other studies 
using statistical approaches (Hoffman et al. 1993; Hotz et al. 1997; Kalmijn and Kraaykamp 
2005: Levine et al. 2001), Geronimus and associates’ claim underscores the difficulties in 
attributing causality in the study of maternal age. We try to minimize this problem by taking 
into account sociodemographic variables most closely linked to parental age and by doing 
an extensive set of sensitivity analyses. Paradoxically, if selectivity processes involving 
socioeconomic status shape both fertility timing and subsequent maternal life chances, our 
controls for post-birth socioeconomic status inadvertently may deflate rather than inflate 
maternal age effects, opposite the contention of Geronimus et al. (1994). The upshot is that 
our analyses may yield conservative estimates of the differences between children of young 
and older mothers. Given current data and methodological constraints, we believe that our 
best strategy to ultimately identify the true influence (or lack of an influence) of parental age is 
a multi-methodological one that will rely on the insights of an admixture of cross-sectional, 
longitudinal and fixed-effects quantitative analysis as well as qualitative analysis. We, thus, 
see our findings not as firmly establishing the causal link between parental age and the 
intergenerational flow of resources, but rather as encouraging more dialogue on this topic. At 
a minimum, however, our findings confirm that, regardless of the reason, parents who have 
children at a later age offer more resources than do their peers who have their children at an 
earlier age. 

By no means should our analyses be seen as an unequivocal endorsement for women to 
delay parenthood. Although our findings suggest that children born to older parents may fare 
better, at least in terms of resources, there also are personal and even societal trade-offs 
should delayed childbearing decisions become increasingly normative (Hewlett 2002; 
Morgan 2003). For example, women who postpone childbearing may find themselves in the 
position of remaining childless or not having the number of children desired, which may result 
in interpersonal difficulties for couples. Delayed childbearing also has been implicated in the 
unremitting decrease in the fertility rate, which some demographers worry may prompt 
political upheaval and social dislocation (Bongaarts and Feeney 1998; Morgan 2003). 

Other demographers recently have expressed a different but related concern about the 
future: that differences in parental age, along with differences in education, employment and 
marital status that have accompanied the second demographic transition, will create 
“diverging destinies” for children (McLanahan 2004). Explicitly testing for the association 
between parental age and resource allocation that is implied by these predictions, our results 
establish yet another area in which family background bestows advantages or disadvantages. 
This source of inequality may become even more salient with the technological and social 
changes that may increase the dispersion of ages of parenthood. If future analyses confirm 
what we have presented here, then sociologists should bring age more closely to the 
forefront of the study of intergenerational processes and stratification more broadly. 


Notes 


1. Our interest here is not on the transition to parenthood per se, but rather on the nexus 
between parental age and the allocation to children regardless of the child's parity (birth 
order). Nonetheless, supplementary analyses focus on the impact of parental age on the 
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flow of resources to first born children and yield results similar to those presented in this 
article. 


. This position also squares with sociobiological/evolutionary psychological scholarship. 
Sociobiologists and evolutionary psychologists might note that the greater sacrifice by 
older parents is a rational strategy to the extent that their primary goal is to increase their 
fitness —i.e., perpetuate their genes (Wilson 1975). Because their future fertility is far more 
constrained than that of younger ones, older parents should be more likely to invest in 
their children whereas younger ones may be likely to save some of their investments for 
offspring yet to be born. 


There are other ways to delineate the forms of capital. Downey (1995), for example, 
dichotomizes capital into economic and interactional resources, while Freese and Powell 
(1999) differentiate among economic investments, cultural capital investments, parental 
involvement in child's education, supervision of child's activities and social capital 
investments. 


. Among these criticisms are data limitations (in particular, using an unrepresentative and 
disproportionately disadvantaged sample that by focusing primarily on teenage mothers 
makes it difficult to assess the effects of a more complete range of maternal ages) and 
that the comparison between their fixed-effects and cross-sectional models overstated 
the omitted variable bias because their cross-sectional models excluded available 
sociodemographic variables that should have been included. 


. This is not to say that each of these resources actually does positively influence these 
outcomes or does so in a uniform way. Recently, for example, Morgan and Serensen 
(1999), Carbonaro (1999), and Hallinan and Kubitschek (1999) debated whether social 
capital positively influences educational performance. Moreover, as we will see later, the 
link between parental assistance in homework and academic performance is counter to 
what we might expect. Scholars also have questioned whether the effects of social, 
cultural and economic resources are consistent across different races and social classes 
(McNeal 1999; Roscigno and Ainsworth-Darnell 1999). 


Most resources we examined come from the first wave of NELS; however, some items 
come from subsequent waves. For example, the parental questionnaire in the second 
follow-up (1992) asks parents when they began to save for their children’s post-secondary 
education. 


_ Alternative measurements of this item (Morgan and Sorensen 1999) yield results very 
similar to those presented here. 


. The sample restriction to maternal respondents reduced the original sample size (of 
22,651 parents) by approximately 20 percent (to 17,600). The exclusion of mothers who 
did not provide information on their age further reduces the sample to 16,958. For each 
regression model, cases also were deleted if mothers did not provide complete 
information on the resource used as the dependent variable. For all variables other than 
the independent and dependent ones, mean values are imputed for missing values. All 
models also include dummy variables indicating mean substitution. Reliance on other 
techniques to adjust for missing cases (e.g., listwise deletion and regression imputation) 
yields results that are consistent with those presented here. 
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Supplementary analyses indicate that the patterns discussed here are unaltered when the 
educational level of the mother is used instead of the educational level of the most highly 
educated parent or when several categories of marital status (e.g., currently married, 
formerly married, cohabiting, single) are used in lieu of a dichotomy (i.e., married, not 
married). 


Supplementary analyses explore alternative measures of parents’ socioeconomic standing: 
a composite indicator of SES provided in NELS, maternal and paternal employment, and 
the SEI and type (measured by 16 occupational categories) of mothers and fathers’ 
occupations. Patterns regarding maternal age persist regardless of the measures selected. 


. The use of ordered logit models instead of OLS regression results in virtually the same 


patterns as those presented here. 


. Child's sex is included not only because of its potential link to resources, but also because 


others have suggested a positive relationship between maternal age and the likelihood 
of giving birth to a daughter (Nicolich, Huebner and Schnatter 2000). That said, we find 
no empirical support for this suggestion: the correlation between maternal age and sex of 
child (female = 1) is not significant (r = -.02) 


. Maternal age is more strongly linked to birth order than to other control variables. 


. The unique effects of birth order and sibling size are difficult to disentangle. For example, 


an only child also is the first born. Nevertheless, in auxiliary analyses that included 
alternative parameterizations of the sibship features, a strong maternal age effect endures. 


If Y= bo + biX + buX? where Y is cultural classes and X is maternal age (in decades), 
then the inflection (maximum) point is equal to -b1/2bi. 


Even if the square term were significant in Model 4, the pattern still would not support the 
idea of a curvilinear effect of maternal age on cultural class. The calculated inflection point 
increases to 59.3 years, |.e., past the childbearing years of virtually all women. 


As expected, maternal age is positively linked to income, education, number of siblings 
and being married. Closer inspection suggests that some of these patterns are curvilinear: 
the relationship between maternal age and education is positive until the inflection point 
of 31.6 years. Similarly, a disproportionate number of unmarried women are located at 
both extremes of maternal age (with an inflection point of 31). Some racial differences 
also emerge: white and Asian American mothers are generally older than their Latina and 
Native American counterparts, who are older than their African American peers. 


In supplementary analyses not shown here, we examined other nonlinear transformations 
such as logarithmic functions, cubic functions, square root of age and inverse of age 
(Mirowsky and Ross 1992). None of these approaches provides a consistently better fit 
than the use of the parabolic function in the bivariate models or of the linear function in the 
multivariate models, nor do they alter the substantive conclusions of the analyses 
presented regarding the influence of parental age. 


For clarity of presentation, Table 3 presents the coefficients only for significant curvilinear 
patterns. All coefficients for maternal age are displayed in Appendix A. 
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Even with this curvilinear pattern, children whose mothers gave birth in their 40s fared 
better than children whose mothers gave birth at much earlier ages. For example, if we 
use the case of a mother who gives birth to a child at the age of 45, educational resources 
still would be greater than those to children whose mothers were younger than 35 when 
they gave birth. 


Excluded from these analyses were mothers who responded “we haven't thought about 
it yet” to this item. As a result, the sample size here is less than for the other economic 
items that were asked in the first wave of NELS. However, if we did include this response 
in a multinomial logistic regression, maternal age still would be negatively linked to 
willingness to go into debt. 


Most patterns presented in this article arguably could be incorporated readily within a 
sociobiological or evolutionary psychological frame; however, we suspect that there 
might be more difficulty in reconciling this orientation with the finding that younger 
parents show a greater predisposition to sacrifice for their children (as indicated by their 
willingness to go into debt). 


Supplementary analyses considered the effect of maternal age on another ostensible 
indicator of parental resources: parental monitoring of child’s behavior (i.e., a scale 
gauging the degree to which mothers try to find out how their child spends free time, with 
whom the child associates, where the child goes after school, where the child goes at 
night, and how the child spends his/her money). The positive relationship between these 
two variables at the bivariate level is not significant; however, at the multivariate level, the 
positive effect of maternal age on monitoring is significant. Thus, one could interpret this 
pattern as generally consistent with the position that maternal age is positively linked to 
parental investments. We nevertheless question whether monitoring Is an appropriate 
measure of parental resources, in part because its relationship to education and income 
is weak. Rather, a high score on the monitoring scale may reflect other parental traits, 
such as an orientation to conformity or authoritarianism (Freese and Powell 1999). 


When comparing the effects of maternal age, we typically used a .05 level of significance. 
Because there are few stepparents in the sample (234 mothers, 156 fathers), however, we 
used a more liberal .10 level of significance to compare stepparents with biological/adoptive 
parents; however, this still resulted in few differences in the effects of maternal age across 
different familial configurations. Tables summarizing these analyses are available upon 
request. 


Information about paternal age usually was provided by the mother. As a result, the 
possibility of error in maternal responses regarding fathers’ age (and vice versa when the 
father is the respondent) arises. While we cannot test this explicitly, we have compared 
distributions of maternal and paternal reports on mothers’ age (and on fathers’ age) and 
are reassured by their similarity. Additional analyses that restricts the sample to self- 
reports only (i.e., using parental respondents when examining the effect of paternal age 
and using maternal respondents when examining the effects of maternal age) yield results 
consistent with those reported above. 


To ascertain whether the patterns found here are unique to NELS, we also conducted 
supplementary analyses of High School and Beyond (HSB) and the National Longitudinal 
Survey of the High School Class of 1972 (NLS-72). Despite some differences between 
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these two data sets and NELS (e.g., youth reports of parental age in NSL-72 vs. parental 
reports in NELS and a more limited set of indicators of parental resources in HSB and 
NLS-72 than in NELS), the general patterns from these auxiliary analyses are strikingly 
similar to those discussed earlier. 


Some evidence regarding this possibility can be gleaned by our supplementary analyses 
of the High School and Beyond data set, which, despite its other limitations (e.g., only a 
small percentage of parents were asked to complete the parental questionnaire; fewer 
measures of parental resources), included a limited set of questions regarding familial 
wealth (i.e., home equity, other assets and debts). These analyses revealed significant and 
positive relationships between familial wealth and both maternal age and some resources 
(parental savings for college, willingness to incur debt, private school attendance and 
educational objects in the home). Nevertheless, in these analyses, the effect of maternal 
age on these resources remains significant, although reduced somewhat, even upon the 
inclusion of familial wealth in the models. 


Preliminary analysis of the implications of parental age for educational outcomes looks 
promising. As also shown elsewhere (Downey 1995; Roscigno and Ainsworth-Darnell 
1999; Schneider and Coleman 1993), the resources in NELS studied here, with few 
exceptions, are significantly and positively related to (in both bivariate and multivariate 
models) standardized math and reading test scores (in 8", 10" and 12" grade), school 
grades, high school graduation and college matriculation. The most conspicuous 
exception, as noted earlier, is parental assistance with homework, which is negatively 
linked to some of these educational outcomes. Notably, bivariate and multivariate models 
of these educational outcomes (OLS for test scores and grades and logit and multinomial 
logistic regression for educational attainment) confirm a significant (p < .001) link to 
maternal age. Additional tests, however, suggests some curvilinearity in the bivariate 
patterns: the inflection points for each educational outcome hover between 31 and 32 
years. Even more important, however, the relationship between maternal age and the 
educational outcomes (controlling for parental education, income, sibship structure, race 
and marital status) is to a large degree a function of parental resources. In the linear 
models, the effects of maternal age are attenuated by one-third to two-thirds with the 
inclusion of the parental resources. In the parabolic models, the curvilinear effects 
generally remained statistically significant with the addition of the parental resources; 
however, the inflection points typically increased beyond the childbearing years of virtually 
all women. We remind the readers, however, that these are preliminary findings that will 
require more elaboration. 
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Appendix A. A Complete Listing of Unstandardized Coefficients from Regressions of Parental 
Resources on Curvilinear Maternal Age and Control Variables for Maternal Respondents, 
National Education Longitudinal Study ** 


Model 1 Model 2 Model 3 Model 4 
Linear Age Linear Age  CurvilinearAge — Curvilinear Age 
(decades) (decades) (decades) (decades) 
Without With Without With 
Parental Resource - Controls Controls® Controls* Controls** 
Cultural Resources 
1.234" wis 
(.087) (.085) 
210° .146*** -.185*** -.023 
Cultural classes (.013) (.014) (.016) (.015) 
15467" 368" 
(.117) (.117) 
245°" .149*** oie, -.039 
Cultural activities (.017) (.020) (.021) (.021) 
Economic Resources 
2.099*** -.138 
(.201) (.225) 
ot aoe? OO .050 
Saved for college (.029) (.038) (.036) (.040) 
2.759*** 410 
(.247) (.240) 
ou .280*** -.429*** -.023 
When started saving! (.036) (.041) (.044) (.042). 
1ecb24e -.278 
(.940) (.873) 
2028""* ‘ole. -2.078*** 13 
Amount saved‘ (.134) (.144) (.168) (.154) 
-.090 -.606* 
(.275) (.295) 
20a -.300*** -.031 .054 
Willing: to incur debt® (.040) (.048) (.049). (.052). 
pao lolehes .895™* 
(.264) (.290) 
Oth oar) = 553 -.045 
Private school? (.032) (.045) (.045) (.050) 
aloes 1.060*** 
(.149) (.142) 
DOr 314" -.502*** -.134** 
Educational objects (.022) (.024) (.027) (.025) 
3.439*** ooh 
(.231) (.255) 
443*** mio. 050 -.063 
Computer in home® (.030) (.041) (.041) (.045) 


eS 
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Appendix A. A Complete Listing of Unstandardized Coefficients from Regressions of Parental 
Resources on Curvilinear Maternal Age and Control Variables for Maternal Respondents, 
National Education Longitudinal Study *” (continued) 


Model 1 Model 2 Model 3 Model 4 
Linear Age Linear Age — Curvilinear Age —Curvilinear Age 
(decades) (decades) (decades) (decades) 
Without With Without With 
Parental Resource Controls Controls® Controls* Controls** 
Social Resources 
4.043*** 454 
(.282) (.276) 
Know friends and their so a -.639*** .050 
parents (.041) (.046) (.051) (.049) 
1.2263 ron. 
(.104) (.105) 
Parent-teacher .249*** 220" -.178"** -.028 
organizations (.015) (.018) (.019) (.019) 
2210 ° 524* 
(.242) (.261) 
.418*** a -.319** -.031 
Volunteer at school? (.032) (.042) (.042) (.045) 
.939*** 126 
(.126) (.127) 
Involvement with MOTs .109*** 15 -.003 
school (.018) (.021) (.023) (.023) 
.404*** 128 
(.110) (.115) 
.045** .056** -.065*** -.013 
Do things with child (.016) (.019) (.020) (.020) 
.209* -.037 
(.090) (.094) 
-.028* -.034* -.043"* .0004 
Help with homework (.013) (.016) (.016) (.017) 
13370" 102 
(.173) (.176) 
.148*** .098*** -.222*** -.001 
Talk about school (.025) (.030) (.031) (.031) 
1.478*** mon 
(.085) (.080) 
Educational p21 ane sO -.230"** -.035* 
Expectations (.012) (.013) (.015) (.014) 


“Ns range from 8,876 to 16,746. 

Robust standard errors are in parentheses. 

“Controls include logged family income and marital status (from 1988 for resources from 1988 
or 1990 Survey and from 1992 for resources from 1992 Survey), parents’ education, race of 
mother, sex of child, sibship size, birth order, and dummy eaicbles for all controls to indicate 
missing values. 

‘The first and second numbers are the coefficient and standard error, respectively, for the linear 
component of the curvilinear term; the third and fourth numbers are the coefficient and 
standard error for the squared component of the curvilinear term. 

“Logistic regression of dichotomous dependent variable; others use OLS regression unless 
specified otherwise. 

obit regression of censored dependent variable. 
*p<.05 **p<.01 ***p < .001 (two-tailed) 
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Appendix B. Unstandardized Coefficients from Regressions of Parental Resources on Paternal 
Age and Control Variables, National Education Longitudinal Study*” 


Parental Resource 


Cultural Resources 


Cultural classes 


Cultural activities 
Economic Resources 


Saved for college® 


When started saving 


Amount saved® 


Willing to incur debt® 


Private school* 


Educational objects 


Computer in home® 


Model 1 
Paternal 


Age Only, 


Father 
Sample 
Without 
Controls 


164*** 
(.024) 


155*** 
(.033) 


Pie 
(.055) 


156" 
(.078) 


1.816" 
(.256) 


-.184* 
(.077) 


572" 
(.061) 


374" 
(.042) 


324** 
(.054) 


Model 2 
Paternal 
Age Only, 
Father 
Sample 
With 
Controls® 


“oo 
(.026) 


074" 
(.037) 


168" 
(.069) 


146 
(.088) 


1.205** 
(277) 


227" 
(.093) 


AG0*** 
(.081) 


457% 
(.044) 


092 
(.071) 


Model 3 
Maternal (M) 
and Paternal (P) 
Age, Mother and 
Father Sample 
Without 
Controls® 


153***(M) 
(.022) 
060** (P) 
(.020) 


198**(M) 
(.028) 

020 (P) 
(.025) 


133** (M) 
(.048) 
091" (P) 
(.042) 


328***(M) 
(.051) 
024. (P) 
(.044) 


935***(M) 
(217) 
1.102**(P) 
(.191) 


-.091 (M) 
(.065) 
-.194***(P) 
(.058) 


A45***(M) 
(.051) 
201***(P) 
(.046) 


356***(M) 
(.033) 
103"**(P) 
(.030) 


272**(M) 
(.047) 
098" (P) 
(.042) 


Model 4 
Maternal (M) 


and Paternal (P) 
Age, Mother and 


Father Sample 
With 
Controls® 


103***(M) 
(.021) 
089***(P) 
(.018) 


128***(M) 
(.028) 
054* (P) 
(.025) 


042 (M) 
(.053) 
180"**(P) 
(.047) 


254**(M) 
(.050) 
113" (P) 
(.043) 


226 (M) 
(.200) 
4.326"**(P) 
(174) 


-.180** (M) 
(.068) 
-.176"* (P) 
(.060) 


421***(M) 
(.060) 
295***(P) 
(.052) 


170**(M) 
(.031) 
124***(P) 
(.028) 


088 (M) 
(.055) 
091 (P) 
(.048) 


See 
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Appendix B. Unstandardized Coefficients from Regressions of Parental Resources on Paternal 
Age and Control Variables, National Education Longitudinal Study*” (contin ued) 


Model 1 Model 2 Model 3 Model 4 
Paternal Paternal Maternal (M) Maternal (M) 
Age Only, Age Only, and Paternal(P) and Paternal (P) 
Father Father Age, Mother and Age, Mother and 
Sample Sample Father Sample Father Sample 
Without With Without With 
Parental Resource Controls Controls® Controls® Controls® 
Social Resources 
.444***(M) .154* (M) 
(.069) (.066) 
Know friends and their .166 .046 -.166™* (P) -.084 (P) 
parents (.088) (.097) (.061) (.058) 
.184***(M) .147***(M) 
(.026) (.026) 
Parent-teacher al30e .097** .027 (P) .074** (P) 
organizations (.029) (.033) (.023) (.023) 
.355***(M) .311***(M) 
(.051) (.057) 
077 .018 -.074 (P) -.044 (P) 
Volunteer at school’ (.063) (.081) (.046) (.050) 
.090** (M) .078* (M) 
(.031) .030 
Involvement with .063 .082* -.014 (P) OFT™ (P) 
school (.037) (.040) (.027) (.027) 
-.006 (M -.007 (M 
(.025) (.026) 
-.040 -.037 -.017 (P) .004 (P) 
Do things with child (.030) (.036) (.023) (.023) 
032 (M) 022 (M) 
(.022) (.022) 
-.107*** -.106*** -.086***(P) -.065***(P) 
Help with homework (.025) (.029) (.019) (.020) 
.097* (M) .044 (M) 
(.042) (.042) 
.076 .079 -.015 (P) .055 (P) 
Talk about school (.049) (.055) (.037) (.037) 
.136***(M) .047* (M) 
(.020) (.019) 
Educational Ue ot elle .051™ (P) .056***(P) 
Expectations (.024) (.025) (.018) (.016) 





“Ns range from 1,097 to 3,760 for Models 1 and 2 and 8,708 to 13,812 for Models 3 and 4. 
Models | and 2 are based on cases in which the father is the parental respondent, while Models 
3 and 4 are restricted to two-parent households, regardless of the sex of respondent. Robust 
standard errors are in parentheses. oe a. ; 

Bolded coefficients in Models 3 and 4 indicate a significant difference (at the .05 level) between 
the maternal age and paternal age coefficients (based on Wald tests). 
“Controls include logged family income and marital status (from 1988 for resources from 1988 
or 1990 Survey and from 1992 for resources from 1992 Survey), parents’ education, parents’ 
race, sex of child, sibship size, birth order, sex of parental respondent and dummy variables for 
ll controls to indicate missing values, . 

oa regression of dichotomous dependent variable; OLS regression used unless specified 
otherwise. 
“Tobit regression of censored ort variable. 
*p<.05 *p<.01 ***p < .001 (two-tailed) 
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Abstract 

This paper examines how school engagement influences the timing of family formation for 
youth. We pay particular attention to variation across four racial/ethnic groups and by 
generation status, variation that reflects the diversification of U.S. society through 
immigration. Using data from the National Education Longitudinal Study (NELS), we 
employ discrete-time multinomial logistic regression models examining the likelihood of 
childbearing or marriage in late adolescence. We find that the delaying effects of school 
enrollment and engagement vary by race/ethnicity, suggesting that strategies for 
socioeconomic success that focus on delaying family roles are more important among some 
groups than others. The results also indicate that controlling for school enrollment and 
school engagement reduces differences in early marriage and non-marital childbearing by 
generation status. 


Introduction 


The relationship between race/ethnicity, schooling and family formation during the transition 
to adulthood is difficult to disentangle because so much is intertwined in this portion of the 
life course. Furthermore, most investigators have focused on the effect of adolescent 
marriage and childbearing on subsequent educational and economic attainment rather than 
the reverse. Research on adolescents from different backgrounds suggests that for some 
race/ethnic groups, early family formation might present more of an impediment to 
subsequent educational attainment than for others (Forste and Tienda 1992). But schooling 
may also affect the likelinood of marriage and childbearing (Manlove 1998; Moore et al. 1998). 
This issue is complicated by the fact that not all racial and ethnic groups appear to take the 
same paths to school completion or family formation (i.e., Hogan and Kitagawa 1985). The 
potential variation is further augmented by immigration processes, which increase ethnic 
diversity and also introduce variation in a temporal dimension, as national origin groups alter 
their family formation across generations in the new host society setting. 

Resolving such inconsistencies has become increasingly important as these processes 
play a critical role in theories of racial differences in status attainment and family formation in 
early adulthood. We ask whether group differences in the timing of family formation can be 
attributed to differences in school engagement prior to any family formation event. We focus 
on this possible role of school engagement on family formation for youth from four different 
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racial/ethnic groups that have featured in the theories of central interest. Much of the research 
on school engagement and family formation is limited to blacks and whites (e.g., Moore et al. 
1998). The theoretical guidance for predicting group differences in early family formation is 
limited when applying research to the multi-ethnic, multi-generation reality of today’s youth. 
We go beyond previous research by focusing on the likelihood of marriage or childbearing for 
a cohort of adolescents from four racial/ethnic groups, following them through their early 20s 
and exploring differences for both boys and girls.. We also evaluate the extent to which school 
engagement is related to generation status differences in the timing of family formation. This 
becomes especially salient today because the increasing diversity of the adolescent 
population is so closely tied to increases in immigration and the growth of the second 
generation. Our analysis expands previous research by incorporating expanded measures 
of generation and immigrant status. 


Background 


Young adulthood is a time of great change. The major transitions — into and out of school, 
union formation, and parenthood — are not only more likely to occur but are also likely to occur 
at a more rapid pace in young adulthood than at older ages (Hogan and Astone 1986; Marini 
1984; Rindfuss 1991). It is hardly surprising that there has been disagreement about which 
transitions cause which. Studies focused on the effect of family formation activities in 
adolescence often found a negative relationship between early family formation and 
educational or socioeconomic attainment (Forste and Tienda 1992; Hoffman, Foster and 
Furstenburg 1993a; Mare and Winship 1991; Moore et al. 1993). Others, however, have 
concluded that when the endogeneity of school enrollment and adolescent childbearing is 
accounted for, childbearing has no effect on the continuation of education (Geronimus and 
Korenman 1992; Olsen and Farkas 1989; Ribar 1994; Upchurch et al. 1993). 

Our approach is similar to that of researchers who have set the causal arrow to suggest 
that continued engagement in school (enrollment and family investment) is associated with 
slower transitions to later family formation (Manlove 1998; Mare and Winship 1991; Moore et 
al. 1998; Thornton et al. 1995). School enrollment may be incompatible with family formation 
for several reasons. The roles of student and parent can be extremely demanding and hence 
difficult to maintain in tandem. Young parents need time for childcare and employment to 
support their new families, making it difficult to invest in schooling. 

However, racial and ethnic differences have been found suggesting early family formation 
is less of a barrier to school completion for blacks than for non-Hispanic whites and Hispanics 
(Forste and Tienda 1992). Here we ask whether racial/ethnic variation in engagement in school 
mediates the relationship between race and early family formation? In other words, not 
whether having an early birth reduces ultimate educational attainment but whether 
differences in various aspects of school involvement in adolescence help explain differences 
in later family formation patterns, including early childbearing and marriage. 


Ethnicity, Theory and Early Family Formation 


Where do racial and ethnic group differences in family formation timing originate? Both race 
and access to economic resources are important in determining expectations of and 
pathways to family formation (Brewster 1994; Stier and Tienda 1997) as well as educational 
expectations (Beutel 2000; Hockaday et al. 2000; Kao 2000; Karraker 1992). For example, in 
spite of higher educational expectations, Black youth are more likely to engage in family 
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formation behaviors than others, suggesting a more tenuous connection between school and 
delay of family formation for this group than non-Hispanic whites. 

The link between socioeconomic attainment (including schooling) and delay of family 
formation is important in determining group differences. For example, groups for whom 
opportunities for structural incorporation are blocked (i.e., they have little hope of gaining the 
socioeconomic status of the majority group) may make earlier or non-normative transitions to 
family formation in general and parenthood specifically. In the logic of this “blocked 
opportunity” approach, family formation will be less impeded by school enrollment for groups 
facing structural impediments to achieving upward mobility because schooling itself is less 
linked to upward mobility. In other words, the effect of schooling on the timing of family 
formation may vary by race/ethnicity because these groups vary in their expectations for 
school attainment and timing of family roles. 

Others have argued that some groups, instead of reacting to limited opportunities by 
disengaging from school and forming families earlier, remain attached to school and delay 
family formation. This is the “minority group status” hypothesis (Goldscheider and Uhlenberg 
1969). Minority group members are expected to delay their family formation because their 
minority status makes it more difficult to achieve upward mobility, given structural 
impediments to socioeconomic attainment. 

These different strategies may distinguish groups, as the research above has found, and 
may also characterize different generations (Espenshade and Ye 1994). Children of immigrant 
parents (i.e., immigrant youth or second generation youth born to immigrant parents) may be 
particularly encouraged by their immigrant parents or other immigrants to remain in school 
and forgo early family formation, both marriage and parenthood, relative to the third 
generation which has less contact with the immigrant generation. For example, one study 
finds Southeast Asian and Mexican origin girls born in the United States perceived greater 
odds of having a non-marital birth than those born in another country (East 1998). Thus, while 
racial and ethnic differences in the effect of school engagement on family formation may be 
linked to the prevalence of immigrants within the groups, this previous research also 
suggests variation in early family formation across generations and an increased risk of early 
childbearing with adaptation in the United States (Manlove et al. 2000; Upchurch et al. 2001). 

Based on the expectations of the ‘minority group status’ hypothesis and other studies of 
fertility, it is the differential exposure to the receiving context that is the key to influencing 
family formation patterns across generations. These differences, therefore, may be more 
pronounced among immigrant youth who have been in the United States for different periods 
of time than between second and third generation youth. For example, immigrant youth who 
enter the United States after schooling has begun will have had some of their socialization 
experiences in the country of origin but those who entered as very young children 
(sometimes termed the 1.5 generation) may more closely resemble their second generation 
peers. Both 1.5-generation youth and second-generation youth will spend the majority of their 
childhoods in the United States in families with immigrant parents. Few studies of family 
formation have considered generation status among adolescents or incorporated length of 
residence in the United States into the analysis. 


Engagement in School 


School enrollment is expected to delay the entrance into marriage or parenthood. Other 
measures of school engagement that directly identify the students’ participation in their 
education, such as time spent on homework or participation in class, may also help explain 
racial and ethnic differences in outcomes (Portes and MacLeod, 1996). We take “school 
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engagement” here to indicate the level of this entire range of characteristics from attendance 
through the degree of involvement among attendees. 

The status attainment model, so familiar in sociology, emphasizes the importance of family 
characteristics on school engagement and completion (e.g., Haveman, Wolfe and Spaulding, 
1991: Jencks, Crouse and Meuser, 1983; Kalmijn, 1994). Attitudes and parental participation 
in a youth's education are fundamental to increasing human capital in the form of skills and 
achievement. Differences in the link between school enrollment and family formation may 
reflect differences in family involvement in schooling (Lareau 1989; Manlove et al. 2000; 
Moore et al. 1998). Adolescents’ greater educational aspirations lead to a reduction in early 
sexual activity, greater contraceptive use and a greater likelihood of pregnancy termination 
(Abrahamse et al. 1988). Parental aspirations and expectations influence children’s own 
expectations and attitudes towards school and vice versa (Goldenberg et al. 2001; Hossler 
and Stage 1992). Overall, therefore, families encouraging greater involvement in schools may 
be more successful at keeping adolescents in school and delaying early family formation. 

Further complicating the racial/ethnic differences of the effect of schooling on family 
formation timing is the possibility that school engagement decreases among youth in the 
second-generation relative to foreign-born youth, as some research has suggested. 
Immigrant youth may maintain greater attachment to schooling than U.S.-born youth due to 
higher aspirations and involvement of parents and others within immigrant families who may 
promote high achievement (Glick and White 2004; Hao and Bonstead-Bruns 1998; Stanton- 
Salazar 1997). There is considerable variation here as some ethnic groups are more likely to 
evidence a decrease in commitment to formal education among second-generation youth 
than others (Portes and MacLeod 1996; Portes and Rumbaut 2001). We will be cognizant of 
this possibility and examine interactions of race/ethnicity and generation status as well. 

Most of the recent work that has examined the relationship between school enrollment 
and early family formation has focused on early (and non-marital) childbearing, not marriage. 
However, in research that reverses the causality in this analysis, Astone and Upchurch (1994) 
find that both pathways to family formation have a negative effect upon school completion. 
This relationship varies across racial/ethnic groups; Forste and Tienda (1992) find that early 
marriage affects school completion only for whites, but early childbearing has a negative 
effect for whites, blacks and Hispanics. There could also be differences in early marriage vs. 
non-marital childbearing across generation status because, for some immigrant communities, 
a young age at marriage may be in keeping with traditional patterns. Non-marital fertility, on 
the other hand, is less likely to receive collective support within many immigrant 
communities. (See Zhou and Bankston (1994) on Vietnamese immigrants for example.) We 
examine both early marriage and early childbearing as outcomes in order to examine these 
possible differences. However, while the size of coefficients may vary, we expect the direction 
of the effects will be the same whether predicting early marriage or predicting non-marital 
adolescent childbearing. 


Hypotheses 


We form three specific hypotheses for our analyses of marriage and non-marital childbearing. 
Our first hypothesis focuses on the direct effects of our measures of school engagement, 
predicting that parental expectations, previous school performance, school enrollment and 
intensity of school engagement will be negatively associated with family formation, especially 
non-marital births. As an extension of this hypothesis, we further expect some of the racial 
and ethnic variation in the timing of this early family formation to be reduced when school 
engagement is included in the analysis. 
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Second, the effect of school enrollment and engagement on early family formation may 
differ for minority group members, relative to the relationship observed for non-Hispanic 
whites. Based on expectations from theories of minority group status, we would expect the 
effect of school enrollment and engagement on family formation to be greater for minority 
youth than non-Hispanic white youth. Based on expectations from the “blocked 
opportunities” approach, we would expect the effect of school enrollment and engagement 
on family formation to be smaller for minority youth than non-Hispanic white youth. We 
include interactions between school enrollment/engagement and race/ethnicity to examine 
these opposing possibilities. We examine three very different minority groups and anticipate 
that we may find each possibility applies for different groups. 

Third, we expect early family formation transitions to be slowest among youth who 
immigrated later in childhood, followed by those of longer residence in the United States and 
second-generation youth and fastest among youth from the third and higher generation. We 
formulate this hypothesis based on the expectations of the “minority group status” 
hypothesis and other studies of fertility across generations that suggest recent adult 
immigrants delay family formation because they are investing in human capital acquisition — 
attending school, working many hours, etc. This perspective assumes that human capital 
acquisition is incompatible with family formation. We hypothesize that this greater investment 
in human capital acquisition may diminish across generations /f children of immigrants or 
grandchildren of immigrants face less pressure to engage in human capital acquisition after 
the family has become settled in a socioeconomic position in the United States. Thus, we test 
interactions between school engagement and generation status. We also test interactions of 
generation status and race/ethnicity to determine if early family formation patterns by 
generation status vary from one group to another. 

Although most of the research examining early childbearing has focused on group 
differences for girls only, there is some research on boys as well (Michael and Tuma 1985). 
Studies looking at racial/ethnic comparisons find that black boys are more likely to make the 
transition to parenthood before age 20 than white boys (Hanson et al. 1989). However, the 
question of whether the connection between school and family formation will operate in the 
same manner for boys as girls remains unanswered. The analyses presented here will 
consider variation in the effect of school engagement on early family formation across racial 
and ethnic groups separately for boys and girls. 


Data and Methods 


We utilize an event history approach with nationally representative, longitudinal data to assess 
differences in the prevalence and timing of entrance into marriage and parenthood among 
youth from four racial/ethnic groupings (non-Hispanic white, black, Mexican origin and Asian 
origin) for both girls and boys. We present results for males and females separately for two 
reasons. First, timing and pathways to family formation are different for males and females 
and have different implications for subsequent educational attainment (Goldscheider and 
Waite 1986: Xie, et al. 2003). Thus, it seems likely that the impact of school engagement on 
their family formation may also differ. Second, the underreporting of family formation 
(particularly of non-marital fertility) is more likely for males than females (Rendall et al. 1999). 

Our data come from the first four waves of the National Education Longitudinal Study 
(NELS). The NELS survey began with a cohort of students in eighth grade in 1988. We employ 
data from three follow-up interviews conducted in 1990, 1992 and 1994. This dataset has the 
advantage of including detailed information about the family of origin as well as the 
educational and labor market activities of the youth, along with information on any early 
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transitions to parenthood and union formation. We exploit the longitudinal data by examining 
racial/ethnic variation in the timing of these key transitions. 

The data include information on when a respondent married or had a child between 1988 
and 1994. Our sample then consists of all respondents who participated in the base year 
survey (1988) through at least the second follow-up in 1992 leaving us with an initial combined 
sample of 16,489 respondents. We omit 122 respondents who had a child prior to 1988 (n = 
21) or were missing dates of first birth and/or marriage (n = 101). 

We keep the cases for which the respondent reported being non-Hispanic white (n = 
11,606), black (n = 1,608), Mexican (n = 1,305), or some Asian group (n = 989) due to the 
smaller numbers of other Hispanics (n = 687) and Native Americans (n = 172). Although there 
is considerable variation in the ethnic origins of “Asian” youth, we are constrained to use this 
broader pan-ethnic grouping due to the limited sample size. However, the much lower 
incidence of marriage or childbearing in all Asian groups suggests that this is not 
substantively inappropriate, unlike the case of other Hispanics (analysis available on request). 
The large difference between Asian family formation patterns and Mexican, the other group 
with a recent migration history increases Asians’ importance, despite the limitation of keeping 
them all in one group. Previous analyses with more ethnic groups led to similar conclusions 
as those presented here; the results are available upon request from the authors. 

These decisions leave us with 15,508 cases in our sample. Respondents excluded from the 
sample through omission or attrition are more likely (p < .05) than those remaining in the data to 
be black, male, have immigrant parents and low SES, and are less likely to live with both parents 
in 1988. Further, these missing cases tend to have lower educational expectations and test 
scores than those remaining in our sample. Overall, however, these cases remain representative 
of the experience of the youths from the four ethnic groups in the interval we study. 

We use an event history model (discrete time multinomial logit) to capture detailed timing 
of transition and the influence of relevant covariates. Once individuals have had a non-marital 
birth or marriage, they are removed from the observation set because they are no longer “at 
risk” of initial family formation. This is a particularly useful way to model marriage and birth in 
the presence of a set of time-varying covariates, and events that are reflected on an annual 
scale (Goldstein, White and Goldstein 1997; White, Moreno and Guo 1997). We employ 
multinomial logistic regression in STATA adjusting for the design effects, i.e. svymlogit. 


Dependent Variable 


We rely on multinomial logistic regression models to predict the likelihood of getting married, 
having a child or no family formation in late adolescence. In order to construct the dependent 
variable, the data are converted to person-year format, such that a respondent who remained 
in the data for the entire length of the survey period (1988-94) would contribute seven 
observations. This includes one observation for each year from 1988 through 1994 because 
events prior to 1988 are not included. 

Using accounts of date of first birth or first marriage, we are able to ascertain when an 
individual experienced an event. We model marriage and parenthood with a multinomial logistic 
model based on whichever event occurs first. Recognizing that decisions about school 
enrollment could be made on the basis of an impending birth or marriage rather than the event 
itself, we lag the timing of the birth or marriage by nine months. In other words, a respondent's 
characteristics are measured at the approximate time of conception rather than the birth itself 
(Manlove 1998). Dating the length of an engagement is less precise, but this seems a 
reasonable mid-point between very long engagements and impulse weddings. In this way, we 
model the effect of school enrollment on the likelihood of conceiving a child that will be carried 
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to term and not the likelihood of leaving school after conception has occurred; similarly we 
model school enrollment on the likelihood of planning a marriage in nine months or less. 


Independent Variables 


The background measures are constructed from variables in the first wave of the study, 1988, 
when the respondents were in eighth grade. These measures include basic demographic 
information, home environment, living standards and generation status. We provide weighted 
descriptive statistics on these measures for the sample as a whole and separately for our key 
groups (Table 1). There are no substantive differences by gender in these measures. 

Demographic measures include the respondent's age, socioeconomic status and family 
structure in 1988, our first year of observation. Higher socioeconomic status is associated 
with a lower likelihood of non-marital birth and should also be associated with a delay in 
marriage in adolescence and very early adulthood (South 2001). Socioeconomic status is 
represented by a standardized index covering several measures including parents’ education, 
occupation and family income, which has a mean of 0 and a standard deviation of 1. Family 
structure is a categorical variable based on the respondent's living arrangements in 1988: 
respondent lives with (1) mother and father (reference group); (2) one parent and the parent's 
partner (married or cohabiting); (3) single mother; (4) other family type. 

Generation status is constructed from questions regarding parents’ and respondents’ 
place of birth. First generation respondents are those who were born outside the United 
States and who entered the United States in the six years prior to 1988. Our 1.5-generation 
respondents are those who were born outside the United States but who entered the country 
earlier. Thus, most of our 1.5-generation respondents entered the United States before or 
during the very first years of formal schooling. Second-generation respondents are those born 
in the United States with at least one foreign-born parent and those in the third and higher 
order generation are those born in the United States to US-born parents. There is 
considerable racial/ethnic variation in generation status. More than 40 percent of the Asian- 
origin youth are foreign born — from the first or 1.5 generation —- compared with approximately 
15 percent of Mexican origin youth and only 1 to 2 percent of blacks and whites. 

We employ several measures of support and encouragement for school engagement. 
Parental educational expectations for the child are directly measured in a parent survey 
administered in 1988. While most parents in all groups expect their children to at least attend 
college (75-90 percent), there are larger differences in the proportions who expect them to 
complete college or go further with their education (78 percent of Asian origin youths 
compared with 62 percent of non-Hispanic white youths, 53 percent of black youths, and 39 
percent of Mexican origin youths). Our multivariate analyses include parental expectations 
as a single continuous variable (range of values from 1-6) because preliminary analyses reveal 
an underlying linear relationship. 

We also expect other parent actions to transmit a commitment to schooling (Moore et al. 
1998). We include measures of direct parent-child interaction with a scale that measures how 
often a child discusses school-related issues with parents: programs at school, things studied 
in class and school activities. The scale ranges from 0 to 6 with a value of 0 indicating 
respondents never discussed any of the three items with their parents and a value of 6 
indicating respondents frequently discussed all three items with their parents. 

We include a measure of the score on a standardized academic test administered in the 
base year (1988) to control for prior educational performance, which could influence both 
parental expectations and the overall likelihood of youth continuing in school. We observe higher 
scores for white youth and those of Asian origin than black youth and those of Mexican origin. 
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Table 1: Summary Statistics 
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Table 1: Summary Statistics (continued) 





Non- 
Hispanic Mexican Asian 
White Black origin origin - Total 
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Source: National Education Longitudinal Study, 1988-1994. 
Note: Frequencies are weighted. Standard deviations are in parentheses. , 
*p<.05 **p<.01 ***p < .001. T-tests reflecting significant differences from non-Hispanic 
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Our time-varying measure of school engagement reflects the predictions of our conceptual model 
in which school enrollment and level of engagement among those enrolled exert independent effects 
on the timing of family formation and reduce the differences in family formation by generation status 
and race/ethnicity; the effect of school enrollment and engagement may also differ by generation 
status and race/ethnicity (Manlove et al. 2000). Our measure of school engagement distinguishes 
among those not enrolled in school and, for those enrolled, those more and less engaged. First, we 
address enrollment status using retrospective data for the interim between waves. A respondent was 
coded as being in school if he or she did not drop out of school in.a given calendar year. If the 
respondent did leave school ina given year, but then returned to school within the year, he or she was 
then coded as being in school. No cases are censored for the enrollment variables. Those who left 
school in any year of our data and returned to school in a subsequent year (n = 1,101) were coded as 
not enrolled in the year they left school and enrolled in the year they returned. 

We then further distinguish among respondents enrolled in school by the extent to which 
students are prepared for class. NELS respondents are asked to report the frequency with 
which they attend class without pencil, paper, books or homework. These same questions are 
asked in 1988, 1990 and 1992. We have confidence that these items are measuring the same 
concept: the three items are correlated with each other in each year with a range from .62 to .77. 
Students who report that they seldom or never attend class without paper, pencils, books or 
homework are coded as having a higher level of engagement in school. These respondents are 
more prepared for class than their counterparts who, although enrolled in school, report that 
they often or sometimes show up for class without these materials. For our purposes, 
engagement in school has three categories: not enrolled, lower engagement (i.e., enrolled but 
sometimes unprepared for class) and higher engagement (i.e., enrolled and seldom or never 
unprepared for class). We use all three years of questions thus creating a time-varying measure 
of engagement for respondents enrolled in school. We assume that students who remain in 
school during the non-survey years are remaining at their previous level of engagement. In all 
years, we are able to account for those who drop out of school altogether. 

Our primary hypotheses are focused on school as an activity that should delay family 
formation, not the level of schooling itself. Nonetheless, we note that our measure of enrollment 
in school distinguishes high school enrollment (primarily prior to 1992) from postsecondary 
school enrollment (particularly in the years after 1992). Thus, we may expect lower school 
enrollment and greater family formation after 1992. We also examine whether part-time post- 
secondary enrollment differs from full-time enrollment in predicting the timing of marriage or 
non-marital birth. Part-time enrollment in post-secondary education may be more compatible 
with family formation than full-time enrollment. School engagement in 1993 and 1994 is also 
measured with three categories: not enrolled, lower engagement (i.e., enrolled parttime in post 
secondary education) and higher engagement (i.e., enrolled fulltime in post secondary 
education). This allows us to use the same categorical variable across years, simplifying the 
number of interaction terms. Preliminary analyses (not shown) indicate that separating the 1993 
and 1994 engagement variables from the engagement measures for 1988-1992 does not yield 
substantively different results from those we present here. 

There is some racial/ethnic variation in engagement, as seen in Table 1. Asians are the 
most likely to be highly engaged in school throughout the observation period and Mexican- 
origin and black respondents are less likely to be highly engaged than others or to be enrolled 
at all, particularly after 1992. To account for a possible change in schooling after high school 
is completed, we include a variable for high school graduation that reflects the respondent's 
status as a graduate. The variable takes a value of 0 until the year the respondent graduates 
from high school at which time the variable takes on a value of 1 for the remaining years of 
observation. This captures the post 1992 effect, because the majority of those who graduate 
from high school do so in 1992. 
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Results 


We begin our observation when the NELS respondents are in eighth grade in 1988 and our final 
observation occurs seven years later in 1994. Although the majority of respondents do not get 
married or have a child in the period of observation, those who do are more likely to become young 
parents than young spouses. Nearly 15 percent of all girls formed families by having a birth by 1994 
and 8 percent married first, leaving approximately 78 percent without any family formation. Boys 
were even less likely to have a family formation event by 1994 (11 percent). Like girls, however, 
births were the more common way to begin families than marriage (7 percent vs. 4 percent). 
There is also considerable racial/ethnic variation in family formation. Asian-origin 
respondents, male and female, have the lowest levels of early family formation, followed by non- 
Hispanic whites (only 10 percent of Asian-origin students have any family formation by 1994). 
The overwhelming majority of black respondents who experience any family formation by 1994 
do so by becoming parents (31 percent of girls and 15 percent of boys), with few marrying first 
in the period of observation. Mexican-origin youth are more likely than other groups to marry 
before becoming parents (12 percent of girls and 8 percent of boys). Mexican-origin girls are 
almost as likely as black girls to have a non-marital birth as their route to family formation. 


Figure 1. Proportion of Respondents with Family Formation by Enrollment Status and Year 
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The timing of early family formation varies by enrollment status as well as shown in Figure 
1. By 1991, the likelihood of family formation remains fairly small among those who are 
enrolled in school, while the likelihood of family formation among those not enrolled in school 
increases dramatically. By 1994, slightly less than half of the girls not enrolled in school have 
either married or had a first non-marital birth compared with fewer than 10 percent of those 
enrolled in school. Although family formation is much less prevalent among boys, we see 
similar patterns for them as well. 

The multinomial logistic regression models demonstrate how much of the variation in 
family formation is explained by race/ethnicity, family background, generation status and 
parental involvement and how much of this difference is mediated by enrollment and by level 
of engagement in school (i.e., lower engagement where respondents are enrolled but 
frequently unprepared for class and higher engagement where respondents are enrolled but 
rarely unprepared for class). Table 2 presents the results girls and Table 3 replicates the 
models for boys. The following discussion focuses on the results for both girls and boys 
when they are similar; we note where the results differ substantially. 

Model 1 of these two tables simply includes measures of time and the racial/ethnic 
indicators. This model shows statistically significant variation in timing of family formation 
across racial/ethnic groups with less marriage among black respondents compared to non- 
Hispanic whites and a considerably greater likelihood of a non-marital birth for blacks and 
those of Mexican origin than for non-Hispanic whites. There is some variation by gender 
among Asian origin respondents, as girls but not boys are less likely than non-Hispanic whites 
to make the transition to a non-marital birth. 

The results of Model 2, now with family background and generation status, demonstrate 
that some of the racial/ethnic differences are reduced by the inclusion of family background 
and SES, particularly for boys. As we expected, higher socioeconomic status slows entry to 
family formation with an increase in one standard deviation of SES reducing the likelihood of 
non-marital birth or marriage. Family structure also matters. Consistent with previous 
research, coming from a home with a parent and a new partner speeds the very early family 
formation we are able to observe here (Cherlin et al. 1995; Wu and Thompson 2001). Further, 
girls from single-mother households are more likely to have a non-marital birth and are less 
likely to marry than girls from two-parent households. 

We also observe that most racial/ethnic differences persist when generation status is 
included. Models excluding generation status (not shown) reveal that the reductions in the size of 
the racial/ethnic coefficients are primarily due to socioeconomic status and family structure, not 
generation status. This indicates that group differences in the duration in the United States or 
proximity to the migration experience are not related to the differential pace of family formation. 

However, consistent with our expectations, generation status does have its own 
independent relationship with early family formation. Overall, the 1.5-generation girls and 
second-generation girls are less likely to experience a marriage or a non-marital birth than 
either first- or later-generation girls. Among boys, only the first and 1.5 generations are 
distinctive, with substantially slower timing to a non-marital birth. Thus, for boys we find some 
support for the hypothesis that first generation youth will be less likely to make early family 
formation transitions, particularly non-marital childbearing, than those in the higher 
generations but the result does not suggest support for a segmented process of increasing 
off-time events for second-generation youth in the case of early family formation. Also notable 
is the fact that some significant racial/ethnic differences persist (particularly for girls) despite 
the fact that more Asian-origin and Mexican-origin respondents are represented in the first-, 
1.5- and second-generation groups. 

In the next model, we find strong effects of our measures of school engagement. Higher 
parental expectations are associated with slower family formation via birth for boys and girls 
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and marriage for boys. However, we observe no effect of parent-child interaction for girls; for 
boys interaction is associated with a lower likelihood of making the transition to parenthood. 
Previous academic performance, here measured with 1988 standardized test scores, also 
matters such that better academic performance is associated with less likelihood of early 
family formation for girls. It appears that school engagement measured through high 
performance and high expectations is associated with a lower likelihood of birth or marriage 
throughout late adolescence. 

The results also demonstrate that enrollment/level of engagement are important for 
predicting the timing of early family formation, as we originally hypothesized. School 
enrollment is significantly and negatively associated with marriage or birth among both girls 
and boys. Likewise, the level of engagement in school also helps predict the timing of early 
family formation. Girls and boys who exhibit high engagement are the least likely to 
experience marriage or non-marital birth. We examined the possibility that high school 
enrollment and college enrollment might have different effects on family formation timing 
since some of the school enrollment we observe after 1992 was college enrollment. We do 
not find any significant differences in the effect of enrollment in high school or college. Our 
results strongly suggest that it is the intensity or degree of engagement in school at both the 
secondary and post-secondary levels of schooling that most strongly predict the timing of 
early family formation events. 

Consistent with our expectations, there are some changes in the racial/ethnic coefficients 
when we moved from Model 2 to Model 3, although they are modest. Controlling for these 
measures of school engagement, Asian-origin girls are no longer significantly different from 
non-Hispanic whites. There is also a small reduction in the coefficient for Mexican-origin and 
black girls, but they are still more likely than whites to make the transition to first birth. The 
picture is less clear for boys. Black boys are still less likely to marry and more likely to have a 
non-marital birth when compared to non-Hispanic whites. 

There is also some change in the generation status coefficients when we include school 
engagement in the models. We now observe that the likelihood of early non-marital 
childbearing for 1.5- and second-generation girls and 1.5-generation boys is no longer 
significantly different from that of the third or higher generation. These results suggest 
support for our hypotheses that school engagement is part of the process that produces 
some generation status differences and, albeit more modestly, some differences by 
race/ethnicity. 

Our theoretical discussion suggested that the importance of engagement with school in 
delaying family formation should vary across racial/ethnic groups and by generation status, 
although the theories differ in their predictions. Our final model (Model 4) presents the 
interactions of level of school engagement with race/ethnicity. We find a significant interaction 
between race/ethnicity and school engagement for Mexican origin girls, for whom a high level 
of school engagement poses less of an impediment to marriage compared to non-Hispanic 
white girls, consistent with the “blocked opportunities” approach. The only other interactions 
to achieve statistical significance are those for black boys’ family formation transitions 
(particularly for non-marital childbearing). These results are also consistent with the blocked 
opportunities hypothesis in which school engagement has a smaller delaying effect on family 
formation for some minority youth. 

Based on the expectation that second generation youth may have less attachment to 
schooling and hence earlier family formation than immigrant youth, we tested interactions 
between generation status and school enrollment. Likelihood ratio tests comparing models 
with interactions to those without interactions are significant, indicating interactions are 
adding explanatory power. The models reveal a significant negative interaction for second- 
generation girls, suggesting enrollment in school has a greater impact on delaying their 
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family formation than for girls from the third or higher generation. In other words, second- 
generation girls who are enrolled in school are even less likely than third- or higher- 
generation girls enrolled in school to engage in early family formation, particularly non- 
marital birth. This is consistent with prior research and our interpretation of the minority 
group hypothesis. 

We also observe a negative interaction between school enrollment and marriage among 
the first-generation girls. However, the number of events (births or marriages), and cell 
sizes, particularly for interactions with first generation youth, are not large enough to give 
us confidence in the inferential power of the results of the interactions with generation 
status. The same holds for the interactions of generation status and race/ethnicity. 
Although we do not present predicted values (see below) based on the interactions with 
generation status, the interactions of race/ethnicity and enrollment status bolster the 
suggestion that engagement in school does differentially influence early family formation 
warranting further examination. 

Interpreting the pattern of family formation by race/ethnicity and enrollment status is 
simplified by examining the predicted probabilities found in Figure 2 for non-marital births and 
Figure 3 for marriage. The cumulative probabilities of either birth or marriage are clearly lower 
for boys than girls. Further, the patterns for boys are less differentiated by enrollment status 
or level of engagement in school, graphically demonstrating the smaller relationship between 
school enrollment and family formation for boys than girls. 

Another clear pattern found in Figure 2 is the lower probability of non-marital births among 
Asian-origin and white youth than Mexican-origin or black youth, particularly for girls. We 
observe a higher predicted probability of non-marital births for girls who are not enrolled in 
school than for those enrolled in school, regardless of level of school engagement, for all 
race/ethnic groups. We also note that non-marital births are most likely as a first family 
formation event among Mexican-origin and black girls who are not enrolled in school followed 
by white girls. Asian-origin girls have the lowest probability of early family parenthood: Asian- 
origin girls not in school are even less likely to have a birth than Mexican origin or black girls 
who are in school. There is also some variation by level of school engagement. The lowest 
probabilities of non-marital births occur among Asian-origin girls who are highly engaged in 
school. 

Among boys, there is much less difference in the probability of becoming a parent 
between those enrolled vs. not enrolled in school, and the ethnic patterns are somewhat 
different, particularly among those not enrolled in school. Those of Mexican-origin have 
somewhat lower probabilities of becoming a father than blacks. Asian-origin boys are 
indistinguishable from black boys when examining the results for those not enrolled in school. 
Likewise, the lines for Asian and white boys who are highly engaged in school are also 
indistinguishable from one another. As a result, gender differences are greatest among those 
of Mexican origin, with high levels for girls but not boys. 

The predicted probabilities of marriage for girls and boys (Figure 3) also demonstrate 
differences across racial/ethnic groups. Here we observe that Asian origin and white girls who 
are not enrolled in school are more likely to get married than their co-ethnic peers who are 
enrolled in school. However, the significant interaction with school engagement suggests a 
greater likelihood of marriage among Mexican origin girls enrolled in school with very little 
difference between those who are enrolled but exhibit lower or higher school engagement. 
Thus the relationship between school enrollment and marriage operates differently among 
Mexican origin girls than the other groups. Black girls and boys are the least likely to marry. . 
For boys, the differences by level of school engagement are less pronounced but we still 
observe a greater probability of marriage for those who are not in school among those of 
Asian origin, Mexican origin and whites. 


Educational Engagement and Early Family Formation + 1409 


Figure 2: Predicted Probabilities of Nonmarital Births, 1988-1994 
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Figure 3: Predicted Probabilities of Marriage, 1988-1994 


Cumulative Predicted Probability of Marriage, Girls 









































336 
oO 
25 
.20 
a 
AQ 
05 + 
.00 J 
1988 1989 1990 1991 1992 1993 1994 
.--m--- Mexicans Low Engagement ——s— Mexicans High Engagement 
—s. - Mexicans Not Enrolled .--x--- Blacks Low Engagement 
——.— Blacks High Engagement — >« = Blacks Not Enrolled 
.--@--- Whites Low Engagement —»— Whites High Engagement 
—~ - Whites Not Enrolled ---©--- Asians Low Engagement 
—e— Asians High Engagement —o. - Asians Not Enrolled 
Cumulative Predicted Probability of Marriage, Boys 
.35 



































1988 1989 


1990 ° 


1991 


1992 1993 1994 





.--m--- Mexicans Low Engagement 
—s. - Mexicans Not Enrolled 
— + — Blacks High Engagement 
.--@--- Whites Low Engagement 
—~¢ - Whites Not Enrolled 
—»— Asians High Engagement 





——n— Mexicans High Engagement 
.----- Blacks Low Engagement 
—»« ~ Blacks Not Enrolled 
—e— Whites High Engagement 
---@--- Asians Low Engagement 
—vo. - Asians Not Enrolled 








Educational Engagement and Early Family Formation * 1411 


Discussion 


Our analysis sheds considerable light on ethnic differences in the relationship between 
education and family formation. We confirm that early family formation varies considerably 
across all four broad groups compared here. Differences by ethnicity in family formation are 
appreciable, even when family background, generation status and other characteristics are 
controlled. Activities enhancing socioeconomic attainment such as schooling are clearly 
associated with slower family formation transitions overall. But, the increasing ethnic diversity 
of the United States has broadened the path to family formation among adolescents and 
young adults. Some family roles appear to be more consistent with the student role for some 
groups (i.e., Mexican-origin girls and marriage) than others. 

Taking our hypotheses in turn, we find support for our hypothesis that higher parental 
educational expectations, superior previous school performance and school enrollment and 
engagement are negatively associated with early family formation. The predicted probabilities 
of early birth or marriage among those enrolled in school are appreciably lower than for those 
who are not enrolled. Additionally, those students who are prepared for class consistently in 
high school or who are enrolled fulltime in post-secondary institutions are less likely to make 
the transition to early family formation (marriage or non-marital birth) than those who are 
enrolled but are often unprepared for class in secondary school or are enrolled only parttime 
in post-secondary schools. This finding coupled with the importance of parental expectations 
and parent-child interactions as well as previous test scores suggests that respondents 
whose academic careers are supported and rewarded are also likely to have slower family 
formation, at least in their adolescent years. 

We also find that racial and ethnic differences in the timing of early family formation are 
somewhat reduced with the inclusion of schooling variables. Differences between Asian 
origin and non-Hispanic white girls are moderated, in particular. Future research with larger 
samples of Asians may be able to detect variation within this pan-ethnic group providing a 
greater understanding of which national origin groups are the least likely to engage in early 
family formation. We might also gain greater insight into the extent to which this relationship 
was apparent among boys as well as girls. With our data, few Asian boys engage in any 
family formation making it difficult to draw substantive conclusions for them. 

The variations in early family formation by schooling and by race are more consistent with 
those studies suggesting schooling presents less of an impediment to early family formation 
among some groups than others, the “blocked opportunities” approach. This is particularly 
the case for girls. For example, for Mexican-origin girls there is clearly a more tenuous 
connection between school and delay of family formation than for other girls. The persistence 
of significant racial/ethnic differences in early family formation, particularly for blacks and 
Mexicans, however, implies that the “blocked opportunities” approach (i.e., that minority 
youth become disengaged from school and, in turn, increase their likelihood of early family 
formation) is at best a partial explanation. Further, if we had a larger sample of Asian-origin 
youth our results suggest that this group, at least, may conform instead to the “minority group 
status” hypothesis such that school enrollment is more strongly associated with delays in 
family formation than for the other groups we can compare here. 

We also set out to evaluate how the legacy of immigration might be manifest through 
variation in generation status. We observe that even in an analysis that controls for a host of 
personal and family characteristics in addition to ethnicity, variation across generation — an 
obvious legacy of recent immigration — can still be detected. The “minority group status” 
hypothesis, that pursuit of higher socioeconomic attainment (through schooling) results in 
slower family formation, receives partial support here for our analyses by generation status 
when examining adolescent girls. Here, the second generation (along with the 1.5 generation 
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among girls) has the slowest transitions to family formation of the groups we examine. 
Further, the sharp cross-generation differences in the likelihood of having a birth are reduced 
when measures of school involvement are introduced. Thus, our results demonstrate that the 
lower likelihood of early non-marital childbearing in particular among 1.5- and second- 
generation girls (vs. 3rd and higher generation girls) is partially explained by their greater 
participation in school throughout early and late adolescence. Boys’ early family formation 
also varies by generation status but in this case it is the first generation's lower likelihood of 
non-marital childbearing that persists. Thus, immigrant boys appear less likely to take this non- 
normative path to family formation than their third and higher generation counterparts even 
when school engagement and background traits are taken into account. Perhaps the factors 
that influence family formation timing are not as closely linked to the factors that influence 
schooling for immigrant boys as they are for girls. 

This analysis only observes very early family formation. This is a crucial window, however, 
because such transitions during adolescence have been characterized as an off time event. 
Young people in their late teens and early 20s in the United States are expected to be in 
school, and the role of student is viewed as inconsistent with the role of spouse or parent. 
Racial/ethnic patterns might be different among those who obtain more education and wait 
longer to form families. Nevertheless, among these groups, it is clear that no single 
hypothesis or unitary theory is sufficient to capture the range of patterns visible among 
immigrant groups and their descendents in the United States. 


References 


Abrahamse, Allen F, Peter A. Morrison and Linda J. Waite. 1988. “Teenagers Willing to Consider Single 
Parenthood: Who is at Greatest Risk?” Family Planning Perspectives 20:13-18. 


Astone, Nan Marie, and Dawn M. Upchurch. 1994. “Forming a Family, Leaving School Early, and Earning a 
GED: A Racial and Cohort Comparison.” Journal of Marriage and Family 56:759-771. 


Beutel, Ann M. 2000. “The Relationship between Adolescent Nonmarital Childbearing and Educational 
Expectations: A Cohort and Period Comparison.” 7he Sociological Quarterly 41:297-314. 


Brewster, Karin L. 1994. “Race Differences in Sexual Activity among Adolescent Women: The Role of 
Neighborhood Characteristics.” American Sociological Review 59: 408-424. 


Cherlin, Andrew J., Kathleen E. Kiernan and P Lindsay Chase-Lansdale. 1995. “Parental Divorce in Childhood 
and Demographic Outcomes in Young Adulthood.” Demography 32:299-318. 


East, Patricia L. 1998. “Racial and Ethnic Differences in Girls’ Sexual, Marital, and Birth Expectations.” Journa/ 
of Marriage and the Family 60:150-162. 


Espenshade, Thomas J., and Wenzhen Ye. 1994. "Differential Fertility within an Ethnic-minority: The Effect 
of Trying Harder among Chinese-American Women.” Socia/ Problems 41:97-113. 


Forste, Renata, and Marta Tienda. 1992. “Race and Ethnic Variation in the Schooling Consequences of 
Female Adolescent Sexual Activity." Socia/ Science Quarterly 73:12-30. 


. 1996. “What's behind Racial and Ethnic Fertility Differentials?” Population and Development Review 
22:109-133 Suppl. 


Geronimus, Arline T., and Sanders Korenman. 1992. “The Socioeconomic Consequences of Teen 
Childbearing Reconsidered.” Quarterly Journal of Economics 107:1187-1214. 


Glick, Jennifer E., and Michael White. 2004. “Post-Secondary School Participation of Immigrant and Native 
Youth: The Role of Familial Resources and Educational Expectations." Socia/ Science Research 
33:272-299. 


Educational Engagement and Early Family Formation * 1413 


Goldenberg, Claude, Ronald Gallimore and Leslie Reese. 2001. “Cause or Effect? A Longitudinal Study of 
Immigrant Latino Parents’ Aspirations and Expectations and their Children’s School Performance.” 
American Educational Research Journal, 38: 547-82. 


Goldscheider, Frances K., and Linda J. Waite. 1986. “Sex Differences in the Entry into Marriage.” American 
Journal of Sociology 92: 91-109. 


Goldscheider, Calvin, and Peter Uhlenberg R. 1969. “Minority Group Status and Fertility." American Journal 
of Sociology 74:361-372. 


Goldstein, Alice, Michael J. White and Sidney Goldstein. 1997. “Migration, Fertility, and State Policy in Hubei 
Province, China.” Demography 34:481-491. 


Hanson, Sandra L., Donna Ruane Morrison and Alan Ginsburg L. 1989. “The Antecedents of Teenage 
Fatherhood.” Demography 26:579-596. 


Hao, Lingxin, and Melissa Bonstead-Bruns. 1998. “Parent-Child Differences in Educational Expectations and 
the Academic Achievement of Immigrant and Native Students.” Sociology of Education 71:175-198. 


Haveman, Robert, Barbara Wolfe and James Spaulding. 1991. “Childhood Events and Circumstances 
Influencing High School Completion.” Demography 28:133-157. 


Hockaday, Cathy, Sedahlia Jasper Crase, Mark C. Shelley Il and Dahlia F Stockdale. 2000. A Prospective 
Study of Adolescent Pregnancy.” Journal of Adolescence 23:423-438. 


Hoffman, Saul D., E. Michael Foster and Frank F Furstenberg Jr. 1993. “Reevaluating the Costs of Teenage 
Childbearing.” Demography 30: 1-13. 


Hogan, Dennis P, and Nan Marie Astone. 1986. “The Transition to Adulthood.” Annual Review of Sociology 
12:109-130. 


Hogan, Dennis P, and Evelyn M. Kitagawa. 1985. “The Impact of Social Status, Family Structure, and 
Neighborhood on the Fertility of Black Adolescents.” American Journal of Sociology 90:825-855. 


Hossler, Don, and Frances K. Stage. 1992. “Family and High School Experience Influences on the 
Postsecondary Educational Plans of 9" Grade Students.” American Educational Research Journal 
29:425-451. 


Jencks, Christopher, James Crouse and Peter Meuser. 1983. “The Wisconsin Model of Status Attainment: 
A National Replication with Improved Measures of Ability and Aspiration.” Sociology of Education 
56:3-19. 


Kalmijn, Matthijs. 1994. “Mothers Occupational Status and Children’s Schooling.” American Sociological 
Review 59:257-275. 


Kao, Grace. 2000. “Group Images and Possible Selves among Adolescents: Linking Stereotypes to 
Expectations by Race and Ethnicity.” Sociological Forum 15: 407-430. 


Lareau, Annette. 1989. Home Advantage: Social Class and Parental Intervention in Elementary School. The 
Falmer Press 


Manlove, Jennifer. 1998. “The Influence of High School Dropout and School Disengagement on the Risk of 
School-age Pregnancy.” Journal of Research on Adolescence 8: 187-220. 


Manlove, Jennifer, Elizabeth Terry, Laura Gitelson, Angela Romano Papillo and Stephen Russell. 2000. 
“Explaining Demographic Trends in Teenage Fertility, 1980-1995.” Family Planning Perspectives 
32:166-190. 


Mare, Robert D., and Christopher Winship. 1991. “Socioeconomic Change and the Decline of Marriage for 
Blacks and WhitesPp.175-202. C. Jencks and PE. Peterson, editors. The Urban Underclass. The 


Brookings Institute. 


1414 « Social Forces Volume 84, Number 3 * March 2006 


Marini, Margaret Mooney. 1984. “Women's Educational Attainment and the Timing of Entry into 
Parenthood.” American Sociological Review 49:491-511. 


Michael, Robert T., and Nancy Brandon Tuma. 1985. “Entry into Marriage and Parenthood by Young Men and 
Women: The Influence of Family Background.” Demography 22:515-544. 


Moore, Kristin A., David E. Myers, Donna Ruane Morrison, Christine Winquist Nord, Brett Brown and Barry 
Edmonston. 1993. "Age at First Childbirth and Later Poverty.” Journal of Research on Adolescence 
34: 393-422. ‘ 


Moore, Kristin A., Jennifer Manlove, Dana A. Glei and Donna Morrison. 1998. “Nonmarital School-Age 
Motherhood: Family, Individual and School Characteristics.” Journal of Adolescent Research 13: 
433-457. 


Olsen, Randall J., and George Farkas. 1989. “Endogenous Covariates in Duration Models and the Effect of 
Adolescent Childbirth on Schooling.” Journal of Human Resources 24: 39-53. 


Portes, Alejandro, and Dag MacLeod. 1996. “Educational Progress of Children of Immigrants: The Roles of 
Class, Ethnicity and School Context.” Sociology of Education 69:255-275. 


Portes, Alejandro, and Rubén G. Rumbaut. 2001. Legacies: The Story of the Immigrant Second Generation. 
University of California Press. 


Rendall, Michael S., Lynda Clarke, Elizabeth Peters, Nalini Ranjit and Georgia Verropoulou. 1999. “Incomplete 
Reporting of Men's Fertility in the United States and Britain: A Research Note.” Demography 
36:135-144. 


Ribar, David C. 1994. “Teenage Fertility and High School Completion.” 7he Review of Economics and 
Statistics 76:413-424. 


Rindfuss, Ronald R. 1991. “The Young Adult Years: Diversity, Structural Change and Fertility.” Demography 
28:493-512. 


South, Scott J. 2001. “The Variable Effects of Family Background on the Timing of First Marriage in the 
United States, 1969-1993.” Social Science Research 30:606-626. 


Stanton-Salazar, Ricardo D. 1997. “A Social Capital Framework for Understanding the Socialization of Racial 
Minority Children and Youths.” Harvard Educational Review, 67:1-40. 


Stier, Haya, and Marta Tienda. 1997. “Spouses or Babies? Race, Poverty and Pathways to Family Formation 
in Urban America.” Ethnic and Racial Studies 20:91-122. 


Thornton, Arland, William G. Axinn and Jay D. Teachman. 1995. “The Influence of School Enrollment and 
Accumulation of Cohabitation and Marriage in Early Adulthood." American Sociological Review 
60:762-774. 


Upchurch, Dawn M., James M. McCarthy and Linda Ferguson. 1993. “Childbearing and Schooling: 
Disentangling Temporal and Causal Mechanisms.” American Sociological Review 58:738-740. 


Upchurch, Dawn M., Carol S. Aneshensel, Jyoti Mudgal and Clea Sucoff McNeely. 2001. “Sociocultural 
Contexts of Time to First Sex Among Hispanic Adolescents.” Journal of Marriage and Family 
63:1158-1169. 


White, Michael J., Lorenzo Moreno and Shenyang Guo. 1995. “The Interrelationship of Fertility and 
Geographic Mobility in Peru: A Hazards Model Analysis." /nternational Migration Review 29:492- 
514. 


Wu, Lawrence L., and Elizabeth Thomson. 2001. “Race Differences in Family Experience and Early Sexual 


Initiation: Dynamic Models of Family Structure and Family Change.” Journal of Marriage and Family 
63:682-696. 


Educational Engagement and Early Family Formation + 1415 


Xie, Yu, James M. Raymo, Kimberly Goyette and Arland Thornton. 2003. “Economic Potential and Entry into 
Marriage and Cohabitation.” Demography 40:351-67. 


Zhou, Min, and Carl L. Bankston. 1994. “Social Capital and the Adaptation of the Second Generation: The 
Case of Vietnamese Youth in New Orleans." /nternational Migration Review 28:821-45. 


ae tak Oe re A ants el et © arene en) 
7 


ihe Pare, bet E % vernonia a 
Corns a ebaate per 
, 


4 


Mie rik GY Ge Ueveeee in, Ace: a heranews fe teary 











tan asco tinamt ata “ a 


an) 


. t atic as ee 


arya s SSF Mart ae Bir pess ee ’ ate et OS ee Phil a 


‘ aan one 


A 


ere ee ek gh ren = eA LAN i : ns 
Bote ed CONTR nn eRe “aa ree eee 


~ 7 ; r ra 
aw oe » ; 4 om ce > ae ome ; " $0 a 

ae 4 ats 1% yes =e 

Bins = YF 









> 
es 
i 


“einniihas Wribel Ae Site el Pitaeecten). Uatiyh We Sh ly ed ae sili s 
ih pare Bd wey EMEA os. a eel a hang CO Team 
i i 5 As 

Ths + S os 


~ = 


ge ae 


hee pos! i Wa Gare: a eta ee ee 
a ig” Pe s Mine = te S84: a° ey 
























ape nas) | web We tte Petre eae dooce Sei 
o* 2 fs ie eH vm” > nae iit Ge FIRE 


; 7 
Orn ~~ 6 1 inv, Ariel ed MATE Arey ¢? > dba shared 
» , “i+ ) ’ 
ee ee «< Viegatierge, (aps Loot ale euler gaa ae 
baptist ae + * ger? Jat eu ig Ae ee ; 
- = -~?, = Shy » 
~. [ 7 : a 
we Oia © Re “hajoencl waits rar cy id BS 
e- ..~ be 7 ‘ " 7 


~*~ 
a ; : \ vy sche 


SN St I i hi ee a Dyes pl: <i i a 


inteey On) OQ OY Wee Rie ey ee 


2 * * 


sae eae Ser oes 


nag On ia aoe 


the irees sty: lh, Se doers Satie! thie nares 
NidAcnen [eet «hoa Sue, “eb SB 


> 2h ter, FAG nar engag tae sity 
eerie 4 v iaires nmeoan ‘ay Aan 


Pe Bat 
athait, Wage) sedi ode Stearn Lee 

EN ene 

t ‘7 eh ; 





‘ ab 0 





Spanish Maintenance among English-Speaking Latino Youth: 
The Role of Individual and Social Characteristics 
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Abstract 

This paper investigates the effects of individual, family, social and demographic 
characteristics on the maintenance of Spanish among English-speaking Latino youth. This 
research finds effects of generation, gender, race, parent’s English proficiency, single-parent 
status, parental income, and neighborhood concentration of co-ethnics as well as combined 
effects of race and gender on Spanish oral proficiency. The findings presented here suggest 
support for elements of the assimilation and the segmented assimilation theoretical 
perspectives as well as the race-gender experience theory. The author suggests that Spanish- 
speaking proficiency may be associated with opportunities to speak Spanish that are 
structured differently, not only by family and neighborhood contexts that allow for greater 
or lesser contact with Spanish, but also by gender and race. 


Language preservation and loss are at the core of debates about contemporary American 
national and ethnic identities. Within the context of the United States, the factors driving 
English-language acquisition have been more readily researched and understood than those 
driving the maintenance of minority languages. However, while theoretical traditions and 
research on language incorporation of immigrants have often focused on language shift — that 
is, a shift from one language to another over generations (Fishman 1972, Lopez 1978, Stevens 
1985) — contemporary research has included discussions of the maintenance of an ethnic 
mother tongue /n addition to English (Alba 1999, Lopez 1999, Portes and Rumbaut 2001). 
Little is known about the individual, social and demographic characteristics that affect 
maintenance and proficiency in an ethnic mother tongue. As such, this article investigates the 
maintenance of Spanish among English-speaking Latino youth and finds that generational, 
gender, race, family and community contexts are key in predicting oral proficiency in Spanish.’ 

Research on contemporary immigrant groups suggests a rapid shift to English as a usual 
or home language (Alba et al. 2002, Veltman 1988), but also a trend toward bilingualism, 
particularly among Latinos (Alba 1999, Lopez 1999, Portes and Schauffler 1994, Rumbaut 
1996). The use of Spanish at home does not necessarily imply a lack of English skills, and 
Rumbaut (1999) notes that all but the poorest and least educated migrants have some English 
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proficiency before arrival in the United States. Further, Stevens (1992) argues, “A high level of 
proficiency in English need not stop native-born Americans from also using a non-English 
language.” (p. 174) Recent research indicates the possibility for sustained bilingualism to be 
higher for Latinos than for other high immigration groups such as Asians (Alba et al., 2002; 
Lopez 1999; Portes and Schauffler 1994; Rumbaut 1996). According to the 2002 National 
Survey of Latinos, 24 percent of first-generation, 47 percent of second-generation, and 22 
percent of third-and-later generation Latinos are bilingual (Pew Hispanic Center 2002). These 
trends are indicative of what Alba (1999) suggests are simultaneous processes of assimilation 
and multiculturalism with respect to language. 

Research that investigates Spanish maintenance among those who speak English well is key 
in understanding bilingual proficiency as Portes and Rumbaut (2001) note that “a closer look at 
the components of bilingualism shows that the prime reason for its relative absence is not lack 
of English fluency but the loss of parental languages.” (p. 127). Previous research suggests that 
language preferences and use vary across generations in association with a variety of individual, 
social and demographic characteristics (Lieberson and Curry 1971, Stevens 1985, 1986, 1992). 
Demographic, social and family contexts are particularly important as “indicators of the general 
availability of other people with whom the non-English language can be spoken.” (Stevens 1992, 
p.174). Prior research on linguistic shift and maintenance has utilized home language (Alba et 
al. 2002), language preference (Portes and Schauffler 1994), or usual language use (Stevens 
1992) as the strategic site of analysis. As noted by Alba et al. (2002), these approaches do not 
address the extent to which descendents of immigrants are “speaking a mother tongue 
proficiently.” (1p. 480) (See also Espiritu and Wolf 2001, Portes and Rumbaut 2001.).ii Research on 
Spanish proficiency over generations investigates a somewhat different concept than home 
language and language preferences. This paper offers a unique counterpart to previous research 
by investigating the individual, demographic and social characteristics associated with 
differential levels of Spanish oral proficiency among those who have already begun the process 
of acculturation through the acquisition of high-level English proficiency. 


Theoretical Perspectives 


The linguistic accommodation of immigrants and their successors is addressed by 
sociological theories of immigrant incorporation. The different theoretical perspectives 
emphasize effects of various individual, social and demographic characteristics. The 
assimilation model suggests that immigrants, upon arrival in the United States, begin the 
process of language shift that will transform their successors into English monolinguals 
within a few generations (Fishman 1972, Gordon 1964, Nahirny and Fishman 1996). This 
perspective assumes the inevitable use of the majority language by the minority group as part 
of acculturation and implies that over generations linguistic assimilation will be complete, with 
English becoming the native language for subsequent generations (Fishman 1972; Nahirny 
and Fishman 1996). As suggested by Fishman’s four-stage model of language shift, 
assimilation theory presumes the gradual decline in the maintenance of the non-English 
language with a complete loss of it by the third generation. 

Early work on language shift among Latinos (Lopez 1978) finds generational declines in 
their mother tongue to be similar to those of earlier European groups. However, a sizeable 
literature has amassed questioning whether contemporary immigrants, particularly those 
from Spanish-speaking countries, will follow a similar path of incorporation into U.S. society 
as European migrants of the past (Fernandez-Kelly 1998, Massey 1995, Portes 1995b, Portes 
and Zhou 1993, Portes and Rumbaut 2001, Rumbaut 1996, Sassen 1990, Zhou 1997, Zhou 
and Bankston 1998). 
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A number of contemporary scholars have argued that assimilation is segmented for new 
immigrant groups. According to segmented-assimilation theory, language outcomes and their 
effects vary according to the class and ethnic incorporation of immigrant children and their 
families into American society (Fernandez-Kelly 1998, Portes 1995a, Portes and Zhou 1998; 
Portes and Rumbaut 2001, Rumbaut 1996, Rumbaut and Portes 2001, Zhou 1997, Zhou and 
Bankston 1998). Segmented-assimilation theory suggests that there are three different routes 
of incorporation available to children of immigrants; assimilation into the middle class, 
assimilation into an “underclass,” and preservation of an ethnic identity within the immigrant 
community (Portes 1995, Rumbaut 1995). The theory suggests that for non-white youth 
assimilation can restrict social mobility; therefore immigrant parents who live in poor urban 
neighborhoods may emphasize the home culture, including language, in an effort to insulate 
their children from downward assimilation (Portes 1995, Zhou and Bankston 1998). 

While segmented assimilation theory posits variation in assimilation patterns across ethnic 
communities, residence, race and social class, it doesn't discuss potential gender variation 
in assimilation patterns or variations associated with intersections between gender and race. 
As such, Lopez (2003b) proposes an alternate theoretical model, the race-gender experience 
theory that combines elements of segmented assimilation theory and racial formation theory 
(Omi and Winant 1994). The race-gender experience theory argues that “race and gender are 
intersecting, socially constructed processes that are historically variable, overlapping, 
intertwined and inseparable.” (Lopez 2003a p.175; Omi and Winant 1994) The race-gender 
experience theory (although it does not address language specifically) suggests that 
individuals’ experiences may vary across intersecting gender and racial boundaries. Variation 
in language experiences may create differential Spanish maintenance and proficiencies based 
on the combined impact of gender and race. 


Data and Method 


The data used in this research are from the restricted-use version of the National Education 
Longitudinal Study (NELS:88). To date, NELS:88 is the only nationally-representative dataset 
of youth that includes rich data on home language use and multiple indicators of proficiency 
in English and Spanish for investigating the effects of social and demographic variables on 
dual language proficiencies. NELS:88 was administered in 1988 to eighth graders as well as 
their parents, teachers and principals; the dataset provides individual, family and school-level 
data. The base year contains 24,599 students. Hispanic students were oversampled. 

A sample of English-speaking Hispanic children was selected from the eighth-grade data to 
investigate the maintenance and loss of Spanish-speaking proficiency among those who speak 
English well.? These data are linked to 1990 U.S. Census data at the zip code level to provide 
contextual data on the neighborhoods in which the children live. The ability to measure 
neighborhood effects on language makes NELS:88 particularly useful for testing theories of 
immigrant incorporation, as neighborhood effects are explicit within the theoretical perspectives. 
This raises the question of how neighborhoods are measured empirically. (See also Ainsworth 
2002.) A limitation of NELS:88 data is that zip code-level, rather than census tractlevel data, must 
be used to operationalize neighborhoods. While many researchers believe that census tracts are 
better measures of neighborhoods than zip code areas (White 1987), there are some advantages 
to using zip codes. South and Baumer (2000) note that zip codes have an advantage over census 
tracts in that, unlike census tract data, zip codes are available for people throughout the entire 
U.S., not just those who live in metropolitan areas. Brooks-Gunn et al. (1993) find stronger 
community effects on dropout rates using zip codes as opposed to census tracts, and Billy and 
Moore (1992) find similar effects on childbearing comparing counties and census tracts. 
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Sampson (1998) has argued that the ecological stratification of communities is “a robust 
phenomenon that emerges at multiple levels of geography, whether local community areas, 
census tracts, or other ‘neighborhood’ units’” and that “empirical results have not varied much 
with the operational unit of analysis.” The inclusion of contextual data resulted in a 5 percent 
reduction’in the sample size because some cases could not be matched to census zip code data. 

Another question that may arise particularly with regard to neighborhood context, but also 
with regard to school and religious context, is that of reciprocal effects. One might question the 
causal relationship between neighborhoods and language proficiencies. Spanish-speakers may 
choose to live in urban neighborhoods with greater numbers of coethnics because they prefer 
to live with others who speak the same language. However, because this is a sample of children, 
who have little power to choose the neighborhood in which they live, it is more appropriate in 
this case to look at the impact of neighborhoods on Spanish maintenance than vice-versa. A 
similar logic with regard to causality relates also to school type and religious affiliation. 

While NELS:88 is arguably the best source of data to date for investigating social and 
demographic factors related to Spanish language proficiencies among Latino youth in the 
United States, the data have some additional limitations. One limitation of these data is that the 
data do not include a relatively small number of children whose English skills were too limited to 
fill out the questionnaire in English. Although Spanish-speaking students with limited English 
skills remain in the NELS:88 data, their numbers are too small to use in the analysis. Thus only 
processes of language maintenance and shift among children who could read in English well 
enough to complete the questionnaire can be investigated. Additionally, because the NELS:88 
survey was initiated with eighth-grade children, the processes of language acquisition and 
maintenance in English and Spanish can be assessed only to a very limited degree in this paper. 
Ideally, such information could be garnered from surveys of parents of preschool-aged children 
along with various types of assessments of children’s linguistic skills. Also, because children 
were not asked language proficiency questions in the same way as in the NELS follow-up 
surveys, we can only assess children’s Spanish language proficiency in these eighth grade base- 
year data, and thus we cannot measure what happens to language proficiencies in Spanish over 
time, nor can we compare their language proficiencies with those of youth who migrate at older 
ages (such as during the high school years). A further limitation is that there are a number of 
cases (nearly 6 percent) where both parents’ and children’s birthplace data is missing, making 
it impossible to determine the generation status of these children. 


Variables 


Table 1 presents the descriptive statistics for all of the variables used in this analysis. 


Dependent Variable 


The dependent variable of interest is a three category ordinal categorical variable measuring 
the child's self-reported speaking proficiency in Spanish (well, not very well, and not at all). 
The dependent variable was created by using three questions regarding language in the 
NELS:88 student survey. Students were asked a screening question, “Is any language other 
than English spoken in your home?” If they answered “yes” to the screening question, 
students were then asked subsequent follow-up questions about language proficiency. With 
regard to Spanish proficiency, the following question (among others) was asked, “With regard 
to that language... How well do you speak that language?’ Only those students who 
indicated that Spanish and/or English was used in the home were included in the analysis. 
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Independent Variables 


Ethnicity 

Variables for ethnicity are represented by a series of dichotomous variables. Ethnicity is measured 
by the student's response to the question, “Which of these best categorizes your background?" 
There are four possible responses including (1) Mexican, Mexican-American or Chicano, (2) Cuban, 
(3) Puerto Rican or (4) Other Hispanic. Mexican is the omitted category. The segmented assimilation 
perspective hypothesizes an interaction effect on language outcomes between ethnic group and 
family socioeconomic status because language outcomes may operate differently as a function of 
the way ethnic communities have established themselves differently in the United States. 


Racial Identity 

Variables for racial identity are represented by a series of dichotomous variables. Race is 
measured by the student's response to the question, “What is your race?” The three racial 
identity variables include black, white and other. White is the omitted category. Segmented 
assimilation theory hypothesizes race as a determinant of differential assimilation patterns for 
whites and non-whites. While this has been hypothesized, the effect of race on language 
maintenance within Hispanic or Latino pan-ethnic categories has been largely unexamined 
in empirical sociological research. 


Gender 

Gender is a dichotomous variable measured by the student's self-report of sex, either male or 
female (male = 1/female = 0). Although gender is not addressed specifically by theories of 
immigrant adaptation, research finds women to be more likely to maintain a minority 
language (Stevens 1986, Veltman 1981) suggesting that children’s linguistic adaptation could 
be “segmented” by gender. 


Gender and Racial Identity 

Variables for gender and race identity are represented by a series of dichotomous variables 
reflecting a combination of the racial identity and gender variables. The categories include 
black female, black male, white female, white male, other female and other male. White male 
is the omitted category. The race-gender experience theory hypothesizes that youth's 
experiences are both racialized and gendered in ways that may create different language 
proficiencies across the race/gender categories. 


Generation’ 

Generation status is measured utilizing the base year questionnaire items on the student and 
parent birthplaces. (See also Oropesa and Landale 1997 for a similar identification of immigrant 
generation.) Generation is constructed through the use of four dummy variables: first 
generation, second generation, third and later generations, and unknown generation (where first 
generation is the omitted category). First generation includes those children who are born 
outside of the United States. Second generation includes U.S.-born children with at least one 
foreign-born parent. The third and later generations category is made up of U.S.-born children of 
second-generation parents, or children of other native-born parents. Island-born Puerto Ricans 
are treated as foreign-born in the construction of generation status. A variable for “unknown 
generation’ is included for the 6 percent of the sample missing birthplace data that would 
establish generation status. While there are some indications that those who are missing data 
for generation status are a group comprised mostly of first- and second-generation children, the 
evidence is not compelling enough to warrant an imputation of data on this variable. The 
assimilation perspective anticipates that over generations, proficiency in Spanish will decline. 
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Religion 

Religion is represented by a dichotomous variable indicating whether the student is Catholic 
or belongs to another religion: Catholic = 1/non-Catholic = 0. Being religiously Catholic is 
often an indicator of retention of ethnic cultural ties, and perhaps to religious services in 
Spanish (Alba 1988, Byun 1990). Therefore, Catholic religious identification is expected to be 
associated with higher levels of Spanish maintenance. 


Single Parent Status 

Parental marital status is a dichotomous variable which indicates if the child's parents are not 
married or married (or in a marriage-like relationship): Not married = 1/Married = 0. The 
relationship of marital status to maintained proficiency in Spanish is unclear. Stevens (1992) 
has found that for adults “being married is strongly associated with speaking English only... 
unless the individual’s spouse can speak the same minority language.” (p. 181) The question 
here, however, is the relationship between parental marital status and the transference of 
language skills to the second generation. It could be argued that single parents lack sufficient 
time and the support of another adult to pass on high-level language skills. Other researchers 
have reported a lack of mother tongue transference in cases of intermarriage (Alba et al. 2002, 
Grenier 1984); the effects of single parenthood may be similar. 


Number of Siblings 

This is measured by the student's or parent's base year report of the number of brothers and 
sisters the student has including any stepbrothers and/or stepsisters who live in the same 
home. Values range from “0” to “6 and above.” It is anticipated that families where there are 
larger numbers of siblings with whom to converse in an ethnic mother tongue, Spanish 
proficiency will be greater. 


Parent’s English Proficiency 

This is measured by a series of dummy variables: English only, well, not well and none, with 
English only as the reference category. The variables were constructed from the focal parent's 
base year self report of English speaking ability. It is anticipated that parent's English 
proficiency will be negatively associated with children’s Spanish proficiency. 


Social Class 

There are two measures of social class: yearly income and parental education. Yearly family 
income is measured in thousands of dollars using the parents’ report of the yearly family 
income in the parental interview in 1988. Because income was reported within a range of 
income categories in the survey, yearly family income is set to the midpoint for each category. 
Mean parental education refers to the combined level of education completed by the parents 
or guardians in years. When two parents are present in the household, the average of the 
parents’ education is used. When only one parent is present, that parent's education level is 
used. 

The assimilation perspective hypothesizes that the higher the social class of one’s parents 
the more likely the parents and children are integrated into the culture of the host society, and 
therefore the greater the likelihood that the children will speak only English. Social class also 
plays a role in the segmented assimilation perspective. Portes and Rumbaut (1996) contend 
that fluent bilingualism in not within the reach of all immigrants, but is a function of social 
class. They suggest that immigrant parents with high socioeconomic status have the 
resources available to be able to maintain high levels of English proficiency while resisting the 
societal pressure to shift to English completely. 
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School Type 

A series of dichotomous variables are included to indicate whether the eighth grader attends 
a public, Catholic or private school. Public school is the omitted category. The presence of 
Cuban bilingual private schools suggests that private school attendance would be associated 
with higher proficiency in Spanish, particularly Spanish literacy, compared to public schools. 
The effect of Catholic school attendance is difficult to hypothesize. While Catholic religious 
affiliation may be associated with ethnic cultural ties, Catholic schools often lack bilingual 
education programs present in public schools because of insufficient funds or interest. 


Neighborhood Context 
Three demographic indicators of neighborhood context are utilized: average household 
income, urbanicity and percent of residents who are Hispanic. To utilize neighborhood level 
variables, the NELS:88 data are linked to 1990 U.S. Census data through the use of zip code 
identifiers allowing the measurement of the average household income in the neighborhood, 
urbanicity (percent of the zip code area defined by the Census as urban), and the percent of 
residents who are Hispanic. It is hypothesized-that living in a neighborhood with a large 
density of coethnics is associated with retaining Spanish, and, conversely, living in a 
neighborhood made up primarily of non-coethnics is associated with English monolingualism. 
In order to provide some background on the processes by which English and Spanish 
proficiency are gained and maintained, as well as the domains in which they are used, 
descriptive data is presented using the entirety of the Hispanic sample in NELS:88 (N = 
3123), including those who have a high level of proficiency in English as well as those who do 
not. The analysis then focuses on the production of bilingualism by examining the individual, 
social and demographic characteristics associated with the maintenance of high-level oral 
proficiency in Spanish among English-speaking Hispanic youth. Ordered logit models were 
estimated as the dependent variable is based on ordinal-level data, indicating that the intervals 
between the response categories (well, not very well, and not at all) are not necessarily equal. 
The ordered models were estimated in STATA using the svy ologit procedure (Stata Corp 
2003).vi In this case the models estimate the log odds of being in the higher level Spanish- 
speaking proficiency categories relative to the lowest category (not at all). 


Findings 
First the findings related to the entirety of the Hispanic sample (N = 3123) are presented to give 
some background on the processes of language maintenance and acquisition. The NELS:88 
data indicate that most use English as their usual language, but also have a great deal of 
exposure to a second language at home, usually Spanish. It appears that most youth are 
learning Spanish first at home (perhaps alongside English) and shifting to greater use of English 
as they enter and progress through the school system. Households with foreign-born parents 
are much more likely to be places where languages other than English are spoken. Almost 92 
percent of the first generation children and 90 percent of second-generation Hispanic children 
lived in households where this was the case. This figure drops to nearly 58 percent in the 
households of third-and-later generation children. Only about 8 percent of all Hispanic children 
have participated in regular coursework in Spanish, and this participation is largely limited to 
those who lacked skills in English. Therefore it is likely that most children’s proficiency in 
Spanish (particularly in the case of children who speak English well) is gained at home and/or 
in other non-school settings such as neighborhoods with large Hispanic populations. 
Linguistic assimilation is common in the sense that children are rapidly acquiring and using 
English as their usual language. The shift from Spanish to English as a usual language 
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appears to occur as children progress through the school system. More than 61 percent 
reported that they spoke a language other than English before starting school. The vast 
majority, more than 87 percent, responded that by the eighth grade they usually spoke in 
English, while slightly more than 12 percent usually spoke in Spanish, and less than 1 percent 
usually spoke another language. With large numbers of children shifting from Spanish to 
English as their usual language, the question of the individual, social and demographic 
characteristics associated with Spanish maintenance and proficiency is compelling. 

Many Hispanic youth who speak English well also speak Spanish; a little more than 50 
percent speak Spanish well; nearly 10 percent speak some Spanish, but do not speak well; 
and almost 40 percent speak no Spanish. Table 2 presents the ordered logit models 
predicting oral proficiency in Spanish. The likelihood of speaking Spanish well is significantly 
higher for the first and second generations compared to the third and later generations. 
Overall, the ordered logit model indicates that the effects of generation, parental English 
skills, private school attendance and race loom the largest. 

Of the individual-level factors, generation and race are particularly important in predicting 
Spanish-speaking proficiency. With respect to generation, there are significant differences in 
Spanish language proficiency between children of U.S.-born parents and children of foreign-born 
parents. The third and later generations (the U.S.-born children of U.S-born parents) are 
significantly less likely to have high levels of Spanish-speaking proficiency compared to first- 
generation children, while the second-generation children (the U.S.-born children of foreign-born 
parents) are not significantly different from their first-generation peers. In fact, third-and-later- 
generation children are more than 3.5 times less likely (Model 1, 1/ e ‘*' = 3.636) to speak 
Spanish well compared to their first-generation peers. There do not appear to be significant 
ethnic differences in the likelihood of maintaining higher Spanish speaking proficiency. However, 
there are differences across race and gender. Compared to Latino students who identify as 
white, those who identify as racially black are significantly less likely to have a high level of 
Spanish-speaking proficiency, while those who identify as racially “other” are not significantly 
different from those who identify as white. Females are more likely to have greater Spanish- 
speaking proficiency than males. Consistent with the race-gender experience theory, the 
combined effect of race and gender (Model 2) indicates that females who identify as racially 
“other” have significantly higher Spanish proficiency compared to males who identify as white; 
males who identify as black have significantly lower Spanish proficiency than males who identify 
as white. Other race/gender categories (white females, black females, and other males) are not 
significantly different from males who identify as white. 

Family also plays an important role in children’s Spanish-speaking proficiency. Children of 
single parents are significantly less likely to maintain Spanish-speaking proficiency, perhaps 
because of limited time and lack of another person to reinforce Spanish speaking at home. 
Parents’ language skills in English also play an important role in predicting their children’s 
Spanish-speaking proficiency. As might be expected, children with parents who are not English 
monolinguals are significantly more likely to maintain Spanish than those whose parents also 
speak Spanish. Children with parents who speak English well but who also speak Spanish are 
more than four times as likely to speak Spanish well compared to children whose parents are 
English monolinguals (Model 1, e'“*! =4.141). Children with parents who do not speak English 
very well are more than five and a half times as likely (Model 1, e'”*'=5.646), and children with 
parents who speak no English are more than six times as likely (Model 1, e'®* = 6.228) to speak 
Spanish well compared to children with parents who speak only English.’ 

Consistent with research in the assimilation perspective, those with higher family incomes 
are significantly less likely to speak Spanish well. While family income has a significant negative 
effect on speaking Spanish well, this effect seems to vary by ethnic group. In an additional 
model that is not shown here, a multiplicative term was added to investigate possible interaction 
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Table 1: Descriptive Statistics of the Variables Used in the Ordered Logit (N = 2435) 


Variable Name Mean Std. Dev. 
Spanish-speaking Proficiency Pie 94 
Generation 
First Generation AT 37, 
Second Generation 38 AQ 
Third and Later Generations 37 49 
Unknown Generation 06 74 
Ethnicity 
Mexican 62 49 
Cuban .04 .20 
Puerto Rican 10 30 
Other Hispanic 23 42 
Racial Identity 
White 60 49 
Black .06 24 
Other oO! 46 
Gender 
Female ay .50 
Gender and Racial Identity 
White Female a2 46 
Black Female .03 ce 
Black Male .03 aie 
Other Female 515 .36 
Other Male 16 eo! 
Catholic 67 AT 
Family Status Variables 
Single Parent 21 A 
Number of Siblings 2.15 1.69 
Family Income (in 1000s of dollars) 26.12 23.89 
Mean Parental Education 12.31 2.98 
Parent’s English Proficiency 
English Only 223 42 
Well O11 50 
Not Well 154 .36 
None .062 24 
School Type 
Public School 872 .34 
Catholic School 079 ae 
Private School 015 2 
Neighborhood and Geography Variables 
Percent Urban TACSS 42.82 
Percent Hispanic 39.59 31.56 
Neighborhood Median Household Income 27.56 10.98 


o_o 
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Table 2: Ordered Logit Predicting Spanish Speaking Ability (N = 2435) 


Independent Variables 


58S 


Generation 

First Generation (reference) 

Second Generation 

Third and Later Generations 

Uknown Generation 
Ethnicity 

Mexican (reference) 

Cuban 

Puerto Rican 

Other Hispanic 

Racial Identity 

White (reference) 

Black 

Other Race 
Gender 

Female (male is reference) 
Racial Identity/Gender interactions 


White Female (white male is reference) 


Black Female 
Black Male 
Other Female 
Other Male 
Religion 
Catholic (non-Catholic is reference) 
Family Variables 
Single Parent 
Number of Siblings 
Family Income (in 1000s of dollars) 
Parents Education 
Parents English Proficiency 
English Only (reference) 
Well 
Not Well 
None 
School Type 
Public School (reference) 
Catholic School 
Private School 
Neighborhood and Geography 
Variables 
Percent Urban 
Percent Hispanic 
Neighborhood Median Household 
Income 
Intercept 1 
Intercept 2 
F 


Model 1 Model 2 
b b 
-.138 (.209) -.139 (.208) 
-1.291. (.236) *** 1.2989 (235)i00° 
-.179 (.241) -.182 (.243) 
588 (.362) bio. (001) 
266 (.217) 265 (.214) 
-.279 (.154) -.285 (.154) 
-.883 (.260) ** 
221 (.140) 
.224 (.106) * 
td (ore 
-.511  (.365) 
-1.068 (.335) ** 
466 (.183) ™ 
174 (.175) 
111 (.103) 111 (.103) 
-.380 (.140) ™ -.384 (.140) ™ 
039 (.042) 039 (.042) 
-.026 (.003) * -.007 (.003) * 
-.007 (.025) -.025 (.025) 
1.421 (.144) ** 1.426 (.144) ** 
th S10 (FOO rt NATO ROME 4 
1.829 (.304) *** 1.845 (.308) ** 
091 (.176) 093 (.177) 
1300 e (oT e 1.373% G523)qr" 
001 (.002) 000 (.002) 
009 (.003) *** 009 (.003) *** 
-.002 (.006) -.001 (.006) 
1.168 (.458) ™ 102 (.597) 
1.732 (.449) “** 666 (.581) 
Aes 22.64*** 


Note: Standard errors of the estimates are shown in parentheses. 


“Dix .05) pO beep OU 
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effects between ethnicity and family income. For Cubans (compared to Mexicans), family 
income significantly increases the likelihood of speaking Spanish well net of the effect of other 
variables. In the Cuban case, this is likely related to an ethnic economy that allows for 
socioeconomic mobility while also preserving Spanish. However these results must be 
interpreted with some caution because while the effect is significant at the single degree of 
freedom contrast, the interaction term does not add significantly to the explanatory power of the 
model as a whole. 

School and neighborhood context appears to influence Spanish proficiency as well. Students 
who attend private schools are nearly four times more likely to speak Spanish than students 
who go to public schools (Model 1, e'* = 3.931). Private school attendance is largely a Cuban 
phenomenon; about 7 percent of Cubans, less than 1 percent of Mexicans, a little more than 1 
percent of Puerto Ricans, and 3 percent of other Hispanic youth in this sample attend private 
school. Those who attend Catholic schools are not significantly different in their Spanish- 
speaking ability than those who go to public schools. Those who live in a neighborhood with 
high percentages of Hispanic residents are also likely to have greater Spanish-speaking ability. 


Discussion 


What do these findings say about Spanish-speaking proficiency among English-speaking 
Latino youth? First, parents’ birthplace has a very strong effect on Spanish-speaking 
proficiency. The effects of generation suggest that parents’ nativity is more important than the 
child's in predicting high levels of Spanish-speaking proficiency. (See also Stevens 1985 for 
similar results regarding the effect of parents’ nativity on children’s language.) Thus, it appears 
that children who speak Spanish well do so because they live in an environment in which their 
foreign-born parents use Spanish with them. Family, school and community contexts are also 
important in terms of reinforcing Spanish use and creating opportunities to speak Spanish. 
Single parents may have a more difficult time resisting the linguistic Anglicization of their 
children, and maintaining Spanish at home than two-parent families. As income rises, the 
likelihood of speaking Spanish well decreases, perhaps reflecting increased entrée into 
American educational, social and economic spaces, and decreased participation in ethnic 
institutions. However, income may operate differently across ethnic lines, such that high 
income increases Spanish proficiency for Cuban youth, who tend to be concentrated in an 
ethnic enclave that offers incentives to maintain Spanish in social, scholastic, and professional 
environments. In general, those who live in an environment with other coethnics:are also 
more likely to have higher Spanish-speaking proficiency, as seen by the effects of percent 
Hispanic in the neighborhood. 

Girls are also more likely to speak Spanish well than boys. This is consistent with literature 
suggesting that girls have greater verbal aptitude than boys in general. (See Kleinfeld 
1998, 1999.) However, girls also seem to have a different linguistic social context than boys. 
Portes and Hao (2002), for example, have found girls to have “a stronger attachment to 
parental languages.” (p. 902) Language skills may, in this sense, reflect greater interaction at 
home between parents and girls. In her study of the linguistic topography of a Puerto Rican 
neighborhood in New York, Urciuoli (1991) found that everyday interactions are structured by 
gender in ways that create meaningful gendered ideologies with respect to language. She 
notes “the networks in which men and women operate, and indeed in which they have grown 
up, are relatively gender-segregated, and it is in gender-segregated everyday interactions that 
opposing ideologies of language and persona are most likely to emerge.” (p. 301) She finds, 
for example that women tend to “slip into Spanish” with non-Spanish speaking neighbors and 
thus share cultural and linguistic repertoires with them in ways that men do not. Other 
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research finds women to be more likely to maintain a minority language (Stevens 1986, 
Veltman 1981) either because they do not face the same pressures to learn English as men, 
or because they teach the minority language to children at home. Stevens (1986) notes that 
once women gain English proficiency they use English more often than men, but are less 
likely to become English monolinguals. In this sense maintenance and loss of Spanish 
proficiency appears to be segmented by gender, potentially reflecting different gendered 
language environments, opportunities, and repertoires. 

The significant relationship between racial identity and language proficiency raises 
questions about causality. As such, racial identification may be linked to the students’ feelings 
about integration into the U.S. society with its notion of a racial binary. Waters (1994) notes, 
"The children of black immigrants... face a choice about whether they will identify as black 
Americans or whether they will maintain an ethnic identity reflecting their parents’ national 
origins.” (p. 796) (See also Foner 1987, p. 796; Kasinitz 1992, Kasinitz-et al. 2001; Waters 
1999.) Waters (1990) and Alba (1990) claim that for descendants of white ethnics, ethnicity 
becomes an optional and somewhat fluid identity. In this sense, racial identification itself 
becomes an indication of the shedding of ethnicity as a meaningful distinction and 
assimilation to U.S. racial norms. Olsen (2001), for example, finds that for immigrant students 
“learning English is a precursor to learning their racial place” (p. 391), with English 
monoligualism perceived to be the “uncontested passage in becoming American,” a 
passage they later find also requires fitting into a limited set of racial categories (pp. 386-387). 
Students who identify as racially black or white may do so because they perceive the 
“Hispanic” category is less relevant in their lives, while those who identify as “other” may do 
so to identify only with the “Hispanic” category. Girls who identify as “other” are likely to have 
significantly higher levels of Spanish-speaking proficiency than white males (“other” males are 
not significantly different), which may reflect gender differences in homeland-oriented 
families in which children might be more likely to see themselves only in terms of a pan-ethnic 
Hispanic or Latino label (i.e., other, rather than black or white). For example, Toro-Morn and 
Alicea (2003) in their study of Puerto Rican families have found that “parents attempted to 
reconstitute homeland communities and values in the United States as ‘authentic’ cultural and 
gendered spaces,” and in such cases girls may be socialized into the ideal of marianismo, in 
which “a woman is defined by her self-abnegation and by placing family and community 
above her own needs” (Confresi 1999) while men have more freedom from family and 
community responsibilities (p. 203). In this sense, in homes where families attempt to 
preserve family relations based on the homeland, males may have greater access and 
participation in social institutions outside the ethnic community than females and therefore 
may use Spanish less than do females. Alternatively, perhaps youth with darker skin tones, 
particularly black males, are less likely to be perceived as “Hispanic” or “Latino” outside of the 
home and thus have fewer opportunities to engage in Spanish-language encounters, more 
opportunities to engage in English-language encounters, or greater pressure to assimilate to 
English monolingualism. As such, black Latinos who speak English well may face a similar 
scenario to second-generation Jamaicans or Trinidadians who have been described as 
“invisible ethnics.” (Bryce-LaPorte 1972) 

The segmented assimilation framework would suggest that black Latinos are assimilating 
into African-American cultural environments, including decreased use of (and therefore 
decreased proficiency in) Spanish. The gender-race experience theory suggests that the 
combined effect of racial identity and gender is an indication that the segmentation of 
linguistic assimilation across gender and race likely reflects language experiences, 
opportunities to speak Spanish, and access to the domains in which Spanish is spoken. In this 
way, race and gender may operate to structure the social context of linguistic relations. 
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Conclusion 


These results appear to indicate that language proficiencies in Spanish reflect social and 
demographic contexts that allow for greater or lesser exposure to Spanish use within the 
United States. The theoretical perspectives of assimilation, segmented assimilation and race- 
gender experience theory illustrate the domains by which access to Spanish is structured. 
As Fishman’s model suggests, it appears that many of the social and demographic factors 
associated with Spanish maintenance are important in that they structure the opportunities 
for interaction in Spanish and the domains in which Spanish is used. Effects of generation, 
residence in concentration with coethnics, and income on speaking Spanish well appear to 
follow the theoretical trajectory of the assimilation perspective whereby language 
accommodation is part of a larger process of shedding of ethnic distinction and integrating 
with the larger society. Yet, the picture is also a somewhat nuanced one as the results indicate 
that Spanish-speaking proficiency is also significantly structured (or “segmented”) by gender 
and race. This may have something to do with the racial climate of the United States, where 
those who appear to be black may not be recognized as Hispanic or Latino, and/or they may 
be absorbed into the native-born black or white society more quickly than others. The 
combined effect of race and gender indicates support for the race-gender experience theory 
in that language proficiencies appear to be structured by both race and gender perhaps as a 
result of different language experiences, environments or norms. 

Further research is warranted on this topic. Such research might include further 
investigation of within-group differences in language maintenance and proficiency, particularly 
racial and gender differences. Additional data collection is also warranted to research patterns 
of bilingual proficiencies and use. In order for research to continue on this topic it is critical 
that new sources of nationally-representative survey data that include measures of proficiency 
in both English and Spanish (and/or other languages) become available. Such data collection 
efforts might include surveys that would allow researchers to assess processes of language 
acquisition and maintenance at very young ages as well as longitudinal data that would allow 
researchers to see how language proficiencies may change as children enter and progress 
through the school system. Further survey data collection efforts might also include multiple 
measures of proficiency such as those used by linguists and psychologists in addition to self- 
reported proficiency. 


Notes 


1. The term “Hispanic” is used in this text when referring specifically to the NELS:88 data 
because that is the term used in the data collection. Elsewhere in the text, the term, “Latino” 
is used because the paper focuses on youth with Latin American and Caribbean ancestries. 


2. Espiritu and Wolf (2001) note the discrepancy between use of language at home and 
language proficiency in their analysis of the Children of Immigrants Longitudinal Study 
(CILS) data, saying “although over 90 percent of the Filipino sample reported speaking a 
language other than English at home, they were not fluent in this language.” (p. 175) 
Similarly, Portes and Rumbaut (2001), using a larger multiethnic CILS sample find that 
“while over 90 percent of the sample report knowing a language other than English, their 
fluency in that language is significantly poorer.” (p. 119) 


3. Children were selected for the multivariate analysis if their overall English proficiency was 
equivalent to “well” or above. 
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4. Fishman and Terry (1969) have found that self reported language skills have high levels of 
validity. In their research, they compared self reports based on questions that are very 
similar to those used in NELS:88 to proficiency measures used in linguistic and 
psychological research on language and find the validity of the self reports “tends to be 
rather substantial and consistent.” (p. 643;) (See also Fishman 1969.) 


5. Although not shown here, additional models broke the first generation category into two 
categories using two different schemas. The first schema broke the first generation into 
two categories: (1) recent migrants, those who migrated within the past two years, and (2) 
those who had migrated more than two years ago. The second schema also separated the 
first generation into two categories: (1) those who had attended some school in a country 
of origin, and (2) those who had attended school only in the United States. In neither case 
was there a significant difference between the two groups in Spanish-speaking proficiency. 
Additional models included a dummy variable for school attendance outside the mainland 
United States. However, net of the other variables it fell just short of significance. 


6. | use the svy ologit command in STATA because it corrects for the complex sampling 
design of the National Educational Longitudinal Study (NELS:88) through the use of 
sampling weights, strata and psu variables. With the regular ologit command, one can 
test for the assumption of parallel slopes using SPost or omodel (Long and Freese, 2003). 
To my knowledge, there is currently no test available that allows one to test for the 
assumption of parallel slopes in conjunction with the svy ologit procedure. However, 
similar effects were found using multinomial logistic regression. 


7. Additional analysis not shown here indicates that the coefficients for the three categories 
of “well,” “not well” and “none” for parent's language skills are not significantly different 
from one another. However, | have included all four categories in the analysis (rather than 
collapsing them into two categories) to indicate that each group is significantly different 
from parents who speak only English. 
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Abstract 

This study explores the relationship between social capital and adolescent violent behaviors 
for a national sample of secondary school students (N = 4,834). Cross-sectional data from 
The National Longitudinal Study of Adolescent Health were used to evaluate multivariate 
models examining the family, school and neighborhood correlates of violent behaviors. 
Results demonstrate the importance of social capital factors across domains as significant 
resources moderating violence outcomes, especially parent-child relationships and school 
affiliation. Although we hypothesized that greater sports and club participation would 
decrease tendencies toward violence, results indicate otherwise. 


Introduction 


Adolescent violence is a social and public health problem of national concern. The effects of 
peer-perpetrated aggression includé emotional distress, physical injury and death. Violent 
offenders often become victims of aggression, using physical force and threat as a means of 
coping through acting out, displacement and self-defense (Brener et al. 1999; Hashima and 
Finkelhor 1999). Therefore, in general, violence has a broad impact on adolescent health and 
behavior with both immediate and long-term physical, psychological and social consequences. 
Injuries and adverse health often persist into adulthood, as physical and psychological scars, 
for both perpetrator and victim. Thus, health policies promoting optimal physical and mental 
well-being over the life course should continue to focus on prevention of problematic behaviors 
such as violence among adolescents to ensure maximum effectiveness. These factors should 
be identified within a framework that includes broader social influences as well as individual 
characteristics. 

This research investigates potential correlates of violence-related outcomes among 
adolescents attending secondary schools from a social capital perspective. Specifically, this 
study attempts to identify important sources of social capital as possible resource factors, 
which help reduce the frequency of violent behaviors. In this framework, social capital is 
defined as resources embedded in social relationships and ties that can be used for 
expressive purposes such as the maintenance of physical health (Lin 2001). To this end, social 
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capital has important consequences for adolescent behaviors. Fitzpatrick (1997b) found that 
adolescents are more likely to threaten or attack others and jeopardize physical well-being 
when they lack relationships and ties and are disconnected from others in the proximate 
environment. Conversely, students who are involved in socially constructive activities and 
have those social ties tend to reap the benefits of social capital that are inherent in close, 
cooperative relationships. Through viable social connections, a normative frame of reference 
or expectation for conduct is communicated across family, school and neighborhood 
domains. This aspect of adolescent sociability is an important pathway for accessing and 
using social capital. 


Review of Literature 


According to life course, criminological and related perspectives, adolescent socialization is 
subject to complex social contexts involving family, school and neighborhood (Catalano and 
Hawkins 1996; Dodge and Pettit 2003). These contexts remain the primary adolescent 
pathway for effects of health and behavioral risks as well as assets (Dodge 2001; Hawkins et 
al. 1999). Psychosocial difficulties from these external sources may produce harmful or 
dysfunctional responses. At the same time, these social environmental components are rich 
sources of protection, capable of partially or fully offsetting unhealthy behavioral choices. 
Adolescents are not only exposed to differential social development processes, but they 
respond to these events in ways that are different from other age groups (Herrenkohl et al. 
2001). The salience of social influences and the manifestation and increase/decrease of risk 
behavior vary by age, motivating the need to understand the role of social relationships in 
developmental pathways during adolescence (Dodge 2001; Thornberry, Huizinga and Loeber 
1995). It is therefore important to acknowledge that the focus of this work — social capital and 
violent behavior — applies across complimentary perspectives of criminology and injury 
prevention, making it a useful interdisciplinary concept. 


Adolescent Violence 


In 2003, the Centers for Disease Control and Prevention reported that 33 percent of youth 
aged 10 to 19 years had been involved in a physical altercation in the past 12 months, and 17 
percent had carried a weapon, usually a firearm, in the past 30 days. Of the more than 
400,000 estimated violence-related injuries from physical- and weapon-related assaults, 1 in 
28 required medical care and 1 in 4 resulted in death. Hence, threatened or actual use of 
physical intimidation and force remains far too common in the daily lives of adolescents. 
These statistics make clear that interpersonal violence among adolescents continues to be a 
significant health concern (Dahlberg 1998; Lowry et al. 1998; Satcher 1999; Snyder and 
Sickmund 1999). 

Aside from physical injuries caused by adolescent violence, such behaviors adversely 
affect the learning environment, creating disruptions and an atmosphere of physical 
intimidation leading to anxiety and loss of focus on academic tasks and normal school 
socialization processes (Fitzpatrick 1999). Adolescents who are repeatedly victimized may be 
prone to impulsiveness and irritability thereby lowering their threshold to strike out, especially 
toward younger children. In particular, adolescent victims lacking strong and supportive 
relationships with parents, teachers and peers are also without a source of protection or a 
way to appropriately seek relief from violence. They may instead lash out with aggressive or 
impulsive behaviors of their own in an effort to cope with on-going personal threat. 
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Family Social Capital 


The family environment is an apparent and important source of influence on adolescent 
development and behavioral preferences (e.g., Cookston 1999; Esbensen, Huizinga and 
Menard 1999; Gottfredson and Hirschi 1990; Mitchell and Finkelhor 2001; Parcel and Dufur 
2001). McNeal (1999a, 2001) found that functional characteristics of families that can 
influence adolescent behavior include parental participation in community organizations such 
as a Parent-Teacher Association (PTA) and parental awareness of their adolescent's friends 
and their parents. The former is an example of formal extra-familial ties from which parents 
draw social capital for themselves and their children. The latter establishes informal extra- 
familial ties that can increase the availability of social capital in the form of mutual 
cooperation, trust and monitoring. Close, dense extra-familial supportive networks are 
particularly important for maintaining existing resources (expressive functions) such as 
physical and psychological health. Both formal and informal ties outside the family can 
increase the density of local supportive relationships making supervision and guidance of 
adolescents outside the home more likely and violent behavior less likely. Parents are more 
likely to encourage independence when they believe outside sources such as teachers and 
other parents will monitor their adolescent's activities and report undesirable behaviors 
(Aneshensel and Sucoff 1996; Bowen and Bowen 1999; Brooks-Gunn et al. 1993). Social ties 
between sets of parents and between parents and teachers can usually facilitate social capital 
by fostering trust and reciprocity between parties. 

More importantly within the home, however, are the characteristics of the parent- 
adolescent relationship. Factors that are typically considered supportive of healthy adolescent 
behavior include the extent to which adolescents consider parents warm, loving and attentive 
and the quality of communications and overall relationship between parent and child (Adlaf 
and Ivis 1996; Demo and Acock 1996; Friedman, Terras and Glassman 2000; Malek, Chang 
and Davis 1998; Sokol-Katz, Dunham and Zimmerman 1997; Videon 2002). 


School Social Capital 


Schools provide opportunities for social capital and directly or indirectly, through these 
available resources, influence student behaviors (Lindstrom 2001; McNeely, Nonnemaker and 
Blum 2002). Social capital factors interrelating the adolescent and school experience include 
extent of attachment to school (Resnick et al. 1997). For many adolescents, school is a 
significant hurdle and often an unpleasant challenge due to failure to meet academic or 
behavioral standards leading to frustration, loss of self-esteem and confidence, detachment 
and violent behavior. Such students may have difficulty getting along with their teachers and 
act out with combative or avoidance behavior due to feelings of inadequacy, isolation and 
vulnerability (Simon, Crosby and Dahlberg 1999). These factors also serve as resources for 
students because social and academic success at school opens avenues for normative group 
participation and healthy perceptions of belonging. 

Manifestations of school-related dysfunction may include lack of identification or feelings 
of closeness to or compassion for other students or a failure to relate to the school in general. 
The disaffected student may have inaccurate perceptions of prejudice, unfair treatment and 
so on. Conversely, research has shown that those students who feel cared for by teachers 
and peers are less likely to initiate violent behaviors (Resnick et al. 1997). Those students are 
protected from aggressive tendencies associated with overt assaultive behavior as well as 
internalized equivalents that increase isolation and vulnerability. In short, disengaged 
adolescents frequently lack access to important social capital embedded in constructive 
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interpersonal relationships with teachers and fellow students (Haynie et al. 2001; McNeely 
et al. 2002), putting them at greater risk for engaging in violent behaviors. 

To this end, collective school activities are thought to be sources of protection for 
adolescents (McNeal 1999b). School-sponsored teams, clubs and groups can promote 
feelings of connectedness and camaraderie with other students and staff. To a certain extent, 
beneficial membership effects may be perceived as building constructive resources in lieu 
of passivity that breeds destructive outcomes. Participation in these activities thereby 
substitutes for unstructured, unsupervised and often detrimental socialization and peer 
influence (McNeely et al. 2002). The choice of whether to participate in healthy collective 
pursuits (school-sponsored) or in deviant collectivities (e.g., gangs) and behaviors (e.g., use of 
substances such as alcohol) in many cases determines the likelihood of violent behaviors 
(Fitzpatrick and LaGory 2000). Contrary research findings report that sports participation may 
be a resource for certain subgroups of adolescents only, that minority students continue to be 
at risk for health compromising behaviors (Pate et al. 2000). 


Neighborhood Social Capital 


Adolescents also will be influenced toward or away from violent behavior by neighborhood 
factors (Fitzpatrick 1997a; Fitzpatrick and LaGory 2000). Beyond objective features of the 
neighborhood, social connectedness, which includes involvement with other residents and 
local institutions such as churches or other places of worship, often plays a protective role in 
reducing the risk of adolescent violence. 

For adolescents, social connectedness may be reflected by perceptions of ease in 
interaction with neighbors. This includes the extent to which individuals know, look out for, and 
talk to each other. An adolescent in an involved neighborhood is less likely to initiate violence 
or become a victim. Protective guardianship by parents, peers and other residents reduces the 
likelihood of physical and psychological injury from violent perpetrators (Fitzpatrick and LaGory 
2000). Adolescents are less likely to become involved in deviant or delinquent activities that 
encourage aggression if the local criminal element is minimal or nonexistent. 

When adolescents have local support and feel safe, they are more likely to participate in 
local organizations and institutions that increase potential for social capital benefits for 
themselves and their parents (King, Elder and Whitbeck 1997; Youniss, McLellan and Yates 
1999). Active participation in neighborhood churches and related youth activities raises ethical 
awareness and responsibility and reduces available time for participation in harmful activities 
(Jeynes 2001). Also, an adolescent choosing to devote time to religious and spiritual pursuits 
has less time to be in environments where violence may be initiated. Neighborhood churches 
of various denominations typically have specific activities for adolescents that recognize their 
need for socialization, self-expression and growth (King et al. 1997). Adolescents who feel that 
religion is an integral aspect of their lives may see a purpose beyond themselves, which 
translates into cooperation with and goodwill toward others. Religiously inclined teenagers 
are more likely to conduct their lives according to moral convention and practice, reducing the 
likelihood of violent behaviors. 


Theoretical Framework 


Social capital theory has garnered attention in the sociological literature (e.g., Coleman 1988; 
Lin 2001; Paxton 1999: Portes 1998; Powell and Parcel 1999; Putnam 1995; Wilson 1997). 
Although perspectives may differ on how social capital is returned (e.g., investment vs. 
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endowment) and for whom (e.g., collective vs. individual), there is agreement that social 
capital is a resource within social structures and relationships. Social capital is external to the 
individual, residing in features of social organization such as interpersonal trust and social 
control resources. Thus, social capital is generated through connections and ties with others. 

Returns on social capital are realized when individuals commit to relationships with 
significant others. Such returns can be used for expressive or instrumental purposes; that is, 
they can be used to either maintain existing resources or gain additional resources (Lin 2001). 
For example, social capital found in close, dense connections generally is more conducive to 
the maintenance of ‘health and well-being. Meanwhile, social capital residing in more 
expansive, diverse connections facilitates the likelihood of obtaining additional resources 
such as employment and other opportunities and information not readily accessible to an 
individual (Putnam 2000). In sum, social capital — in the form of relationship-embedded 
resources — is accessed through formal and informal associations with members of family, 
school and neighborhood environments. These webs of social ties provide a normative frame 
of reference for behaviors, providing standards of expectation that include approval or 
disapproval for certain actions or activities. The result is effective and reciprocal trust and 
social control among those who are connected. 


Social Capital and Adolescent Outcomes 


How is social capital relevant to adolescent health and behavior outcomes? An important 
aspect of adolescent development or socialization is the intergenerational transmission of 
values and expectations about behavior. Viable social connections and associations within 
families, schools and neighborhoods are pathways to productive peer and adult role models 
who guide youth through a myriad of health and behavioral decisions, including ways to 
manage anger and resolve conflict. This becomes social capital for an adolescent when 
standards for behavior are communicated via satisfying relationships within the home or 
cooperative monitoring by other parents and teachers. 

For example, families are an important source of social capital, with close and consistent 
parent-child relationships associated with positive adolescent health and behavioral 
outcomes. Social capital is transferred to adolescents when time and attention are more 
readily shared between family members. Adolescents whose parents spend time talking and 
sharing activities are more likely to be satisfied with their parental relationships. 

Parents can access social capital on behalf of their adolescents through active 
participation with others in respective schools and neighborhoods, either informally or 
formally. Mutual behavioral expectations result from ties with teachers as well as parents and 
adolescents outside the family. Parents within the same social network often share similar 
ideas about supervision, and these parents may rely upon one another to enforce disciplinary 
standards in accordance with their own. This cooperation creates a sense of reciprocity and 
trust among the parents that generates social capital for youth (Coleman 1988; Kennedy et al. 
1998). Through these social ties, teachers and other adults relay information about an 
adolescent even when the parent is not present. Through such mechanisms adolescents can 
be recognized and rewarded for indirectly observed productive behavior outside the home. 
Similarly, problematic behavior can be recognized and tempered. In this way, social capital is 
a resource for adolescents and parents. It fosters trust and improves the quality of life, which 
enhances physical and mental health. 

Adolescents can directly access social capital outside the home through their own 
investments in social relationships. When adolescents invest in social ties with others in 
schools and neighborhoods, they become embedded in networks that are pathways to social 
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capital returns. Therefore, just as parental investment of time and attention opens avenues for 
the transmission of social capital, shared activities among peers and adults outside the family 
are important sources of protection for adolescents. For example, attachment to school has 
been shown to create circumstances whereby students feel more integrated (McNeal 1999b; 
Resnick et al. 1997). This then allows them to better focus on the learning environment, often 
resulting in fewer disciplinary problems. Similarly, neighborhood social capital entails cohesive 
networks among neighbors and institutional affiliates such as clergy. Adolescents thus are 
part of a group unified by reciprocal trust and social control. These social capital resources are 
shared amongst a cooperative collectivity of peers and adults who contribute to healthy 
adolescent growth and development. 


Hypotheses 


Though social capital theory continues to enjoy popularity in the sociological literature, it does 
have critics. A major criticism of the social capital literature is its failure to connect theory and 
measurement (Lin 2001), leading to vague or unreliable indicators of theoretical concepts. 
Another arguable flaw in the social capital literature is the extension of the “notion of social 
capital beyond its theoretical roots in social relations and social networks.” (Lin 2001: 28) 
Finally, social capital should not be divorced from social structures, positions and organizations; 
therefore, the effects of these features should be taken into account when constructing and 
testing models of social capital. Evidence clearly shows that both social positions or roles and 
social relationships within family, school and neighborhood domains shape adolescent 
development (e.g., Duncan, Boisejoly and Harris 2001; Fitzpatrick and LaGory 2000). 

For example, an adolescent's social status in social environments is based on 
compositions or properties such as family income and structure, academic achievement and 
years of school, and median household income and modal race indicators for a 
neighborhood. Likewise, these domains include social capital that is available through webs 
of communication, association and mutual support. The theoretical and empirical 
assumptions are that adolescents with better attained resources and social capital resources 
adapt more readily to normative social and developmental goals, which, in turn, decrease the 
likelihood of violent behaviors (Gottfredson and Hirschi 1990; Lin 2001). Using this 
conceptualization, this study addresses the following research questions: How are violent 
behaviors associated with social capital in an adolescent's family, school and neighborhood? 


Family Social Capital 
Hypothesis 1: The more satisfactory the assessment of parent-child 
relationship by the adolescent, the less likely the adolescent will 
report violent behaviors. 
Hypotheses 2: The greater the level of parental monitoring, the lower 
the reported incidence of violent behaviors. 


School Social Capital 


Hypothesis 3: Students reporting a greater sense of affiliation at 
school will also report fewer incidences of violent behavior. 
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Hypothesis 4: Adolescents who are active in school-sponsored clubs 
and sports teams are less likely to report violent behavior An 
empirical distinction between sports and club participation is made to 
test the independent effects of sports and academic/social clubs on 
violent outcomes, thereby determining whether sports is a resource 
or a risk for students. 


Neighborhood Social Capital 


Hypothesis 5: Adolescents who are more involved with neighbors 
are less likely to report violent behavior. 


Hypothesis 6: The level of religious participation reported by an 
adolescent is inversely related to violent behavior. 


Methods 


Adolescent violence was expressed as a composite index representing a summation of 
responses relating to use of weapons and fighting in the prior year. Variables argued 
previously as associated with this principal outcome were classified as individual ascribed 
characteristics and domain-specific attained and social capital factors. Ascribed variables 
such as age, race/ethnicity and gender have been clearly established in the literature. 
Therefore their effects must be taken into account when constructing multivariate models. 
Domain-specific attained factors are more objective aspects of an adolescent's environment 
and are an important part of the social capital model of interpersonal violence. Attained 
variables for the analysis included self-report household income level, presence of biological 
father in the home, grade average, years enrolled at present school, and block group median 
household income and modal race. Composite index variables for domain-specific social 
capital variables also were formulated using summation. Social capital variables included 
parent-child relationship, parental monitoring, school affiliation, sport and club participation, 
involvement with neighbors, and participation in neighborhood churches and other places of 
worship. For this analysis, social capital resources of interpersonal trust and social control 
were derived from variables or indices measuring the quality or quantity of relationships or 
connections with others. 

The analysis involved multivariate linear regression, with the Violent Behavior 
Summation Index tested for the full study sample (V = 4,834). In the model, ascribed, 
attained and social capital explanatory variables were entered in successive blocks. The 
first block of multivariate models consisted of ascribed characteristics of the adolescent. 
The next block represented domain-specific attained factors. Specifically, these were 
more objective factors that differentially position adolescents in the social structure. This 
aspect of the models examined how these preconditioned factors contribute to inequality 
of both social capital and health: domain-specific elements such as economic, structural 
and positional statuses that affect opportunities to construct or maintain social capital and 
health resources. The next block represented social capital factors. This part of the model 
dealt with how social capital resources affect adolescent behavioral outcomes or how 
“social capital is capital” (Lin 2001: 247), that is, how it generates positive behavioral 
outcomes or reduces negative ones. 
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Data 


This study uses data from The National Longitudinal Study of Adolescent Health (Add Health). 
The primary sampling unit consisted of U.S. high schools. If a selected high school did not 
include grades 7-12, a junior high school that sent graduates to the high school was identified. 
From this sampling frame, 80 high schools with probability proportionate to enrollment size 
were recruited to participate. Additionally, 52 feeder middle schools were recruited with 
probability proportionate to their student contribution to the high school (N = 132). 

From the participating schools, adolescents enrolled in grades 7-12 were selected for 
interviews. For the in-school sample, self-administered surveys were given to students in 
attendance during one 45- to 60-minute class period. No make-up was given to students 
absent on the original survey day. Parental permission was either passive (the form was to 
be signed by a parent and returned if they did not want the child to participate) or active 
(some schools required the consent form signed and returned before a child could 
participate). For the in-home interview, the sample included students who did and did not 
complete the in-school questionnaire, with the latter selected from a roster provided by the 
school. Grade and gender stratified students, with 17 students randomly selected from 12 
stratums. This resulted in a sample of about 200 adolescents from each of the schools. The 
interview consisted of both interviewer-assisted questions and audio computer-assisted 
self-interview technology. The latter was used for more sensitive questions involving 
violence, delinquency and substance use. For the parent interview, the mother or other 
female head of household was asked to participate. If mother, stepmother or other female 
guardian was not available, the interviewer spoke with the father, stepfather or other male 
guardian living with student. Finally, neighborhood block group data were derived from the 
U.S. Bureau of the Census STF-3A. 


Measurement of Variables 


Dependent Variable 


A Violent Behavior Summation Index consisting of self-report frequency of aggression/fighting 
and weapon use during the past 12 months was constructed by summing related responses. 
Weapon use items were (a) you used or threatened to use a weapon to get something from 
someone, (b) you pulled a gun or knife on another person, and (c) you shot or stabbed another 
person (0 = not at all, 1. = 1 or 2 times, 2 = 3 or 4 times, 3 = 5 or more times). 
Aggression/fighting items were (a) you were in a physical fight, (b) you took part in a fight where 
a group of friends was against another group, and (c) you hurt someone badly enough to need 
bandages or care from doctor/nurse (0 = never, 1 = once, 2 = more than once). This index had 
a range of 0-15 (M = 1.11; SD = 1.88) and was reliable with a Cronbach's alpha of .70. 


Independent Variables 


Ascribed (Sociodemographic) 

Gender was a nominal variable with 0 = male. Approximately 48 percent of the sample was 
male. Race/ethnicity was represented by separate dichotomous variables for black, Hispanic, _ 
Asian and American Indian, with white as referent. Approximately 59 percent of the sample 
was white, 23 percent was black, 12 percent was Hispanic, 4 percent was Asian, and 2 
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Table 1: Descriptive Statistics for Select Variables in the Analysis (N = 4,834) 


— 


Variable Percent M (SD) 


Gender 


Male 47.9 

Race/Ethnicity 

White 58.6 

Black . 23.2 

Hispanic Ast 

Asian a7 

American Indian 2.1 

Biological father in household 

Yes 48.8 

Parent’s income (annual) 

< = $10,000 7.8 

$11,000-$20,000 dikeg 

$21,000-$40,000 27.6 

$41,000-$60,000 23.4 

$61,000-$80,000 10.0 

> $80,000 8.5 

Age (range 13-19 years) 16.02 (1.62) 
Grade average (4.0 scale) 2.85 (.79) 
Years enrolled at current school 2.70 (1.66) 
Sports participation (range 0-13) 1.28 (1.47) 
Club participation (range 0-20) 1.14 (1.57) 





percent was American Indian. Age was an interval-level variable with a range of 13-19 years 
with a mean of 16 years (SD = 1.6). 


Family Attained Factors 

Family income earned in 1994 was an interval variable with a range between $0 and $999,000. 
This value included income from respondent, everyone else in household, and welfare 
benefits, dividends, and other sources. Data for self-report annual income initially were 
missing for 22 percent of the sample. Adolescents from homes not reporting annual income 
were not significantly different from other respondents on values for pertinent explanatory 
or outcome variables. Mean substitution for missing cases were imputed prior to analyses 
using subgroup averages based on parent's education, race, and employment and marital 
status. Approximately 19 percent of the sample reported $20,000 or below, and 18 percent 
reported income greater than $60,000. A little more than half the sample (51 percent) 
reported incomes between $21,000 and $60,000. The question “does child's biological father 
live in this household” was a dichotomous variable ranging with 0 = no. About 49 percent of 
respondents reported that their biological father lived in the household. 
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Family Social Capital 

For the Parent-Child Relationship Index, adolescents were asked if they feel (a) parents are 
caring, (b) parents understand them, (c) family has fun together, and (d) family pays attention 
(1 = not at all, 2 = very little, 3 = somewhat, 4 = quite a bit, 5 = very much). The Parental 
Monitoring Index assessed the level of formal and informal social ties with dichotomous 
nominal responses (0 = no). For formal ties, parents were asked if they are active members 
of the PTA. For informal ties, parents were asked if they know their child's friends and friend's 
parents. The composite parent-child relationship measure had a range of 1-20 (MV = 15.98; SD 
= 2.94) and was reliable with a Cronbach's alpha of .80. The composite parental monitoring 
measure had a range of 0-3 (M = 2.08; SD = 0.81) with a Cronbach's alpha of .66. 


School Attained Factors 

Grade average was a self-report indicator of academic success over the most recent grading 
period in four major subjects: English/Language Arts, Mathematics, History/Social Studies 
and Science (4 = A, 3 = B, 2 = C, 1 = D). The values were reverse coded so that higher 
values indicated better grades, summed, and then divided by the number of subjects the 
respondent reported. For example, if the adolescent reported enrollment in English, Math and 
History, the summed value was divided by three. If the adolescent reported taking all subjects, 
the summed value was divided by four. The mean grade average for sample adolescents was 
2.85 (SD = .79). Past research has found that there generally is a high correlation between 
self-report grades and actual grades recorded in school records (Bogenschneider 1997; 
Dornbusch et al. 1987). The number of years respondent has been a student at their current 
school, as a measure of mobility in this domain, was an ordinal variable with 1 = first year, 2 
= second year, 3 = third year, 4 = fourth year, 5 = fifth year and 6 = more than five years. The 
average number of years adolescents have been enrolled in their respective schools is 2.70 
(Sip): 


School Social Capital 

The School Affiliation Index assessed attachment to school. Students were asked if they (a) 
feel close to people at the school, (b) feel like a part of the school, (c) feel other students are 
prejudiced, (d) are happy to be at the school, (e) feel the teachers treat student fairly, and (f) 
feel safe at school. Responses to the student prejudice question were reverse coded so that, 
along with other items, higher values indicated greater attachment (1 = strongly disagree to 
5 = strongly agree). The summed index had a range of 1-30 (M = 20.48; SD = 4.97) witha 
Cronbach's alpha of .67. The school participation variables were the sums of the number of 
academic and social clubs/organizations and sports teams the adolescent participates in or 
plans to participate later in the school year (0 = no). Clubs/organizations included language, 
book, computer, drama, academic, honor society, student council, yearbook, debate, 
newspaper, chorus/choir and orchestra (WV = 1.11; SD = 1.57). Sports teams included 
baseball, basketball, football, hockey, soccer, tennis, volleyball, wrestling, swimming and any 
other sports (VM = 1.28; SD = 1.47). Because some research found that high school sports 
are associated with negative health behaviors (e.g., Pate et al. 2000), this type of school 
participation was summed separately from more academic activities. From a social capital 
perspective, however, this article considered sports participation as a resource for trust and 
social control and thus predicted it would be associated with reduced incidences of violent 
behavior. 


Neighborhood Attained Factors 
Median household income in 1989 was an interval variable ranging from $4,999 to $100,001. 
Approximately 73 percent of block groups in the study had median household incomes 
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between $21,000 and $50,000. Modal race status was a nominal variable with 1 = white, 2 
= black, and 3 = other. This variable was a constructed measure representing statistical 
mode or the race occurring most frequently in a block group. The data did not specify a cut- 
point for modal race status. About 81 percent of the block groups were predominately White. 
Adolescent respondents’ home locations were identified and linked to their respective block 
group census area. A block group is defined as a geographic area averaging 452 housing 
units or just over 1,000 people. There are generally 4-5 block groups per census tract. More 
specifically, a block is a “small, usually compact area, bounded by streets and other prominent 
physical features as well as certain legal boundaries” that does not cross census tracts or 
county lines (U.S. Bureau of the Census 1993). 


Neighborhood Social Capital 

The Neighborhood Involvement Index assessed level of contact and care in the neighborhood. 
Adolescents were asked if they (a) know most of the people in the neighborhood, (b) stopped 
on the street to talk with someone who lives in neighborhood in the past month, and (c) believe 
that people in neighborhood look out for one another. [tems were reverse coded so that higher 
values indicated greater involvement with neighbors (1 = False and 2 = True). The Religious 
Participation Index assessed active participation in neighborhood churches and affiliated youth 
group activities by adolescents. Respondents were asked (a) how often they attend 
neighborhood church services and (b) how often they attended youth activities sponsored by 
neighborhood churches, synagogues and other places of worship in the past 12 months. Items 
were reverse coded so that higher values indicated greater attendance (1 = never; 2 = less 
than once a month; 3 = once a month or less, but less than once a week; and 4 = once a 
week or more). The composite neighborhood involvement measure had a range of 1-6 (VM = 
5.27; SD = 0.96) and a Cronbach’s alpha of .55. The composite religious participation measure 
had a range of 1-8 (MW = 5.22; SD = 2.04) with a Cronbach's alpha of .91. 


Results 


Descriptive 


Table 2 presents frequencies and percentages from full and gender-, race- and age-based 
samples scoring at least one on the violence behavior index. For the full sample, more than 40 
percent reported one or more incidences of violence in the past year. Breaking this down by 
demographic characteristics, male, younger and non-white adolescents reported greater 
violence scores. 


Multivariate 


Table 3 presents the Violent Behavior Summation Index regressed on ascribed, attained and 
social capital factors. In Model 1, males, blacks and Hispanics reported greater overall 
violence (p < .01). In Model 2, family, school and neighborhood attained variables were 
introduced. Males continued to report significantly greater violence than females (p < .01). 
However, younger adolescents now had significantly higher violence scores (po < .01), 
whereas the coefficients for blacks and Hispanics lost significance. The effects of attained 
resources appear to have mediated the significance of these age and race coefficients so that 
incidences of overall violence were significantly increased for younger adolescents and 
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Table 2: Frequencies and Percentages of Violent Behavior Summation Index for Full Sample and 
Sociodemographic Subsamples 


Violent Behavior Summation Index 
> =1 incidence of overall violence 


_ Frequency Percent 

Full sample 

(n = 4,834) 2,061 42.9 
Gender 

Males 

(n = 2,315) 1,240 53.9 

Females 

(n = 2,519) 821 Boa, 
Race/ethnicity 

Whites 

(n = 2,832) 1,064 Sia 

Non-Whites 

(n = 1,965) 986 50.6 
Age 

13-15 years 

(n = 1,900) 830 44.0 

16-19 years 

(n = 2,879) 1,204 42.0 


significantly decreased for black and Hispanic adolescents. The attained resources of grade 
average (p < .01) and biological father (p < .05) were also significant. 

In Model 3, family, school and neighborhood social capital factors were introduced into the 
regression equation. Males and younger adolescents remained significant (0 < .01). Having a 
biological father in the home lost significance, but lower grade average continued to predict greater 
violence (po < .01). With the exception of involvement with neighbors, all social capital resources 
were significant. Specifically, adolescents reporting lower scores for parent-child relationship (0 < 
01), parent monitoring (0 < .05), school affiliation (o < .01), and religious participation (0 < .05) also 
reported higher values for the Violent Behavior Summation Index. Both sports (0 < .01) and club (o 
< .05) participation had direct positive associations with overall violence. 

The final model explained approximately 16 percent of variation in adolescents’ overall 
violence scores, whereas ascribed factors alone accounted for 7.2 percent of the variance. 
Neither family nor neighborhood attained variables contributed to the model with statistically 
significant coefficients. Likewise, the school attained variable of years enrolled at current 
school was not significant for the overall violence model. 


Discussion and Conclusions 
Although there is a considerable body of literature on adolescent violence, much of it is not 


specifically focused on the benefits of social capital for this behavioral outcome. In this study, 
attained resources were considered more tangible assets in a domain that potentially could 
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Table 3: Unstandardized and Standardized Regression Coefficients for Violent Behavior Summation 
Index Regressed on Ascribed and Domain-Specific Attained and Social Capital Factors (N = 4,834) 


re 


Model 1 Model 2 _ Model 3 
BU ATOD IG 1 rset tome diets tee Desert win Dene Pa DBs 
Sociodemographic 
Gender -.828" -251 -.760" -.230 -.744** -.226 
Age . 042 -038 ~-.067** -.062 ~-.067** -.061 
Black oO a UO 2 14U .037 134 033 
Hispanic 348% 059 .199 034 .156 .026 
Asian -302 -031  -.295 -.030  -.297 = -.030 
American Indian 420 WU Ee OO .020 Aes 014 
Attained 
Family 
Annual income .001 .026 .001 027 
Biological father -154* -045 -.084 = -.025 
School 
Grade average OL any | so22 elo 
Years enrolled at school 013 013 .004 .004 
Neighborhood 
Median household income -.001 -.016 ~—--.001 -.033 
Modal race .061 017 101 .028 
Social Capital 
Family 
Parent-child relationship -.051** — -.084 
Parental monitoring -112" = -.051 
School 
School affiliation -.038** = -.112 
Sports participation oes eO 
Club participation O72. O75 
Neighborhood 
Neighbor involvement -.093 — -.030 
Religious participation -.050* ~—-.062 
Constant 1.91 3.40 5.41 
R’ 072 mvs" erode 
Adjusted i .069 .098 148 


Note: b = value of the coefficient. 
One-tail test; *p<.05 **p<.01 
Unadjusted R’ change F test; ***p<.05  **** p<.01 
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influence violence outcomes. Again, these factors were not treated as forms of social capital — only 
resources accessed and utilized from social relationships with others in family, school and 
neighborhood domains represent social capital factors. Taking their effects into account in a social 
capital model of adolescent interpersonal violence was important for teasing out the relative 
importance of these objective features compared with intangible resources resulting from social 
connection or ties within a domain. How important was social capital relative to attained resources 
for adolescents in reducing violent behaviors? The.lack of significance for biological father in the 
home with respect to violence outcomes seemed to be balanced by the effects of parent-child 
relationship and parental monitoring. In other words, as it relates to violence, family social capital 
supersedes the importance of living with a biological father. Also, the effects of neighborhood 
structural indicators, low household incomes and high minority status, were not significant for 
adolescents. As with family social capital indicators, social capital found in religious participation 
seemed to offset the importance of these more objective block group characteristics for violence. 

The results suggest that cooperative and supportive ties with significant others in these 
domains are a means of suppressing violent outcomes even in the face of objective disadvantage 
for adolescents. It appears that the importance of interpersonal trust and social control garnered 
from constructive and cohesive relationships overshadowed the more tangible resources available 
in family, school and neighborhood domains. For example, adolescents depend on significant 
others to provide for their basic needs, including the ability to successfully grow and develop within 
communities of caring peers, parents and adults. The investment of time into social groups by the 
latter, whether formally or informally, is an integral part of forming a cooperative community where 
the responsibility for nurturing and tending to children and adolescents is shared across domains. 

An important consideration that merits further investigation is how more objective or 
structural aspects of a domain can increase or decrease the availability of social capital for 
adolescents. Causal models should be tested to determine if adolescents from households 
where the biological father was absent or from neighborhoods with low household incomes 
and high minority status are less likely to have resources to offset the likelihood of violent 
behaviors. These results also suggest that social capital models should be tested against 
other theoretical (e.g., social disorganization) and empirical (e.g., family structure) 
perspectives to bring to light contradictions in common assumptions and expectations about 
the effects of individual and social factors on adolescent violent behaviors. 

How are violent behaviors associated with social capital in an adolescent's family, school 
and neighborhood? First, adolescents were positively affected by social capital found in 
satisfactory parent-child relationships. Parents who are involved with and connected to their 
children accumulate social capital within the family. Social capital that is built through this 
investment of time and attention lessens the likelihood of violent behavior and thereby 
improves the likelihood of successful development of emotional and social skills for the child. 
Parents who commit to relationships with their adolescent children through shared activities 
create avenues for the transmission of social capital across generations. 

School affiliation also was consistently associated with lower incidences of violent outcomes 
for adolescents. Part of improving a school’s ability to insulate students from violence involves 
emphasizing socialization processes and curriculum-based academic achievement. 

Neighborhood social capital evident in active participation in local churches and other places 
of worship was an important resource for adolescents. It is known that involvement in religious 
institutions increases contact opportunities with others holding similar values and beliefs. As 
such, religious involvement provides a network of trust, cooperation and support that 
discourages youths from violent behaviors. Through religious affiliation, friendships develop that 
are conducive to health-enhancing behaviors rather than health-compromising ones (Litchfield, 
Thomas and Li 1997; Wallace and Forman 1998). Collaborative ties between adolescents and 
religious leaders become opportunities to access and use social capital outside the home. 
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Some forms of social capital did not have positive behavioral consequences however. 
Although it was expected that greater participation in school-sponsored activities would be 
associated with lower rates of violence, the results refute that prediction. These results were 
consistent with previous empirical evidence that sports and club participation are associated 
with increased fighting and delinquency (e.g., Roche 1999). Athletes may have overinterpreted 
the questionnaire items related to violence. For example, sports team members may have 
included skirmishes on the field or court as incidents of fighting, thus inflating or biasing results 
to more fighting with greater sports participation. Fighting on the field is an important indicator 
of violence and is no less unhealthy because it occurred in the context of competitive sports 
participation. Such skirmishes should serve as a signal to coaches and teachers to effectively 
counsel athletes about anger and conflict management, thereby emphasizing sportsmanship 
both on and off the field. Such efforts would enhance the larger goal of positive school 
environments characterized by cohesiveness, involvement and adherence to traditional 
constructive values instrumental in mitigating social stress and pressures in this domain. 

This, however, does not explain why greater club participation was associated with increased 
violence. The theoretical explanation of the positive association between sports participation 
and violence outcomes provides a more comprehensive interpretation. That is, sports and club 
participation are inherently social situations in which ties with others are based on common 
characteristics or goals. The esoteric nature of such association may affect social capital in a 
way that undermines or distorts the value or effectiveness of its resources. That is, the close, 
dense nature of social ties among those participating in sports or clubs may stifle individuality 
and become the impetus for unique norms or standards of conduct for members (Portes 1998). 
In these cases, Members may become targets for those outside the group, or they may group 
together in or as a gang, using their social unity and capacity for violence against other students. 
Organized sports in particular serve as a point of reference in making decisions and often 
exacerbate ingroup and outgroup differences, increasing competition or opposition. It is evident 
that these associations are complex and deserve further attention in adolescent health studies. 


Study Limitations 


Because a cross-sectional study design was chosen for this research, causality between 
explanatory variables and violent outcomes cannot be inferred from the reported associations. 
Although temporal ordering of variables cannot be confirmed in this analysis, consistent and 

strong associations between statistically significant resource factors and violent behaviors 
provide a basis for future multivariate models derived from social capital perspectives. The 
associations revealed from this study's findings also facilitate an exploration of the long-term 
effects of significant resource variables on behavioral outcomes over time. 

Another caveat of this study involves the generalizability of Add Health sample results to 
adolescents not enrolled in school. Add Health is a school-based study with respondents 
selected from rosters of participating schools. Adolescents not enrolled in school were not 
eligible to participate. It may be possible that youth in the excluded population have increased 
rates of risk behavior or chronic/disabling medical conditions. Data on adolescent violent 
behavior were obtained using a self-report methodology, creating a potential for bias. To 
minimize the potential influence of parent or interviewer, responses to sensitive issues such as 
those related to violent behaviors were recorded on laptop computers. With audio computer- 
assisted self-interview, adolescents listened to prerecorded questions and entered responses 
directly into a computer. The increased confidentiality promotes more complete and accurate 
reporting of violent behavior. To this end, evidence finds that such measures of adolescent 
health and behavioral risk outcomes are generally reliable and valid (e.g., Brener et al. 1995). 
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Finally, three of the explanatory indices (parental monitoring, school affiliation and 
neighborhood involvement) are below what Nunnally (1978) indicated to be an acceptable 
reliability coefficient (.70). In practice, however, lower thresholds are often used in the 
literature. The items used for the composite measures form a theoretical definition of the 
index that reflects the content of a multi-domain social capital model. Nevertheless, some 
caution should be taken when interpreting the findings. 


Conclusions 


A social capital model was chosen to frame this study because it related social positions or roles 
with social relationships by arguing that both can predictably influence behavior. By 
discriminating between ascribed, attained and social capital factors, this research was able to 
identify the most significant factors associated with reduced frequency of violent behaviors for 
study adolescents. This information not only improves models for adolescent interpersonal 
violence but also moves conceptualization and measurement forward. At the same time, it 
brings to light contradictions in common assumptions and expectations about the effects of 
social capital on adolescent violent behaviors. 

The results of this study show that school- and community-based conflict resolution 
programs or classes that emphasize the importance of diffusing or evading physically 
threatening situations need to be established. Collaborative efforts between families, schools, 
neighborhoods and their institutional affiliates (e.g., churches) can mutually reinforce each 
other by discouraging violent means of self-expression. Also, integrating adolescents and their 
families into the neighborhood are crucial for creating extra-familial ties. Evidence suggests this 
may reduce social and physical isolation and create a sense of being part of a cohesive group 
with collective efficacy (MacMillan and Chavis 1986). This calls for the design or reclamation of 
public spaces that are free from threats of crime or violence. The creation or restoration of safe 
sidewalks, parks and other public areas will enable adolescents and adults in neighborhoods 
to establish or reestablish social connections and ties with each other, creating new 
opportunities for the benefits of social capital as both an individual and group good. 
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Power and Dependence in Intimate Exchange 


Arnout van de Rijt, Cornell University 
Michael W. Macy, Cornell University 





Abstract 

A division of labor is mediated by exchange of valued goods and services. We use social 
exchange theory to extend this principal to “labors of love.” Sexual activity in a close 
personal relationship seems outside the domain of bargaining and exchange. Nevertheless, 
we explore the possibility that this most intimate of human relations is influenced by 
exchange mechanisms. We derive exchange-theoretic predictions about the level of sexual 
effort and test these using U.S. survey data on sexual behavior. Results provide modest 
support for the predictions. Sexual favors are reciprocated, and individuals offer greater 
sexual gratification to partners who are themselves more sexually generous and less 
emotionally attached. Evidence is inconclusive for the effects of relative income, physical 
attractiveness and household chores. 


The Theory of Social Exchange 


The social division of labor is mediated by exchange, the terms of which reflect relative power. 
Social exchange theory (Blau 1964; Emerson 1976; Homans 1958) models power in an on- 
going relationship as a bargaining process that reflects the relative dependence of the 
partners. The theory has received widespread empirical support in several domains, including 
marriage. Although there are numerous exchange-theoretic studies of marital relations, 
researchers have been understandably reluctant to extend the theory to intimate relations that 
seem outside the sphere of bargaining and exchange. As Blau (1964:88) warned, “People do 
things for fear of other men [sic] or for fear of God or for fear of their conscience, and nothing 
is gained by trying to force such action into a conceptual framework of exchange.” Yet Blau 
did not regard intimate relations as outside this framework. “Social exchange can be 
observed everywhere once we are sensitized by this conception to it, not only in market 
relations but also in friendship and even in love ...” (1964:88) We explore the possibility that 
this most intimate of human relations is governed by the same exchange mechanisms 
identified by Blau and elaborated by Emerson. | 

We begin by briefly outlining the theory of social exchange and reviewing applications of the 
theory. to marital relations. We then propose an exchange-theoretic model of sexual labor, derive 
predictions about exchange outcomes, and test these hypotheses using survey data. The model 
predicts greater sexual effort toward partners who are themselves more sexually generous, and 
who are more desirable, less dependent and who contribute more to the household. 

Social exchange differs from economic exchange in three important ways. First, the 
articles of exchange are not commodities but gifts. No money is involved, nor credit, nor 
contract. Giving a gift could be seen as a “selfish act of generosity” in that it creates in the 


We thank Stephen W. Benard, Douglas D. Heckathorn, Edward O. Laumann and Sarah Thebaud for 
helpful comments and suggestions on earlier drafts. We also express our appreciation to the National 
Science Foundation (SES 0241657 and SES 0432917) for support provided to the second author. 
Direct correspondence to Arnout van de Rijt, Department of Sociology, Cornell University, 323 Uris 
Hall, Ithaca, NY 14853. E-mail: av85@cornell.edu. 


© The University of North Carolina Press Social Forces, Volume 84, Number 3, March 2006 


1456 © Social Forces Volume 84, Number 3 * March 2006 


recipient the need to reciprocate with something that is desired by the giver. Both parties to 
the exchange “are prone to supply more of their own services to provide incentives for the 
other to increase his supply.” (Blau 1964:89) Simply put, in social exchange theory, a gift is not 
an expression of altruism, it is a way to exercise power over another. 

Second, the terms of exchange are unspecified.’ One side offers something the other 
values without knowing how or when the partner will return the favor. 

Third, the exchange is not instrumentally calculated. Without a quid pro quo and in the 
absence of explicit bargaining, one cannot know if the gift is optimal in a given transaction. 
Instead, optimization takes place on the fly through incremental adjustments to behavior in 
response to experience. These need not be conscious adjustments but could be experienced 
merely as feelings of satisfaction or dissatisfaction with the relationship, such that the terms of 
exchange emerge as a byproduct of a learning process. Each partner evaluates the outcomes 
from the exchange relative to a comparison level corresponding to what the actor expects to 
receive from his or her best alternative relationship. When the value falls below this standard, the 
individual is dissatisfied and seeks alternative partners whose offers are perceived as superior 
(Thibaut and Kelley 1959). On the other hand, “if the outcomes they are receiving from their 
current relationship are better than what they expect to receive from their best alternative(s), 
they will feel dependent on the relationship and become committed to it.” (Sprecher 1998:34) 

These differences with economic exchange make SET applicable to emotionally-charged 
behaviors where instrumental manipulation of the partner would ruin the experience for both. 
Social exchanges can be experienced as acts of generosity toward those we trust. Trust is 
necessary because of the unspecified terms of exchange. It increases when the generosity is 
satisfactorily reciprocated and decreases when it is not (Blau 1964:93). 

Although social exchange lacks explicit terms of trade, enforceable contracts or a 
monetary medium, it nevertheless follows a basic principle of economic bargaining over the 
price of commodities, the “principle of least interest.” (Waller 1937; Waller and Hill 1951) This 
principle predicts that the lower the dependence on the relationship, the greater will be the 
power. “The party who is receiving the least comparative benefit from a trade has the greater 
bargaining power to improve upon that trade. If that power is used, as we assume above, 
then the terms of the trade will shift until power is balanced.” (Cook and Emerson 1978:724) 


From Social to Intimate Exchange 


SET applies to the balancing of power in exchanges between workers, neighbors, friends and 
business associates. It has also been applied to exchanges between intimate partners. 
“Whether it is two lovers who share a warm and mutual affection or two corporations who 
pool resources to generate a new product, the basic form of interaction remains the same,” 
note Lawler and Thye (1999:217). “Two or more actors, each of whom has something of value 
to the other, decide whether to exchange and in what amounts.” 

There is a large and growing research literature that has found consistent support for SET 
predictions for exchange outcomes in intimate relations. Sprecher (1998:32), one of the key 
contributors to this field, surveys applications of SET to research on “mate selection, 
relationship formation, and the prediction of relationship dissolution.” A central hypothesis is 
the matching hypothesis (Hatfield et al. 1985; Sprecher and Hatfield 1996: Walster et al. 
1966), which states that the more socially desirable a person is, the more socially desirable 
will (s)he expect a partner to be (Sprecher 1998:35). 

In the realm of mate selection, South (1991:928) summarizes a number of studies that 
demonstrate strong tendencies toward homogamy — the tendency to marry persons close in 
status (Kalmijn 1998) — based on age (Atkinson and Glass 1985), race and ethnicity (Labov and 


Power and Dependence in Intimate Exchange * 1457 


Jacobs 1986; Tucker and Mitchell-Kerman 1990), religion (Glenn 1982), education (Rockwell 
1976; Schoen and Wooldredge 1989), and occupational status (Morgan 1981). Homophily 
(Buss 1985; Byrne 1971; Byrne and Nelson 1965; Hoffman and Maier 1966; Rubin 1973) - the 
principle that “likes attract,” also known as the “law of attraction” — provides a straightforward 
explanation. A related explanation focuses on the greater opportunities for interaction with 
those who are less culturally or socially distant (Blau and Schwartz 1984; Kalmijn and Flap 
2001). The matching hypothesis suggests an alternative possibility. Homogamy may reflect 
not a taste for similarity but rather constraints on the ability to attract a partner who has more 
valued resources. From an exchange-theoretic perspective, romantic relationships are formed 
through an assortative matching process in which women and men look for the best “catch” 
(Shibazaki and Brennan 1998). 

However, the matching hypothesis does not require a like-kind exchange. Becker's (1973, 
1981) specialization hypothesis, that it is difference rather than similarity in human capital that 
makes marriage worthwhile, is perfectly compatible with SET. lwo complementary resources 
are exchanged. Using data from the National Survey of Families and Households, South 
confirms the popular impression that “men place higher value than do women on physical 
attractiveness and youth, while women are less willing than men to marry someone with low 
earnings or unstable employment.” (South 1991:928) Sprecher (1998:928) also cites several 
other studies that indicate that “women’s physical attractiveness is often exchanged for a 
man's wealth or social standing (e.g., Elder 1969; Stevens, Owens and Schaefer 1990; Taylor 
and Glenn 1976; Udry 1977; Udry and Eckland 1984).” However, as women’s socio-economic 
standing has improved during the second half of the 20" century, SET predicts more 
symmetrical exchanges in current relationships. This trend toward greater symmetry has 
been noted in a number of studies, both theoretically (Bergstrom and Bagnoli 1993; 
Oppenheimer 1988; Schoeni 1996) and empirically (Blackwell 1988; Kalmijn 1991; Mare 
1991; Sweeney and Cancian 2004). 

Although exchange theorists have focused mainly on mate selection, recently attention 
has shifted to the problem of marital stability, satisfaction and duration (McDonald 1981). 
Equity theory (Walster [Hatfield] et al. 1978), a variant of SET, suggests the hypothesis that the 
more equitable the exchange of resources in a relationship, the more stable, satisfying and 
durable the relationship will be. Various studies document support for the equity hypothesis 
(Walster [Hatfield], Walster and Traupman 1978; Schaefer and Keith 1980; Traupman et al. 
1981: Cate et al. 1982; Davidson 1984; Michaels, Edwards and Acock 1984). While Sprecher 
finds the predicted effects of equity in three separate studies (1986, 1988, 1992), she notes 
(1998:34) that the theory overlooks the importance of the comparison level. Comparisons 
with prospective others are particularly important among young, relatively uncommitted 
partners. The rewards from the exchange are compared not only to those given to the partner, 
but also to those that one might receive in an alternative relationship. Relationships are 
predicted to be stable, satisfactory and durable if rewards in the current relationship are above 
the ones that one would expect to get in another relationship. According to Sprecher 
(1998:35), “most researchers examining the degree to which equity forecasts relationship 
stability have not found it to be a good predictor... On the other hand, absolute reward level, 
investments, and poor alternatives have been good predictors of relationship longevity.” (e.g., 
Felmlee et al. 1990; Rusbult 1983) Sprecher (2001) reproduces this result for dating couples. 
Effects of rewards, investments and alternatives on satisfaction and commitment were in the 
expected direction and generally significant, while results for the effects of inequity were less 
convincing (Sprecher 2001:606). 

SET has also been applied to the exercise of power in the family. The principle of least 
interest predicts a positive effect of relative socioeconomic position on conjugal power in 
decision-making (Heer 1963; Rank 1982). For example, Sprecher and Felmlee (1997) report 
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that women with high-paying jobs are less dependent on their husbands and thus have more 
power in marital exchange than do women without such jobs. Similarly, “being the less 
emotionally involved partner in the relationship was associated with greater power.” (Sprecher 
and Felmlee 1997:361) 

Sprecher (1998) overviews evidence for the operation of exchange mechanisms at three 
stages of the sexual relationship: the negotiation of the onset of sex, evaluation of sexual 
satisfaction and decisions about extra-dyadic sex. Walster [Hatfield], Walster and Traupmann 
(1978) found that undergraduates who had relationships that were perceived as more 
equitable were more likely to engage in premarital sex. Lawrance and Byers (1992) note that 
a model of sexual satisfaction “needs to take into account the interpersonal context in which 
sexual activity occurs... exchange theories offer such an approach.” (p. 268) They found 
sexual satisfaction to be independently related to both absolute and comparative reward and 
cost levels and equality in reward. 

Sprecher also reports support for SET in studies of extra-dyadic sex (1998: 38). She 
speculates that “extra-dyadic behavior may be one way in which a partner in an inequitable 
relationship restores equity to his or her relationship.” Both Hatfield et al. (1979) and Prins, 
Buunk and Van Yperen (1993) find a positive relationship between inequity and desired and 
actual extra-dyadic involvement, the latter only for women. Rusbult, Drigotas and Verette 
(1993) find that a constructive response to extra-dyadic sex (i.e., trying to save the 
relationship) is more likely in the case of greater satisfaction, commitment, investments and 
poorer alternatives. 


Intimate Exchange ‘ 


Although support has been found for exchange theory hypotheses about the formation, 
duration and satisfaction of sexual relationships, one area of intimate exchange that has been 
largely ignored by SET is the exercise of power in the bedroom. The important exception is 
Sprecher, who believes “this framework is useful for understanding... which partner has more 
influence on what sexual activities they do together.” (1998:32) And “because most dyadic sex 
occurs within the context of an emotional relationship in which partners are interdependent at 
many levels,” she continues, “an exchange approach is particularly applicable to the study of 
sexual phenomena in close relationships." (1998:40, emphasis in original) 

SET predicts that the exchange of sexual favors will be direct reciprocity or “for other 
resources in the relationship such as intimacy, love, favors and money.” (Sprecher 1998:32) 
Based on the least-interest principle, “the partner who is more reluctant to have sex has a 
valued resource — his or her consent to sexual activity. At some point he or she may agree to 
have sex, and although there may never be any discussion of a ‘trade’ or an ‘exchange,’ one 
or both partners may treat the act as having exchange value. Gifts or special favors may be 
presented by the person who wants sex more, either before sex occurs (to create a sense of 
obligation) or after sex begins (as a form of reciprocation).” (Sprecher 1998:36) Sprecher 
offers an example of a working-class woman interviewed by Lillian Rubin, who appreciated 
the exchange value of oral sex: “He gets different treats at different times, depending on what 
he deserves. Sometimes | let him do that oral stuff you're talking about to me. Sometimes 
when he's very good, | do it to him.” (Rubin 1976:207) 

More precisely, SET predicts that, all else being equal, sexual effort (defined as the acts 
aimed at the physical gratification of one’s partner) increases with the gratification received 
from the partner, and with the value of other resources that one’s partner brings to the 
relationship, including contributions to household tasks. Conversely, effort decreases with the 
value of other resources that one brings to the relationship. 
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In addition, SET also predicts that sexual effort varies with relative dependence on the 
partner as a source of valued resources. The greater the relative attraction and attachment 
to the partner, the greater the partner's power, and thus the greater the sexual effort that must 
be invested in the partner in order to balance the partner's power. 


Measures and Methods 


We tested the SET predictions using data from the Chicago Health and Social Life Survey 
(CHSLS), a retrospective survey (NV = 2,114) containing detailed questions about past and 
Current sexual relationships and with unique information on sexual activities. The CHSLS was 
conducted between 1995 and 1997 using a stratified random sample of the non- 
institutionalized Chicago population, composed of five sub-samples on two geographical 
levels. The greater Chicago sub-sample consists of 890 respondents from Cook County, 
including the inner suburban ring. Another 1,224 respondents were sampled from four 
ethnically-targeted neighborhoods within the city of Chicago, each characterized by the 
dominance of a particular ethnic group. Response rates were 71 percent for the city-level sub- 
sample and ranged from 60 percent to 78 percent for the neighborhood sub-samples. We 
tested for neighborhood-level heterogeneity by estimating the model separately for each 
neighborhood but found no significant neighborhood differences. Therefore we report only 
the results for the pooled sample. 

The survey was executed using Computer Assisted Personal Interview (CAPI) techniques, 
with responses entered directly into lantop computers. Respondents were surveyed in 
person by experienced interviewers from the National Opinion Research Center (NORC), who 
matched respondents on various social attributes such as race and ethnicity, for an interview 
averaging 90 minutes. Spanish-speaking interviewers were employed so that recent migrants 
lacking facility in English could be included. 

Respondents answered a series of questions concerning their most recent sexual 
relationships. Table 1 displays the descriptive statistics of the observed variables that we used 
in the empirical analysis. Sexual effort refers to the willingness to confer physical and sexual 
pleasure.? Respondents were asked to indicate on a scale of 1 (always) to 5 (never) their own 
and their partner's participation in six sexual activities: active & passive chest/breast 
stimulation, active & passive manual sex, and active & passive oral sex. The three responses 
for respondent F's participation were standardized and added as a composite measure of F's 
sexual effort (a = .69). A’s three responses for partner P’s participation were standardized and 
added.as a composite measure of P’s sexual effort (a = .72).' 

We also included a measure of A’s desire to please P, and A's desire for P to please A. 
Unfortunately, the survey included only a single activity, oral sex, using two items that asked 
Ff to indicate on a scale of 1 to 4 how appealing they found “performing oral sex on your 
partner” and “your partner performing oral sex on you.” 

Attractiveness was measured by responses to the question: “How would you rate your 
partner in terms of attractiveness compared to other men/women in his/her age group (during 
the time you were involved with him/her),” on a scale of 1 (very unattractive) to 7 (very 
attractive). A similar question was asked about A's own attractiveness. 

Emotional attachment was measured by responses to two related questions: “On a scale 
of 1 (not at all) to 5 (deeply), how in love is your partner with you?” and a similar question 
about A’s love for her or his P. 

We also use measures for the total personal income of last year for both R and PR 
Respondents picked one out of a series of income categories. We constructed two income 
variables, transforming the mean income value of each category by taking the natural 
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Table 1: Mean, Standard Deviation, Minimum, Maximum and Number of Cases for Each 
Observed Variable 


EE 


Variable Mean St.Dev. Minimum Maximum N 


Dependent 
R’s sexual effort? 597 Sea 0 1 1,491 
Breast/chest stimulation 2.973 1.183 0 4 1,491 
Digital sex 2nee 1.206 0 4 1,491 
Oral sex 1.466 1275 0 4 1,491 
Independent 
Male 419 493 0 1 1,491 
Race / ethnicity 
African American 293 455 0 1 1,482 
Hispanic .290 454 0 1 1,482 
White Oth .465 0 1 1,482 
Other .098 .298 0 1 1,482 
R’s age 35.299 10.270 18 60 = 1,491 
P’s age? 35.703 10.799 18 72 ~—- 1,490 
R’s education 1.973 1.751 0 6 1,491 
P’s education 2.026 1.755 0 6 1,481 
R’s church attendance 4.064 2.101 1 8 1,491 
R’s attraction to P 5.801 1.261 1 r 1,489 
P’s attraction to R 5.144 1.290 1 il 1,478 
R’s attachment to P 4.232 1.088 1 5 1,490 
P’s attachment to R 4.339 1.011 1 5 1,477 
R’s income 25,803 38,284 2,500 400,000 1,453 
P’s income 26,181 36,689 0 400,000 1,367 
P’s sexual effort .607 .242 0 1 1,491 
Breast/ chest stimulation 3.029 1.114 0 4 1,491 
Digital sex 2.704 1.221 0 4 1,491 
Oral sex 1.07 1.287 0 4 1,491 
R desires active P 2.975 1.131 1 4 1,473 
R enjoys own effort 2.726 1.081 1 4 1,472 


logarithm. Note that the difference in standard deviation between the two variables in table 
1 is a result of a difference in categorization in the questionnaire. 

Respondents were also asked to indicate the weekly hours spent by R and P on seven 
different household tasks: preparing meals, washing dishes, cleaning the house, outdoor 
tasks, shopping, washing and ironing, and paying bills. Participation in household tasks was 
asked only if R and P cohabited at the time of the interview. 

We included standard demographic controls for age, gender, ethnicity and education 
(highest degree). Three ethnic compositions of relationships dominate: both Hispanic (431), 
both white (470), and both African American (436). We created four race/ethnicity dummy 


Power and Dependence in Intimate Exchange + 1461 


variables, one for each of the above and one ‘other’ category (146), which contains mixed race 
couples and very small minorities such as Asian and Native American couples. We also 
included a measure of church attendance because religious norms are likely to influence 
sexual behavior. 

This information was available for the two most recent heterosexual’ relationships that were 
current at the time of the interview (VV = 1,491). Our analysis is limited to these observations. 
The two ‘most recent’ relationships are those for which the moments of first sexual contact are 
most recent. A relationship is considered ‘current’ on the basis of the answer to the question, 
‘Are you still sexually involved with your partner, that is, do you expect to have sex with your 
partner again?” One hundred fifty-five respondents answered this question affirmatively for 
both their most recent and second most recent sexual relationship. For these respondents, we 
have two observations.® Our estimates are based on the assumption of independence 
between respondents, but interdependence between multiple observations for a single 
respondent (Huber 1967). We thus obtained conservative standard errors. 


Measurement Validity 


Following the publication of “The Social Organization of Sexuality” (Laumann et al. 1994), 
based on responses to a survey very similar to the one used in our study, critics scoffed at the 
ability to study sexual behavior using survey responses (Lewontin 1995). We acknowledge 
that survey respondents, at best, “tell us the truth as they see it.” (John H. Gagnon in Boxer 
2000) For example, Table 1 shows that the people of Chicago, like the children of Lake 
Wobegon, are “all above average” in attractiveness, indicating that respondents have inflated 
assessments of themselves and their partners.’ We also recognize the need to confirm 
survey-based results with in-depth interviews, ethnographic case studies and laboratory 
experiments that provide measures whose strengths and weaknesses complement those of 
the survey methods. Nevertheless, we believe much can be learned from survey responses, 
despite the discrepancies that undoubtedly exist between what respondents report and what 
they experience. 

We base this view on four considerations. First, the theory we are testing is about the 
effects of subjective beliefs, not objective conditions. If R’s assessments of A’s and P's 
attractiveness differs systematically from the consensus of third parties, it is not a problem as 
long as those inflated reports correspond to what A actually believes. Second, even if 
respondents intentionally and consciously misrepresent what they believe, these distortions 
do not necessarily invalidate the results. For example, misrepresentation of attractiveness, 
sexual effort or emotional attachment does not bias the results if the errors are uncorrelated 
with the dependent measure, and moreover, the inflated scores impose a Ceiling effect that 
actually makes our estimates conservative. Third, because of the sensitive nature of some of 
the questions (Laumann et al. 2004:51), special care was taken to ensure response validity, 
including self-administration of some items and the use of racially matched interviewers. Less 
than 1 percent of respondents refused to answer the questions about intimate behavior. 
Fourth, the responses in this survey were in line with those of other surveys (Hamilton 2003). 


Results 


We present the results in three parts. We begin with the effects of the demographic controls. 
We then address the two principal SET predictors, direct reciprocity and dependence. Finally, 
we consider secondary SET predictors (e.g., non-sexual contributions). 
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Table 2 reports unstandardized multivariate regression coefficients for three models of R's 
sexual effort.2 Models 1 and 2 are nested. Model 1 includes only demographic measures and 
factors that are not relevant to SET. Model 2 then adds the SET predictors, except for 
household effort (which was only asked of cohabiting couples). Both models are estimated 
for the full sample. Model 3 adds household effort for a sub-sample of cohabiting couples. 


Table 2: Three Regression Models of Respondent's Sexual Effort 


aS 


Predictor Model 1 Model 2 Model 3 

B S.E. B 5.E, B S.E. 
Male .0699*** .0128 .0818*** .0103 .0856*** + .0140 
African-American -.0819*** .0167 -.0160 .0120 0006 = .0141 
Hispanic 1213") 20181 -.0225"  .0134 -.0302*  .0151 
Other race / ethnicity 0097 ~=—- .0178 0032 ~=.0132 -0047 0174 
R’s age’ -.0011 .0011 0010  .0008 0003 ~=.0010 
P’s age’ -.0014  .0011 -0010  .0008 -.0009 .0010 
R’s education .0178*** = .0045 .0031 .0034 0066 =. .0042 
P’s education .0087*  .0042 0039 ~=—- .0030 -.0001 .0039 
R’s church attendance -.0075*  .0030 -.0017 0022 -.0016 .0024 
P’s sexual effort .5691*** .0225 5765** .0261 
R’s attraction to P 0062 ~§=.0039 0085" .0045 
P’s attraction to R -.0050 .0038 -.0062  .0041 
R’s attachment to P .0234** .0069 0167 0109 
P’s attachment to R -.0206** .0073 -.0145  .0114 
R’s income (log) 0050 §=.0049 0046 = .0057 
P's income (log) .0046** .0014 .0043*  .0017 
R enjoys own effort .0562*** .0070 .0554***  .0086 
R desires active P -.0023 0071 -.0092 .0084 
R's household effort’ -.0003  .0002 
P’s household effort 0003 =©.0002 
Constant .6957*** 0294 0346 = .0529 0070 =: .0698 
R’ 18 61 63 
N 1491 1491 946 


Note: (Models 1 and 2 are nested and run on the full sample. Model 1 includes only 
demographic variables and factors that are not relevant to SET. Model 2 adds the SET variables 
except household effort. In model 3, measures of household effort are added for a sub-sample o 
cohabiting couples.) 
*R = Respondent 

P = Partner 
“ Currently cohabiting couples only 
Ap<.10 *p<.05 “*p<.01 ***p <.001 (based on a two-tailed test) 
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Demographic Differences in Sexual Behavior 


Model 1 documents demographic differences in sexual behavior, based primarily on gender, 
ethnicity and education. All else being equal, male respondents report a .07 higher level of 
sexual effort than females (on a scale of 0, no effort, to 1, maximum effort), while African- 
Americans (-.08) and Hispanics (-.12) report less sexual effort than do white couples. (See also 
Laumann et al. 1994.) Sexual effort also increases with R's education (.02) and decreases with 
religious attendance (-.01). Age negatively affects sexual effort (James 1974; Jasso 1985; 
Kinsey et al. 1948, 1953; Udry 1979; Udry and Morris 1978; Udry et al. 1982). Due to mild 
collinearity, the effect of R's age becomes significant when P's age is omitted as predictor. 

Except for gender, these demographic differences largely disappear after we take into 
account the determinants of power as hypothesized by SET. The inability of SET to explain 
gender differences should not be surprising, given the existence of cultural norms that define 
gender-appropriate sexual behavior (West and Zimmerman 1987; Ridgeway and Correll 2004). 
Some of these differences are related to differences in the bargaining power men and 
women derive from the resources they bring to the relationship. 


Core SET Predictions: Reciprocity and Dependence 


The core prediction in SET is direct reciprocity, in which effort is rewarded with effort. The results 
suggest that this principle extends to intimate exchange. The single most important predictor of 
sexual effort is the effort received (.57, p < .001). This means that a 1 point increase in effort by 
the partner produces a .57 increase in one's own effort.’° Despite the size and significance of the 
effect, the evidence is only suggestive. Part of this effect could be spurious for two reasons. 
First, respondents with inflated or deflated perceptions of the frequencies of P’s sexual activities 
may have similar biases in their self-assessments. Second, respondents may be reluctant to 
report imbalances in the frequencies of active and passive sexual activities. Future studies could 
avoid these potential biases by interviewing both partners separately. 

Along with A's desire for P's effort, we also included a measure of A’s enjoyment of A's 
own effort. Model 3 shows that A’s effort increases with A's intrinsic gratification and 
decreases with A’s lack of enjoyment (.06). This is consistent with the exchange-theoretic 
prediction that resistance to exchange increases with the cost of the expenditure. 

Along with direct reciprocity, a key prediction in SET is the effect of relative dependence 
on generosity toward the exchange partner. All else being equal, the greater one’s 
dependence on the partner for a valued resource, the more one is willing to exchange in order 
to obtain it. Results for Models 2 and 3 (in which the effects are somewhat weaker) provide 
some support for SET. A's attachment to P increases A's sexual effort in Model 2, while P's 
attachment to F has the opposite effect. The effects are smaller and statistically insignificant 
in Model 3, which is limited to a sub-sample of cohabitants. The reason for this difference is 
that virtually all cohabitants indicate that they love their partner a great deal, and that this love 
is reciprocated, leaving little variation in these two variables. 

The effects of attachment also appear to account for much of the effect of physical 
attraction on sexual effort. The effects of physical attraction are in the right direction, but are 
too small to rule out random fluctuation as the default explanation. The effects become 
significant when measures of attachment are excluded from the model, suggesting the 
possibility that at least some of the effect of physical attraction on effort may be mediated 
by increased willingness to commit to the relationship. 

Although reciprocity and dependence have the greatest weight in theories of social 
exchange, other valued resources can also indirectly affect exchange outcomes. In intimate 
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exchange applications, these other resources might include income (which has a long history 
of exchange for sexual gratification) and other contributions to the household, such as cooking, 
cleaning and yard work. Results reported in Table 2 show that these secondary factors do have 
some effect on sexual effort, albeit modest. The effect of P’s income is in the predicted 
direction and strongly significant, while the effect of A’s income is in the wrong direction and 
insignificant. Separate regression models for male and female respondents (not shown in Table 
2) reveal that men (o < .001) but not women respond to P's income, while women (o < .01) but 
not men respond to P’s degree. Neither men’s nor women’s effort increases with their own 
income or degree. These findings do not parallel the significant effects of income and 
education that have been found for the case of household effort as the dependent variable (cf. 
Bianchi et al. 2000). For the sub-sample of cohabiting couples, these models show a marginally 
significant (9 < .1) positive effect of women’s income on their sexual effort. 

We also found that the effects of household effort by respondent and partner were in the 
expected directions but the effects were not significant for the sub-sample of cohabiting 
couples. Decomposing the composite measure of household effort does, however, reveal a 
significant effect (o < .05) for preparing meals, while all other chores had insignificant effects. 
Apparently, the way to a lover’s heart is through his or her stomach." 


Discussion and Conclusion 


The division of labor is mediated by exchange of valued resources. This study extends social 
exchange theory to labors of love. Although this most intimate of human relations seems 
outside the domain of bargaining and exchange, we find evidence that the most important 
determinant of sexual effort is effort received, suggesting that intimate exchange Is governed 
by principles of reciprocity similar to those that have been widely observed in kinship, 
marriage, friendship and work relations. 

Social exchange theory also emphasizes the importance of dependence as a determinant of 
exchange outcomes, and that prediction is also partially supported by this data along with non- 
sexual contributions to the relationship. Regardless of gender, respondents report greater sexual 
effort if they are more deeply loved and cook. Evidence regarding the effects of income, physical 
attractiveness, and household work other than cooking, however, is inconclusive. 

The positive effect of women’s income on their sexual effort constitutes the only direct 
violation of SET predictions. This effect may be due to self-selection in which women with 
more assertive attitudes are more successful in their careers and more active in their intimate 
relations. An alternative explanation is masculine overcompensation in reaction to an identity 
threat (Brines 1994; Bittman et al. 2003; Munsch and Willer 2005; Willer 2005). 

The support for exchange theoretic predictions should not lead us to downplay the importance 
of emotional determinants of intimate sexual behavior, just as one would not want to deny the role 
of emotions in other relations such as work teams. As Thibaut and Kelley (1959) showed, there is 
no need to assume coldly calculating behavior (although there is also no reason to preclude it). 
Exchange outcomes can also be obtained through a learning process that converges with the 
balance point predicted for actors whose choices are strictly governed by instrumental calculation. 

Another important determinant of the division of labor in the bedroom is gender. While 
SET is gender-neutral, some of our effects vary by the gender of the partner. Resources that 
women and men bring to the relationship may be differently valued by women and men, and 
gender norms of sexuality (cf. Sanchez, Crocker and Boike 2005) may keep exchange partners . 
from obtaining the full sexual rewards for their resources. 

Although the results we report are consistent with SET and with intuitions about the non- 
instrumentality of sexual behavior, we nevertheless urge caution in generalizing from this 
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single study. The more private the behavior, the greater the difficulty in making accurate 
measurements. New surveys need to be conducted, with items constructed to better gauge 
the full range of manifestations of sexual effort. We also need to replicate these findings using 
different data collection methods. For example, the quote from Rubin's (1976:207) qualitative 
study strongly suggests an exchange-theory interpretation of our quantitative findings rather 
than an interpretation informed by another theoretical framework. While we caution against 
reading too much into these results, we do hope this study will encourage other researchers 
to extend social exchange theory from non-sexual marital relations, where it has been widely 
employed for several decades, to the bedroom, which has remained off limits. 


Notes 


1. “Social exchange differs in important ways from strictly economic exchange. The basic 
and most crucial distinction is that social exchange entails unspecified obligations.” (Blau 
1964:91-2) 


2. We did not use data from the National Health and Social Life Survey (NHSLS), the 
predecessor of the CHSLS, because it does not include questions on sexual effort. 


3. Respondents privately answered written questions on the frequencies of sexual practices 
for themselves; no interviewer saw their answers and respondents were guaranteed the 
information they provided could not be traced back to them. Less than 1 percent of the 
respondents refused to answer these questions. 


4. Principle Component Analysis reveals a single dimension with all measures scoring 
positively, accounting for 65 percent of the variance. This supports the validity of our 
composite measure. 


5. Although social exchange theory applies equally to homosexual and heterosexual 
relationships, there were too few cases (/V = 92) to establish statistical significance for the 
predicted effects, but the signs of the effects were identical to those for heterosexual 
relationships. 


6. Results are not qualitatively different if these 310 relationships are omitted. 


7. We checked self-reports of attractiveness against an interviewer-reported measure. 
Although the correlation was statistically significant (0 < .001), the relationship was not 
strong (r = .108), suggesting that beauty is indeed in the eye of the beholder, especially 
when looking in the mirror. The interviewer-reported measure was not available for partner 
attractiveness. 


8. Byers, Demmons, & Lawrance (1998) use SET to predict sexual satisfaction within dating 
relationships as a function of many of the same factors we use to predict sexual effort. 
We therefore estimated parallel models with sexual satisfaction, rather than sexual effort, 
as the dependent variable. Results were similar to those reported in Table 2. 


9. The dependent variable is ordinal and has a lower and upper bound. We therefore 
replicated the model using ordered logistic regression and linear regression with a 
transformed dependent variable. Results were robust across these models. 
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10. Part of this effect could be spurious, due to reporting bias. First, respondents with inflated 
(or deflated) perceptions of the frequencies of P’s sexual activities may have similar biases 
in their self-assessments. Second, respondents may be reluctant to report imbalances in 
the frequencies of active and passive sexual activities. Future research could address 
these limitations by interviewing couples, to allow an independent measure of the 
partner’s behavior. 


11.That cooking is more closely linked to intimate exchange than other household tasks may 
not be too surprising, given the similarities of physical and emotional gratification involved 
in both activities, compared to, for example, raking the leaves or paying bills. This is 
consistent with Blau's (1964) distinction between social and economic exchange and his 
belief that the articles of social exchange are not commodities but g/fts, with symbolic 
meaning to the exchange partners. 
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Abstract | 

The study of group processes has benefited from longstanding programs of theory-driven 
research on status and power. The present work constructs a bridge between two formal 
theories of status and power: Status Characteristics Theory and Network Exchange Theory. 
Two theoretical models, one for “status value” and one for “status influence,” illuminate the 
underlying mechanisms whereby status differences between individuals lead to power 
differences in negotiated exchange. The two models generate precise hypotheses that are 
tested against new and previously published data. The results generally support both 
models, indicating that status characteristics create power in negotiated exchange relations 
in contrasting ways. We conclude with a discussion of the broader theoretical and 
empirical implications. 


The relationship between social status and the allocation of societal benefits is a major concern in 
our discipline. In this paper we develop new theory that bridges two research programs dealing 
with allocation processes and status: Network Exchange Theory (NET) and Status Characteristics 
Theory (SCT). Over the past 20 years NET and SCT have developed in parallel, with rare instances 
of cross-fertilization (Lovaglia 1994, 1995; Lovaglia, R. Willer and Troyer 2003; Thye 1999, 2000; 
Willer, Lovaglia and Markovsky 1997). NET is a general theory that seeks to explain the distribution 
patterns of valued goods and the location of structural power in networks of exchange. It is 
premised on the notion that the shape of the network and its rules of exchange predispose some 
actors to exercise power over others (Walker et al. 2000; Willer 1999). Power exercise, in this 
tradition, occurs when one actor receives favorable resource allocations at the expense of others. 
In contrast, SCT addresses the causes and effects of status ascriptions in groups (Berger et al. 
1974; Berger et al. 1977; Wagner and Berger 2002). A key idea in this theory is that culturally-valued 
status characteristics such as age, education and gender create performance expectations in 
collectively-oriented groups (Troyer and Younts 1997; Webster 2003). The theory assumes that 
interactions are organized by actors’ performance expectations. Those with the higher state of the 
characteristic are granted higher status and prestige than those with the lower state of the 
characteristic. Both theories address inequality: SCT explains prestige hierarchies based on 
culturally-valued status characteristics (Ridgeway 1991, 2000); NET explains resource hierarchies 
on the basis of network structures and exchange conditions. 
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In this article we explain how status ascriptions affect power. To illustrate, imagine that you 
are about to sell a used car to a prospective buyer. Upon meeting the buyer, you find an attractive, 
middle-aged, white man who is well dressed and polite. Before he inspects the car, you learn that 
he is an accomplished negotiator for a “Big Three” auto maker. Will his personal traits (race, age, 
gender, occupation) affect your perceptions of him? Would you negotiate differently with him 
than with someone less skilled and less notable? Field research suggests that you would. Ayres 
(1991) sent white men and African-American women to buy cars from 90 different dealerships 
in the Chicago area. When the buyer was a white male, the dealer's initial price benefited the 
dealer by $818. However, when the buyer was an African-American female, that benefit nearly 
tripled to $2,169. Relatedly, economists find that individuals are more generous to higher status 
subjects when allocating profit in ultimatum games (Ball and Eckel 1996). The overall implication 
is that status ascriptions have potent effects on negotiation and exchange. 

To explain this phenomenon we propose two mechanisms whereby status produces 
power. First, an extension of Thye’s (1999, 2000) Status Value Theory of Power lets us generate 
precise, ratio-level predictions from status characteristics to power exercise. Specifically, 
NET’s model for calculating negotiation outcomes (Willer 1981a, 1987) is modified with a 
factor from SCT that captures status differences. Second, we propose a new status influence 
theory which asserts that high status actors are more competent and influential in 
establishing the initial conditions for exchange. This new theory also predicts that higher 
status actors exert power over those of lower status. After explicating these mechanisms we 
report data from a new experimental test. We close by considering how the scope conditions 
of the two theories bear on our models, and suggesting several avenues for future research. 


Background 


Status Characteristics Theory 


SCT emerged as part of a larger effort to understand the emergence of influence and prestige 
gradients in small groups. (See Wagner and Berger [2002] for an overview.) Early researchers 
found that stable interactional differences emerged quickly among initially homogenous actors 
working together on a group task (Bales 1950; Bales et al. 1951). These early studies led to 
the development of Expectation States Theory which introduced the concept of a performance 
expectation, i.e., a belief about an actor's future task performance. Performance expectations 
are not necessarily conscious. Rather, they are “unaware hunches about whose suggestions 
are likely to be better.” (Ridgeway and Walker 1995:288) The theory claims that, through social 
interaction, individuals develop performance expectations for themselves and for others 
regarding task competency. Because performance expectations are formed on the basis of 
observable acts, these beliefs are generally consensual across members of a group. Those 
perceived to be more competent are higher in the expectation hierarchy; those perceived to be 
less competent are lower. In turn, performance expectations are postulated to shape 
differentiation along an observable power and prestige order. Later elaborations of expectation 
states theory give critical insights into the traits and behaviors that yield expectations and 
status in groups (Balkwell 1991; Berger, Cohen and Zelditch 1972; Berger, Conner and Fisek 
1974; Berger, Wagner and Zelditch 1985; Ridgeway, Berger and Smith 1985). 

SCT, a branch of the larger expectation states program, connects culturally specified . 
beliefs to performance expectations. The theory defines two kinds of status characteristics. A 
diffuse status characteristic (DSC) such as gender exists in a particular culture if and only if (1) 
one state (e.g., male) is more highly valued than the other state (female), (2) men are expected 
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to be more competent at specific tasks such as sports, and (3) men are generally expected to 
be more capable at a wide range of tasks. In contrast to a DSC, a specific status characteristic 
(SSC) satisfies conditions (1) and (2), but not (3). For example, math skill is a specific status 
characteristic when being good at math is preferable to being bad at math, and when one 
expects a math expert to be competent at other numerical tasks. 

The core of SCT is a set of five logically-connected assumptions that explain how status 
characteristics translate into power and prestige orders and associated behaviors (e.g., 
Webster and Foschi 1988): (1) A status characteristic becomes salient if it differentiates 
members of a task group or is directly related to the task at hand. (2) A salient status 
characteristic is relevant to the task unless explicitly disassociated from the task. (3) Status 
information is incorporated and/or maintained according to the principles described in 
assumptions one and two above as individuals enter or leave the group. (4) Relevant 
characteristics combine to form an aggregated performance expectation associated with 
each member. (5) Aggregated performance expectations give rise to observable differences 
in social interaction. Those with relatively higher performance expectations are predicted to 
receive more opportunities to perform, to perform more often, to be evaluated more 
positively, and to have greater influence over the group's decisions. Numerous tests and 
replications have supported the basic claims of the theory (Wagner and Berger 2002). 


Network Exchange Theory 


NET is a branch of the larger Elementary Theory program that builds on the classical 
understandings of structure found in Marx ([1867] 1967) and Weber ([1918] 1968). It 
incorporates seven structural power conditions that lead to power exercise. Here we consider 
one such power condition, exc/usion, which is most effective when one actor can, without 
cost, prevent another from exchanging. The simplest case is the A-B-C structure when B can 
profit from exchanges with A or C, but not both. The result is that, to their own detriment, A 
and C make offers increasingly favorable to B. That is, B exercises power over A and C. 


Resistance 


NET employs a set of mathematical models to predict the location of power positions (Girard 
and Borch 2002; Lovaglia et al. 1995a, 1995b; Markovsky et al. 1988, 1993; Simpson and 
Willer 1999). Resistance equations (Willer 1981b; Willer et al. 1989; Willer and Markovsky 
1993) are then applied to network relations to generate predictions for exchange outcomes 
(Willer and Anderson 1981; Willer et al. 1989; Willer and Markovsky 1993). It is within the 
context of resistance equations that we establish links between status and power. 

The resistance model consists of equations that use actors’ mixed motives in exchange to 
predict exchange outcomes. The basic idea is that, when considering any proposed payoff, actors 
balance their interest in gaining a better deal against their interest in avoiding “confrontation” or non- 
exchange. Let P; be the payoff proposed to actor / by actor /, and let P7max be /'s best hope. Then 
Pimax—P; eae i's interest in gaining a better payoff. Now let Prcon be/'s cost of confrontation 
that occurs when / and / cannot agree on an exchange. Then P;- P;con is /’s interest in avoiding 
confrontation. NET defines /'s resistance (R;) to an exchange as the ratio of these two interests: 


“ePmax =F; (1) 


P; — P;con 
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When P; is small, the offer is undesirable and /’s resistance will be large because the 
niithnokatss P;max — P; is large, and the denominator P;— P;con is small. In words, / is highly 
resistant to a small diten because / has a large interest in gaining a better offer, and, / is nearly 
indifferent between the offer and not exchanging at all. NET asserts that agreements occur 
when actors are equally resistant to the offer: 


Pimax — Pi P;max = Pi . (2) 


Ri = = ae Ri 
Pr P;con Pi - Pjcon 





Frequently network exchange experiments simulate exchange by having subjects negotiate 
over the division of a resource pool.' Figure 1a illustrates a resource pool relation: A and B can 
share 30 valued resources only if they agree on a division. Initial conditions for this relation 
appear on the left half of Table 1. We assume that resource units are indivisible and that no 
actor will exchange for less than one unit. Thus, for either A or B, Pmax = 30-1 = 29. 


Figure 1. Relations and Structures 


a. Dyad 


b. Dyad with Fixed Alternative for B 





X y te 
30) PE 
Table 1: Initial Conditions for a Dyadic Exchange 
Isolated Dyad Outside Offer to B 
Actor A Actor B Actor A Actor B 
Pmax 29 29 14 29 
Pcon 0 0 0 15 


Payoff x Y x Y 
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At best, A and B both hope to gain all but one unit of the resource pool and both recognize 
that, in failing to reach agreement, their payoff is zero. When A and B exchange, let the 
payoffs to actors A and B be X and Y, respectively. Plugging these values into the resistance 
equation yields: 


We know Y = 30-X, and so by substituting for Y we obtain: 


_ 29-X _ 29-(30-X) (4) 


ie AIR 


X 30.=X 


olving for X, P, = X = 15 and P, = Y = 15, indicating a 15:15 payoff to A and B, respectively. 
At the outset of the bargaining process, the actors seek payoffs close to their respective 
Pmax values. They must make concessions, however, because their payoffs will be zero 
unless agreement is reached. Therefore, each offers to relinquish more and to receive less. 
This example represents the equipower baseline because all initial conditions including 
status, network positions, Pmax and Pcon are symmetrical. At that baseline, equation 4 
predicts that each actor receives 15 resource units. 

The right two columns of Table 1 introduce a type of asymmetry shown graphically in 
Figure 1b. Here, B has an external source of valued resources worth 15 points as an exc/usive 
alternative to exchanging with A. Now P,con = 15 because B is guaranteed to earn 15 if the 
A:B exchange is not completed. This means that A must top B's outside offer of 15 by at least 
one unit to secure an exchange. This places an upper limit on As best hope such that P, max 
= 30-16 = 14. The resistance equation becomes: 


ae Fegarny (5) 


Solving, X = 7.5 and Y = 22.5. The outside offer allows B to exercise power over A. In sum, the 
resistance model captures and combines the various motives and comparisons that negotiating 
actors are assumed to take into account in the process of making and responding to offers. 


Status Value and Power Exercise 


Status structures, like power structures, provide a context and a mechanism for the exercise 
of power. Here we build upon the Status Value Theory of Power (Thye 1999, 2000; Thye and 
Markovsky 1997). Status value refers to the worth, self-esteem or honor that comes from 
possessing a characteristic or object (Berger et al. 1972). Veblen (1899) was perhaps the first 
to affirm its importance, claiming that the overall value of an object is a combination of 
consummatory and status values. Advertisers frequently capitalize on status value. For 
example, despite sharing many components, Chevrolet is marketed as the car of the working 
class while the comparable Buick is marketed as the car of the upper-middle class. Thanks 
to their higher status value, Buicks are sold for a much higher price despite costing only 
slightly more to produce (Womack, Jones and Roos 1990:106 ff.). 
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The status value theory of power applies under the following scope conditions: (1 ) Actors 
are differentiated by salient status characteristics and (2) have accurate knowledge of each 
partner's characteristics; (3) the resources to be exchanged are nominally distinct from one 
another and (4) those resources are relevant to each actor's status.’ As noted earlier, states of 
characteristics may differ in status value. For example, U.S. culture attaches more status value 
to the rank of general than to sergeant, and to the PhD. more than to a high school diploma. 
Furthermore, objects may acquire status value. For example, competitions for the Stanley Cup 
and for the Nobel Prize are fierce because both objects confer status value due to their 
associations with previous winners. 


From Status to Value 


Assumption 1 of the theory claims that the status value associated with actors’ characteristics 
spreads to exchangeable objects (Berger et al. 1972; Thye 2000). For example, a medal of valor 
awarded by the U.S. President confers more honor than the same medal of valor given by a 
community leader. The presidential award has status value because of its direct relevance to 
a high status position. The status value of the Oscar results from its relation to the Academy of 
Motion Picture Arts and Sciences and former recipients, not from its $300 production cost. 

Experiments conducted by Thye (2000) support Assumption 1. In one study, subjects could 
exchange their own blue poker chips for either (1) purple poker chips associated with a higher 
status partner or (2) orange chips associated with a lower status partner. Results showed that 
subjects tried harder to acquire the purple chips, assumed they were generally more important 
than orange chips, and accepted less profit in exchange for them. Importantly, these effects 
were observed even though all subjects knew that orange and purple chips had the same 
monetary payoff at the conclusion of the experiment. The data indicate that relevant status 
characteristics alter the perceived value of goods, as claimed by the theory. 

The spread of status value from person to object has clear implications for power in 
exchange relations. In status differentiated exchanges, objects controlled by the higher status 
person should acquire status value, while objects controlled by the lower status person should 
lose status value. Because high status actors control resources perceived to be more valuable, 
they are predicted to gain more resources when exchanging with status inferiors than with 
status equals. 


From Value to Power 


Assumption 2 claims that the more / values /’s goods, the greater /‘s power over /. Virtually all 
exchange theories agree that values affect exchange (Blau 1964; Bonacich and Bienenstock 
1993; Cook and Emerson 1978; Cook and Yamagishi 1992; Emerson 1962, 1972a, b, 1981; 
Friedkin 1992, 1993; Homans 1974; Molm 1990, 1997; Thibaut and Kelley 1959; Willer 1981a). 
At issue here is not how objects acquire value, but whether status characteristics increase 
or decrease intrapersonal valuations (Heckathorn 1983a, b). In order to detect such changes, 
empirical tests adopt the convention of assigning monetary values to all resources. These 
resources are then made relevant to status characteristics and, by Assumption 1, increase or 
decrease their status value. 

Assumption 3 links structural power to power exercise, stating that actors who have a 
power advantage receive favorable exchanges. A long history of experimental research 
indicates that actors in powerful locations do in fact receive favorable exchanges (Cook et al. 
1983; Lawler 1992; Markovsky, Willer and Patton 1988; Willer and Anderson 1981). 
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Thye’s (1999, 2000) research on dyads and triads showed that status advantaged actors 
also receive favorable exchanges. For instance, the high status members of status- 
differentiated dyads earned nearly double the profit of low status members. A similar result 
occurred in a status-differentiated triangle (high, low, low) where each could negotiate with 
both others. A third experiment demonstrated that status effects countervail “weak power” 
known to occur in the A;-B:-B.-A, network.’ The central Bs normally enjoy a weak power 
advantage, but in this study, were believed by the As to be low status. The structurally 
disadvantaged As were believed by the Bs to be high status. A:B exchanges were near 
equality indicating the elimination of weak power differences caused by status. 


From Ordinal to Metric Predictions 


In previous research the status value theory of power was limited to ordinal predictions for 
exchange outcomes. In this section we offer a method that yields more refined predictions 
and strengthens connections between SCT and NET. To illustrate, we apply the model to an 
exchange relation in which actors are differentiated on one diffuse and one specific status 
characteristic, neither of which is explicitly tied to exchange ability. We assume that the two 
characteristics are consistent, i.e., the higher status actor possesses the positive state of 
both. The theory asserts that when goods held by low status and high status actors are 
relevant to their status characteristics, the goods acquire concomitant status values. That is, 
low status actors perceive an increase in the value of objects held by higher status actors, and 
high status actors perceive a reduction in the value of objects held by lower status actors. 

First we calculate the expectation advantage using the graph theoretic procedures of SCT 
(Berger et al. 1992; Walker 1999). The first step is to calculate Cn and @p, the aggregated 
expectations for actors p and o, respectively. The size of the expectation is determined by the 
length and number of paths connecting p and o to the group task.* When two actors are 
consensually differentiated by one diffuse and one specific status characteristic, they are 
connected to the group task by paths of length +4, +5, +4, +5 for p, and -4, -5, -4, -5 foro. 
Given these paths, Berger et al. (1992) calculate ey; = .35 for the high status actor and e; = 
—.35 for the low status actor. F = e, — €y captures the difference in expectations between 
two actors. Hence, E = .70 is the high status actor's expectation advantage.® 

In the standardized experimental setting used in SCT research, a subject makes an initial 
choice, receives information on a partner's choice, and then makes a final choice. P/s) is the 
probability of staying with an initial choice given disagreement with the partner. Berger et al. 
(1977) proposed a linear function to capture the effect of the expectation advantage on the 
probability of “stay” behavior: 


P(s) =m + gE (6) 


The parameter m is the baseline tendency for stay behavior, net of status information, and q 
captures the robustness of the status effect in a particular empirical setting. 

We also use E to inflate or deflate the status value of subjective payoffs. Using the 
resistance model (Willer 1981a, 1999), subjective payoffs are given as P; = v,x;, where P; is 
the payoff for actor / as determined by the product of /’s valuation (v) and the quantity (x) of the 
resource (r). We specify the relationship between status value and payoffs in a manner similar 
to equation 6. Thus, if a high status / holds status valued resources, and Pa is the low status 
i’s evaluation of those resources, then the status inflation effect is given as 


PS; = P(1+ gE) (7) 
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At the same time, payoffs from low to high status individuals should be deflated. When a low 
status / holds status devalued resources, and PS; is the higher status /’s status-deflated 
evaluation of those resources, then status deflation is given as the reciprocal function, 


PS = PARGEY (8) 


Note the relationship between equations 7 and 8 and the form of equation 6 above. P; and Pi 
give the baseline payoff to actors / and / before status value effects, comparable to m in the 
SCT equation. Also, g is an empirical parameter capturing the strength and salience of the 
status manipulation. E is the expectation advantage. In the next section we show how this 
model combines with resistance equations to permit tests of exact predictions for status 
effects on exchange. 


A Test Using Published Data 


In this section we check the new status value model against data reported by Thye (1999, 
2000). In that setting, two subjects negotiated in a dyad over the division of 30 poker chips 
having cash value. However, from each subject's perspective the interaction appeared as a 
reciprocal exchange of distinct goods. That is, each subject's chips always were traded for 
different colored chips held by the other. The maximally differentiated exchange rate of 29:1 
was set by the experimental protocol and software, so Pmax = 29 for each subject. Before 
the negotiation, a standard manipulation was used to differentiate subjects on several status 
dimensions. In the treatment condition (N = 25) the ostensive high status member believed 
s/he possessed the positive state of two diffuse status characteristics (age, education) and 
one specific characteristic (“Meaning insight ability”). Status was symmetric so the “low 
status” subject possessed the low states of these characteristics. Following Berger et al.'s 
procedures (1977:130-136), ey; = .47 and e, = -.47, produces an expectation advantage for 
the high status subject of E = .94. (See Thye 2000 for further details.) 

To capture the effects of the experimental manipulations, we again follow Berger et al.'s 
(1977:140) strategy and estimate g= .28 for our research setting. Thus, gE = .28 x .94 = .26. 
Plugging this into equations 7 and 8 gives the parameters for status inflation (PS; = 1.26P;) 
and status deflation (PS; = .79P,). The positive status characteristics associated with the high 
status actor should increase the payoff 26 percent. Because the status structure is 
symmetrical, resources held by the low status actor should decrease by the reciprocal factor. 
Plugging these values into the resistance equation yields: 


_ 29-.79X _ 29-1.26Y _ (9) 


.79X — 0 1.26Y — 0 


L 


where X is the payoff to the high status actor and Y is the payoff to the low status actor. 
Solving for X and Y, the prediction is 18.44:11.56 in favor of the high status subject. 

We compared the predicted exchange rate against data reported by Thye (2000). The 
average observed exchange rate at equilibrium was 19.05:10.95 in the status differentiated 
condition. Vot taking status into account yields a predicted 15:15 equipower division, and the 
observed exchange rate departs significantly from this baseline (tz = 2.65; two-tailed p = - 
.014). However, the predicted and observed rates did not differ significantly (tzs = .39; two- 
tailed p = 7n.s.) using the status value model. Thus, the new model represents a significant 
improvement in precision. 


From Status to Power * 1479 


A Status Influence Theory of Power 


We now develop a second connection from SCT to NET. In the foregoing model, power was 
affected by changes in values. Here, power is affected by changes in beliefs. /nfluence occurs 
when / changes /'s beliefs such that /'s behavior is affected. In sociology, influence has long 
been defined in terms of belief change (Mokken and Stokman 1976; Parsons 1963: Zelditch 
1992). In SCT, influence is defined as “a change of evaluation or opinion as a consequence of 
a disagreement by another.” (Berger et al. 1977:38) Our definition of influence as belief 
change is somewhat broader. It allows for influence due to new information, as well as 
inconsistent information. Below we (1) develop a new theory linking status to influence and 
power in exchange relations, (2) quantify influence using resistance equations, and (3) report 
the results of a new experiment designed to test status influence predictions. Along the way 
we also identify overlaps in the central concepts of SCT and NET. 


Theoretical Scope 


Our theory applies to status-differentiated exchange relations wherein each actor can benefit 
more from the focal relation than from any alternative. This establishes the incentive for 
exchange and ensures that the actors are oriented toward one another relative to less valued 
alternatives. Further, we assume that actors (1) can communicate with one another and (2) 
have imperfect information about the shape of the larger exchange structure and the payoffs 
available elsewhere. Ambiguity and limited information are known to prime social influence 
and related processes (Deutsch and Gerard 1955; O'Keefe 1990; Sherif 1935), and we expect 
the same to occur in exchange. Finally, the theory applies to negotiated exchange wherein the 
range of possible solutions is narrowed to the point of agreement as actors maneuver to 
maximize gains. This distinguishes our work from research on “reciprocal,” “productive” and 
“generalized” exchange (Lawler et al. 2000; Molm 2003). 


Assumptions 


The logic of our status influence theory parallels that of SCT. Assumption 1 asserts that status 
characteristics become salient when they provide a basis of discrimination or are directly 
relevant to the exchange (Berger et al. 1977). That is, we presume actors search for social 
cues including status- or exchange-based information that distinguish them from an 
exchange partner. Thus, information that is differentiating (e.g., race in a mixed-race group) 
or task-relevant (e.g., one actor is a professional negotiator) becomes available for use by 
participants in the exchange (Berger et al. 1977). 

Assumption 2 states that salient status characteristics produce like-valued states of 
performance expectations. There is mounting evidence that status characteristics activate 
performance expectations in a wide variety of settings. For example, traits such as gender and 
ethnicity have been shown to trigger performance expectations for a broad array of tasks, such 
as standardized tests of intelligence, that are beyond the scope of SCT as initially formulated 
(Lovaglia et al. 1998; Steele and Aronson 1995).° One recent study provides direct evidence 
that the DSCs age and education activate performance expectations in exchange. Thye (2000) 
found that subjects with the lower state of age and education were (1) less likely to be chosen 
for an important class project, (2) perceived to have less general problem-solving potential, and 
(3) seen as having less business skill in a negotiated exchange setting. Together, these findings 
suggest that status characteristics impact performance expectations in exchange relations. 
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Assumption 3 asserts that performance expectations alter beliefs and aspirations. We 
suggest two corresponding mechanisms. The first is social influence, by which low status 
actors believe and act on the communications of high status actors more than the converse. 
Berger and colleagues claim that low status individuals who are consensually differentiated 
on status and performance expectations tend to engage in “generalized deferential 
behaviors” that advantage the high status actor (Berger et al. 1998:384). We see the 
accepting of exchange-related communication as an instance of this more general process. 
The second mechanism centers on profit aspirations. We expect high status actors to be 
more acquisitive in their negotiations with low status actors than the converse. A key idea in 
Reward Expectations Theory (Berger et al. 1985) is that larger rewards are associated with 
higher status. More broadly, hundreds of studies in multiple disciplines find status-based 
expectations alter aspirations in a variety of collaborative and competitive tasks (Cohen 1986, 
1993: Cohen and Roper 1972, 1985; Rosenthal 1994, Rosenthal and Jacobson [1968] 1992; 
Rosenthal and Rubin 1978; Rozenholtz and Cohen 1985). We merely extend this logic to the 
profit-seeking motive in social exchange. Our experimental design allows both influence and 
aspiration effects to emerge. We track both the level of social influence reported by the 
message recipient and the bargaining aspirations communicated from higher and lower 
status individuals. 


Quantifying Status Influence 


Here we apply resistance equations to show how status influence is quantified. Table 2 and 
Figure 2 give the initial conditions for this example and the experimental test reported below. 
Here, A and B divide a 25-point resource pool. In this example, B’s payoff is X and A's payoff is 
25 — X. Presume that C provides B with a standing alternative offer worth 10 points — an 
exchange that transpires only if A and B do not exchange. Assume that A is unaware of B's 
outside offer of 10, and can be influenced by B's communications regarding the size of that 
offer. At issue is whether or not A’s beliefs are influenced when higher versus lower status Bs 
communicate to A that the outside offer of 10 exists. If so, how are payoffs affected? Given that 
high status actors have more influence than low status actors, we assume that A will be more 
likely to believe communications when B is higher status than when B is lower status. 


Figure 2. An Example of Status Influence 


25 - x Xx 10 
ht 2 


Table 2: Initial Conditions for Status Influence 


B is not believed B is believed 
High Status A Low Status B Low Status A High Status B 
Pmax 24 24 14 24 
Pcon 0 10 0 10 


Payoff 29 -X X 25 - X X 
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The left side of Table 2 gives the initial conditions when the lower status B is not believed; 
the right side gives the initial conditions assuming a higher status B who is believed. In both 
cases B has direct knowledge of the offer of 10, so Pscon = 10. Again, Pamax is affected by 
whether or not influence occurs. If no influence occurs, as illustrated on the left side of Table 
2, the offer of 10 is not believed and Pamax = 24. When A does believe that the high status 
B has an offer of 10, as shown on the right side of Table 2, in order to exchange with B, A 
must offer at least 11 to B. Thus Pamax = 25-11 = 14. This is captured in equation 10 when 
B is not influential, and equation 11 when B is influential. 








ee ee Ye ee (10) 
Rae ? Ae 
25 — X X— 10 
spel hs snk Asad) Sei ee ea Co (11) 
Bi ~ Re 
25 — X X— 10 


Solving equation 10 where B is not believed gives X = 15.53. However, when B is believed 
as in equation 11, B's payoff increases to X = 17.5. Thus, the effect of social influence is the 
difference between the two payoffs: 17.5 — 15.53 = 1.97 units. 


A Test of Status Influence Theory 


Design and Subjects 


A two-condition experiment was designed to test the status influence theory. Each condition 
involved two subjects, A and B, who negotiated the division of a 25-point resource pool. 
Because the resource pool is not associated with either subject, there can be no spread of 
status value to the resources. As in the Figure 2 network, the subject in position B always had 
a fixed alternative worth 10 points. Further, the individual in position B could communicate to 
A the size of that offer (truthfully or not). In Condition 1, A and B were assigned low status and 
high status positions, respectively. Condition 2 reversed the status assignments. First- and 
second-year undergraduates at the University of South Carolina were recruited as paid 
volunteers from courses across campus. To control for possible race and gender effects, only 
white females were used in this research. 


Procedures 

Upon arrival, the two subjects for a given experimental session were randomly assigned to 
separate rooms where they received written instructions. These explained that each 
participant would negotiate with another over how to divide a 25-point pool of resources. 
Both subjects received identical instructions with one exception: The subject in position B 
was shown a diagram of the full network and informed that she could accept a computer- 
generated offer worth 10 points at any time during the negotiation. The subject in position A 
was informed that B had an outside offer, but the size of that offer was not disclosed. Both 
subjects were told that, prior to each round of negotiation, B would send a communication to 
A regarding the size of the outside offer. Each subject was advised to negotiate for as much 
of the 25-point resource pool as possible. A brief questionnaire administered prior to the 
actual negotiation verified that subjects understood the instructions. 
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Subjects were told the study would run for 25 rounds of negotiation, but sessions were 
concluded after 20 rounds in order to preclude end-effects. Each round lasted two minutes. Just 
before each round, player B was prompted to send a computer-mediated message to player A 
indicating “The size of my outside offer is x,” where x represented a number freely chosen by 
player B. Once the message was sent, negotiations began. At any time during negotiations 
subjects could send offers (i.e., a proposed division of the resource pool), or accept counter 
offers, or B could accept the standing offer of 10 points. If no agreement was reached and B 
declined the alternative offer, both earned nothing for that round. Subjects were informed that 
their pay for the study depended on the number of points accumulated during the negotiations. 
When the experimental session was complete, the assistant explained the nature of the research, 
answered questions, and paid subjects using a standard formula to convert points into money. 

The bargaining process was controlled and recorded using ExNet 3.0, a web-based 
experimental system developed to investigate exchange and power dynamics under a range of 
conditions. Using mouse control, subjects could send and receive offers and complete exchanges. 
The software displayed the experimental network on each screen and, consistent with initial 
conditions, B's screen displayed the alternative offer of 10 while A's did not. All subjects completed 
an interactive on-line tutorial to ensure they understood all aspects of the bargaining protocol. 

Just prior to the negotiations, a standardized status manipulation from SCT research was 
used to differentiate the subjects on the DSCs of age and education. The experimenter 
introduced predetermined status information via posted summaries in each subject's room. 
Subjects in the low status position were led to believe they were connected to a higher status 
partner — a 25-year-old engineering graduate student with an outstanding GPA. Subjects in the 
high status position believed they were connected to a 16-year-old high school student with 
a low GPA. Polaroid photographs of the subject and each ostensive partner added control and 
realism to the manipulation. Comparable status manipulations have been used successfully 
in past research to produce consistent status differentiation (e.g., Lovaglia and Houser 1996; 
Markovsky, Smith, and Berger 1984; Thye 2000). 


Dependent Measures 

ExNet 3.0 tracked power exercise, exchange frequency, and communication from B to A in 
each experimental session. Power exercise was measured as the average profit earned from 
exchanges during the last 10 rounds, a period during which payoffs typically stabilized. The 
mean profit difference across the two experimental conditions indicates the effect of status 
influence on power exercise. Exchange frequency is the proportion of negotiation rounds in 
which the two individuals reach agreement. Profit aspirations are manifested in the 
communications from B to A regarding the size of the outside offer. The communication 
precedes actual bargaining, so the communicated amount provides a measure of aspiration 
independent of the actual offers and counter-offers that produce exchange. 

Prior to the negotiations, we administered a questionnaire to measure perceived status 
and competence. Each 9-point bipolar adjective was scaled from low to high. Status was 
measured using four items with the following anchors: low/high honor; follower/leader; 
little/high respect; low/high status. Four items measured performance expectations: little/high 
work-related ability; little/high business ability; generally incompetent/competent; little/high 
value on a class project. These items have been found reliably to detect status and 
expectation differences in several recent studies (Lovaglia et al. 1998; Ridgeway and Erikson 
2000; Thye 2000). We averaged the responses for each set of four items, for self and other, to 
create indices for status and performance expectations. 

We also administered a post-experiment questionnaire that measured a variety of reactions 
to the study. The theory predicts that communications from higher status individuals are more 
influential than those from lower status individuals. We designed a number of measures to 
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assess the degree of influence reported by the individual in position A. Specifically, status 
influence was measured via four bipolar items that assessed (1) influence by the partner's 
message, (2) credibility of the partner's message, (3) partner's trustworthiness, and (4) effect 
of the partner's communication on the subject's bargaining. 


Hypotheses 


The first three hypotheses pertain to differences between higher vs. lower status subjects in 
position B of the Figure 2 network. The final hypothesis compares higher and lower status 
subjects in position A. 


Hypothesis 1: Higher status subjects earn more resources. 
Hypothesis 2: Higher status subjects are perceived to be more 
competent. 

Hypothesis 3: Higher status subjects communicate higher outside 
offer values. 

Hypothesis 4: Lower status subjects are more influenced by 
communicated outside offers. 


Results 


Sixty-four subjects (2 subjects X 16 sessions X 2 conditions) participated in the study. Two 
sessions were excluded from the analysis because subjects were suspicious of the status 
manipulation and reported that their behavior was affected. Thus, we included 15 sessions 
and 30 subjects in each condition. 


Manipulation Check 


Table 3 presents data from the pre-negotiation questionnaire. The manipulations did produce 
consistent status perceptions. First, subjects with positive states of age and education 
perceived themselves to have higher status than their partners. This occurred for both 
position A (8 = 2.60, ts = 6.54, p < .001) and B (8 = 2.17, ti = 4.43, p < .001). As observed 
in prior. research, low status subjects also tended to see smaller differences in relative 
expectations (Thye 2000). Overall, age and education differences create differential 
perceptions of status, although primarily for the high status subjects. 


Hypothesis 1 
Hypothesis 1 was supported, indicating that higher status subjects in position B exercised 
more power (i.e., acquired more resources) than lower status subjects in the same position. 
The mean profit over the last 10 exchanges was significantly greater when B was high status 
(X = 14.62; S.D. = 3.47; N = 15) than when B was low status (M = 13.10; S.D. = .60; N = 
15; tee = 1.65, p = .05, one-tailed). Cohen's (1992) guidelines for estimating the magnitude of 
experimental treatments indicates a moderate effect of status influence on resource 
accumulation (d = .60). That is, the mean profit earned by the high status B is .6 of a standard 
deviation greater than that earned by the low status B. 

The effect of status influence on the exchange rate is the difference between the means 
for the high status B and the low status B, or 14.62 - 13.10 = 1.52. That value does not differ 
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Table 3: Means and Standard Deviations for Status Indices’ 


LS 


Assigned Self Partner 5 
Measure _— Position Status Rating Rating (Self - Partner) 
1.00 4.95 
Status A High *(.79) (1.34) 7.00 
7.10 7:3 
Low (.88) (.90) -.03 
6.92 4.57 
B High (.82) (1.87) 2AFS 
7.30 Les 
Low (1.11) (1.37) 07 





Notes: The degrees of freedom for each matched sample t-test is N-1 where N is the number 
of distinct experimental sessions (15). 

* All measures could range from 1 to 9. 

*p<..05. *p <.01 ***p < 005 one-tail ¢ 


significantly from the predicted difference of 1.97 (pooled S.D. = 2.33; N = 30; f = 1.040). 
This shows that our theoretic model of status influence predicts the magnitude of power 
exercise, lending further credence to the theory. 


Hypothesis 2 

The data in Table 4 support the second hypothesis predicting that the status manipulation 
would produce consistent competence expectations. For both high and low status actors, in 
either network location, high status led to high performance expectations, and low status led 
to low performance expectations. This is especially important in that competence 
expectations are postulated to be a theoretical mechanism leading to status influence. It is 
also notable that, as with the status measures, the perceived difference in competence is 
greater for high status subjects (6 = 2.79 and 6 = 2.30) than for low status subjects (8 = -.37 
and 8 = -.70). This pattern suggests that high status subjects perceived large differences 
between themselves and lower status others, while low status subjects tended to rate 
themselves as similar to higher status partners. These findings are consistent with much 
related research on self-serving biases (Eagly and Chaiken 1998) and previous studies of 
status in exchange (Thye 2000). 


Hypothesis 3 

The third hypothesis bears on the mechanism through which expectations produce power. 
We reasoned that, if status alters profit aspirations, B’s relative status would affect her 
portrayal of the outside offer she communicates to A. We averaged those communicated 
values across the final 10 rounds and checked for a status effect. The average communication 
from the high status Bs ( X = 11.32; S.D. = 4.57; N = 15) was greater than that from the low 
status B( X = 9.83; S.D. = 3.51; N = 15), however the difference is not statistically significant 
(te = 1.00, p = .32, two-tailed). Thus, we conclude that high status individuals did not 
communicate significantly greater profit aspirations relative to low status individuals. The 
implication is that power exercise is not necessarily driven by the higher profit aspirations of 
high status subjects. 
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Hypothesis 4 

The final hypothesis predicted that communications from higher status others are more 
influential. This was observed in the post-experiment questionnaire where subjects felt that 
communications from higher status others were more influential (X = 7.40; S.D. = 2.03; N = 
15) than those from lower status others ( X = 6.00; S.D. = 2.53; N = 15). The difference was 
Statistically significant (ts = 1.66, p = .05), The recipients of communications also judged 
higher status others to be significantly more credible (d = 1.93, ts = 3.86, p < .001) and 
trustworthy (8 = 1.20, ts = 2.20, p < .022). Finally, subjects reported that communications 
from those with higher status were more likely to affect their bargaining (X = 6.53; S.D. = 
2.59; N = 15) than communications from lower status subjects (X = 6.33; S.D. = 2.58: N = 
15), but this difference was not statistically significant (ts = 0.21, p = .42). Overall, the 
behavioral and questionnaire data indicated that communications from higher status subjects 
resulted in more influence and power than those from lower status subjects, thus supporting 
the hypothesis. 


Table 4: Means and Standard Deviations for Expectation Indices’ 





Assigned Self Partner ) 
Measure Position Status Rating Rating (Self - Partner)” 

f° FAQ 4.63 

Expectations A High (1.08) (1.21) 21g" 
7.10 7.47 

Low (.69) (.71) =31 
7.03 4.73 

B High (.77) (1.52) 2:30"% 
7.08 7.78 

Low (1.07) (.69) -.70"* 


Notes: The degrees of freedom for each matched sample t-test is N-1 where N is the number 
of distinct experimental sessions (15). 

* All measures could range from | to 9. ‘ 

> The smaller “self-partner” values for low status subjects in both locations may reflect a self 
serving bias. This pattern was also observed by Thye (2000). 

2p <..05 .**p <..0lae***p <.005,one-tail ¢ 


Ancillary Analyses 

Small resistance (R) values indicate that actors should be more willing to complete 
exchanges.’ Thus, our status influence model suggests that there will be more A:B 
exchanges when equiresistance values are small than when they are large. R = .866 when B 
is high status, and R = 1.53 for low status Bs. If A and B more easily reach agreement when 
B is high status, then A:B exchanges would be predicted to occur more frequently when B 
has high status than when B has low status. The data support this prediction. The exchange 


frequency when B has higher status than A is significantly greater (X = 17.40; S.D. = 2.29; N 
= 15) than when B has lower status than A(X = 15.93; S.D. = 2.37; N = 15; ts = 1.72, p = 
.04, one-tailed). Overall, this finding suggests that high status individuals benefit from both 


power use and more frequent exchange. 
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Summary 


The experiment showed that status broadly impacts the use of power and related phenomena 
in negotiated exchange. Most important for the status influence theory developed here, high 
status subjects earned significantly more points (Hypothesis 1), were perceived to be more 
competent (Hypothesis 2), and were more influential (Hypothesis 4) than lower status subjects 
in the same location. The results suggest that key factors from SCT — status, performance 
expectations, and influence — interact with the development of power in systems of exchange. 
We further observed that, controlling for content, communications from higher status subjects 
had more impact on negotiations and power. The implication is that power is driven more by 
the social influence of high status individuals than by their profit aspirations. Overall, the data 
provide strong support for the status influence theory and models. 


Discussion 


This paper introduces two new models connecting power and status using two of the most 
extensively tested theories in structural social psychology, NET and SCT. One model links 
SCT to NET by formalizing the status value theory of power and generating metric 
predictions for status value effects. It asserts that positive status characteristics enhance 
payoffs from exchange, and that negative status characteristics attenuate payoffs. The 
model conforms remarkably well to data published by Thye (2000). Second, we develop a 
new theory of status influence that links SCT to NET through expectations and beliefs. This 
theory claims that status characteristics activate consensual and consistent expectations 
for task competence that, in turn, unleash social influence processes that benefit high 
status individuals. Results of a new experimental test support the principle claims of the 
theory. 

More generally, our theoretic models contribute to ideas from both classical and 
contemporary theory. Status and power have been intertwined at least since Aristotle 
([353BC] 1963) and through the works of Marx ([1867] 1967) and Weber ([1918] 1968). To this 
stream of theorizing we contribute two distinct mechanisms through which status generates 
power differences: status value and status influence. As for contemporary social exchange 
theory, by mathematically linking SCT and NET, our status value and status influence models 
propel us toward a more integrated theory of status and power inequalities. This theory 
promises to be as precise as either theory, but also broader and richer. Furthermore, our work 
represents a broadening of traditional social exchange theory which, since Thibaut and Kelley 
(1959), has focused predominantly on power as a consequence of network structure and 
dyadic connectedness. In contrast, our work shows that power also emerges from personal 
attributes such as status. This raises the possibility of linking, integrating and extending other 
person-based theories in comparable ways. 


Scope Issues 


Scope conditions are abstract statements that prescribe the domain of phenomena within 
which a theory applies (Foschi 1997; Walker and Cohen 1985). To properly test linked theories 
requires that their scope conditions intersect and that tests occur within that intersection. Our - 
aim has been to develop new models using concepts and empirical phenomena common to 
SCT and NET. Here we return to the scope of these two theories to examine whether and how 
they intersect. 
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In SCT, the group task (T) is constrained by four scope conditions (Berger et al. 1977:95): 
(1) Group members are motivated to successfully complete T; (2) actors believe that a 
particular instrumental characteristic (C*) is required to successfully complete T: (3) T is 
evaluated in that task outcomes are defined as a success (T+) or failure (T-); (4) T is collective 
meaning it is necessary and legitimate for each actor to take the behavior of the others into 
consideration. SCT's scope conditions are applicable to an initial setting within which status- 
ordering processes subsequently play out. Similarly, we assert that these scope conditions 
also apply to the initial setting within which social exchange processes operate. 

In the status value theory, status affects the value of goods held by status differentiated 
actors. The theoretical mechanism for the spread of status value from person to object is not 
a formal component of SCT, but was developed elsewhere (Berger, Zelditch, Anderson and 
Cohen 1972; Thye 2000). Our use of SCT’s graph theoretic model to find the expectation 
difference (E) indicates some conceptual overlap. At issue is how status alters the va/ue of 
related objects, and how value subsequently affects the initial conditions of exchange 
relations. Only after status has affected values are the initial conditions for the exchange 
established, and only then are resistance equations applied. Importantly, SCT does not apply 
to the exchange relation itself, so the scope of SCT need not be altered. 

The status influence model asserts that status-based influence impacts messages related to 
establishing conditions for exchange. We suggest that SCT’s four scope conditions do in fact 
encompass the circumstances under which individuals initially select exchange partners and 
sample information. Imagine a person seeking a contractor to build a new home in the context 
of the above scope conditions. First, in preliminary discussions, both the buyer and contractor 
share information to determine if a mutually beneficial transaction is possible. The buyer might 
claim she has a limited budget, while the contractor might establish a minimum construction 
price. If the two cannot agree on such terms the home cannot be built, and neither can benefit. 
Thus, the benefit at stake creates task-motivation. Second, the interaction is affected by 
numerous task-relevant characteristics: knowledge of the housing market, information on the 
price of raw material, the number of alternative buyers, the availability of other contractors, and 
so on. Third, each must consider the other's position when attempting to determine whether a 
deal is possible, so the two are collectively-oriented. Finally, the “task” of the buyer and 
contractor is to exchange the home for money, a deal that must benefit both if either is to 
consider the transaction. Thus, the final scope condition is also satisfied in that task success is 
tantamount to completing the exchange while task failure is equivalent to being excluded. 

To summarize, it is only after these conditions are established that explicit negotiation 
occurs and mixed motives emerge. Thus, SCT applies to the establishment of conditions 
initial to the negotiation and exchange. SCT’s scope conditions do not preclude such 
applications, because they are not applied to the negotiation process itself. That negotiation 
and exchange processes themselves entail mixed motives is simply irrelevant.® 


Future Research 


For future study there are several issues that our models raise but do not resolve. Recall that 
we failed to find support for hypothesis 3 which suggested that, relative to lower status 
subjects, higher status subjects would have greater profit aspirations and inflate their 
communications regarding an outside offer. Although the effect was in the predicted 
direction, it was not statistically significant. There are a number of reasons this may have 
occurred. First, it is possible that our indicator of profit aspirations (i.e., communications 
regarding the outside offer) may be sensitive to factors such as norms of fairness, obligations 
of honesty, or unwillingness to appear greedy. Perhaps a more direct measure of profit 


1488 * Social Forces Volume 84, Number 3 * March 2006 


aspirations (e.g., questionnaire items measuring what subjects expect prior to each 
exchange) would yield the hypothesized effect. Another possibility is that the effect would 
become significant with a more powerful statistical test. In either case, given the substantial 
body of research linking status to reward expectations (Berger et al. 1985) we suspect that 
future tests will produce the hypothesized effect. 

Second, previous research clearly indicates that power exercise results in negative 
emotional reactions among low power actors (Kemper 1990; Lawler and Yoon 1993, 1996, 
1998; Lawler and Thye 1999; Willer et al. 1997). It remains to be seen whether status-driven 
power produces similar negative emotions, but we suspect not. Status value leads to power 
exercise but, from the perspective of a low status actor buying a status valued good, the 
exchange does not appear unequal. To the contrary, because the exchange ratio appears as 
equipower there is no basis for negative reaction. The buyers of Michael Jordan shoes 
generally do not resent the retailer from which they were purchased. Similarly, collectors who 
pay high prices for status valued goods do so willingly because the goods are highly valued 
to them. The emotional consequences of status-driven power is a fertile area for future 
investigation. 


Field Applications 


Here we consider ways that our theorizing may apply outside of the laboratory. Advertisers 
routinely use status to enhance product value. A recent television advertisement for a sporty 
car clearly makes a status value claim: “Some trophies are made to sit on a shelf. This one is 
made to drive.” A shoe company recently adopted an approach that more explicitly 
recognizes the status value effect. Initially the ad shows an athletic shoe with a high status 
athlete and a high price. Then, as the athlete's image fades, the price falls. Vivid examples of 
status influence also abound. It is well known that status influence stemming from expertise 
can lead to power and financial gain. Expert witnesses who testify for fees in high-profile 
cases rely on SSCs that are enumerated as qualifications prior to testimony. In terms of 
classical exchange theory, the expert's influence on power has long been recognized. Related 
examples include Berle and Means’ (1932) “managerial control,” French and Raven's (1968) 
“expert power,” and Bacharach and Lawler’s (1980) “expert vs. structural power.” We theorize 
these ideas formally by constructing models in which expert power is based on the value of 
status characteristics and their ability to confer expectations of competence and influence. 
In status influence we find that power exercise occurs, not because high status actors 
have greater profit aspirations than lower status actors, but because exchange-related 
messages from them are simply more influential. Possessing great influence also suggests 
conditions under which high status actors may absolve themselves of responsibility for the 
use of power. For example, in past years U.S. auto makers have asked workers to forego 
wage increases to mitigate Japanese competition. Japanese auto makers thus were blamed, 
not their U.S. counterparts. During some of that time U.S. auto makers earned record profits. 
Power from status influence may provoke negative emotions or blame, but when influence 
is sustained, the responsibility for that blame can be shifted elsewhere (Lovaglia 1994, 1995). 
The two theoretical models proposed here link SCT and NET to make metric predictions, 
one from status value and the other from status influence. These models suggest new lines 
of research — experimental, institutional and historical - that promise to enlarge the 
explanatory scope of each theory. For example, the status influence theory is still rudimentary — 
compared to the status value theory. A more refined theory will relate the size of the status 
difference to the extent of believability, a development that will undoubtedly inspire further 
testing. Previously, SCT and NET did not study the effects of status structures and power 
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structures together, and for good reason. The first task for each of these programs was to 
push forward investigations within the provisional scope conditions associated with each 
theory. With substantial support now established, the path is cleared for developing research 
programs that clarify the interplay between status and power inequality. 


Notes 


1. Although resource pools are conventionally used in this line of research, resistance 
equations also make predictions for exchanges where actors send and receive sanctions. 
See Brennan (1981) and Willer (1987, 2003) for those predictions and their tests. 


2. According to Freese and Cohen (1973:182), “an element e; is relevant to an element ej if 
and only if: if x; possesses e;, then x; is expected to possess ej. 


3. Unlike strong power structures, the more powertul B positions in this network do not have 
a surplus of less powerful A positions to exclude. However, the As can be excluded 
without excluding the Bs, whereas the reverse is not true. The result is B:A power 
differences that are less extreme than those from strong power. 


4. More specifically, expectations are based on the length and number of relevance paths 
connecting an actor to a task outcome (Berger et al. 1977; Webster and Foschi 1988). A 
relevance path may be positive or negative, its strength being a function of its length (L), 
represented by f(L). Berger et al. (1992) report empirically-based estimates of (3) = .406; A4) 
= .150; and 5) = .050. To illustrate, consider a high status p connected to a positive task 
outcome because that person has the positive state of a DSC. The model yields a positive 4- 
path and a positive 5-path connecting p to the positive task outcome. Thus, the expectation 
value (€p+4) is given by Cn + =1—[(1—.150) (1-.050)]. Solving yields Cn+ = 1—.80 = .20. 


5. Strictly speaking, this model addresses performance expectations, not status value. 
However, this model for performance expectations suits our purposes ideally because (1) 
the relationship between status and performance expectations is now axiomatic, (2) 
increments in valued status traits produce increments in performance expectations, and 
(3) no alternative standardized measure of status value currently exists. 


6. Empirical studies confirm that SCT is accurate within the scope conditions stated in Berger et 
al. (1977). Thus, the cited studies do not contradict that theory or its evidentiary basis. On the 
contrary, they show that the status organizing processes are broader than initially presumed. 


7. The reason is that small resistance values imply small numerators and/or large 
denominators of the resistance ratio. In turn, this translates into payoff values (P) that are 
very near one’s best hope (P max) and/or much larger than one’s worst fear (P con). 


8. There are additional grounds to justify the joint use of SCT and NET. First, a number of 
studies demonstrate that SCT can be usefully applied outside those scope conditions 
(Cohen 1993; Foschi 1996, 2000; Gerber 2001; Lovaglia et al. 1998; Thye 1999, 2000; 
Willer et al. 1997). Second, as shown above, there is mounting evidence that perceptions 
of status value can alter exchange outcomes in a variety of contexts. Thus, there are 
logical and empirical grounds to presume that status processes are more robust than 


previously assumed 


=e 
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Lake Wobegon Upside Down: 
The Paradox of Status-Devaluation 


Stoyan V. Sgourev, MIT Sloan 





Abstract: 

This paper examines a cognitive bias whereby respondents in postcommunist Bulgaria 
systematically decrease their self estimates on material welfare in contrast to the well- 
established status-enhancement bias. The analysis shows that the main reason for the 
occurrence of status-devaluation is the experience of relative deprivation in postcommunism, 
reflected in unfulfilled expectations and the perception of status loss relative to socialism. The 
bias is reinforced by a process of network closure under conditions of high uncertainty and 
accelerated social change. More homogeneous personal networks provide reference points, 
which make high-status actors more likely to see themselves as low-status. 


The tendency to overstate one’s positive features and relative standing is well-documented in 
the social sciences. In the social-psychological literature, this tendency is known as the “Lake 
Wobegon’" bias, alluding to Garrison Keillor’s fictitious land where “all the women are strong, all 
the men are good looking and all the children are above average.” As early as the 1950s, 
scholars identified “ego-defensive” biases through which people enhance their self-esteem or 
positive opinion of specific dispositions or abilities (Heider 1958, Kelley 1967). More recently, 
psychological work postulated the Better-Than-Average effect (BTA), which denotes the 
propensity of people to believe they are better and do better than the average person (Alicke 
1985, Brown 1986, Kruger and Mueller 2002). BTA emerges in a variety of judgment domains, 
such as personality descriptions, risk perceptions and performance expectations (Kruger 1999: 
180). Status-enhancement tendencies were also documented in network studies, where people 
report having more friends and being more central in the network than they actually are 
(Kumbasar, Romney and Batchelder 1994, Zuckerman and Jost 2001). 

Status-enhancement tendencies are probably most pronounced in self-ranking processes. 
Consider the following example. In the 1983 and 1987 waves of the General Social Surveys 
(GSS)' respondents were asked to rank themselves on a scale of social positions. A little more 
than 70 percent of respondents positioned themselves above the midpoint of the scale, 
illustrating the proclivity for enhancement of self-estimates. Another example of status- 
enhancement is the tendency for most people to see themselves as middle class (Barnes 
1954).? Figure 1 presents the answers to the following question in the GSS: “Compared with 
American families in general, would you say your family income is far below average, below 
average, average, above average or far above average?” The question was asked on the GSS 
in every wave from 1972 to 1998. The time points on the scale stand for the beginning, 
midpoint and end of that period. 

Considering the fact that the income distribution in the United States is skewed to the right 
(Harrison and Bluestone 1988), Figure 1 shows massive self-enhancement. The consistent 
normal distribution of self-estimates is all the more remarkable against the backdrop of 
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Figure 1. Distribution of Answers in USA to the Question, “Compared with other families in 
general, would you say your income is...?” 
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increasing income inequality over the past decades (Levy and Michel 1991, Nielsen and 
Alderson 1997). Despite the slight decrease in the share of respondents in the middle category, 
the stable form of the lines suggests that the perception of relative status is generally immune 
to shifts in income distribution. There is little to indicate that Americans have become any less 
sanguine about their social standing over time. The evidence, in fact, suggests the opposite: 
the increase in the share of respondents who consider themselves “below-average” amounts 
to only half the increase in the share of respondents claiming “above-average” income status. 

One explanation for these discrepancies draws on the “availability heuristic” in cognitive 
psychology, which represents a systematic perceptual bias whereby individuals overestimate 
the number of others similar to themselves and their intimates (Tversky and Kahneman, 1981). 
Similarly, social-psychological research posits that people typically compare themselves to 
others of objectively similar status (Festinger 1954), which is one of the main reasons for the 
tendency to claim middle-class status. The fact that low-status individuals see many others of 
similar or lower standing around them predisposes them to elevate their position in the status 
order (Evans, Kelley and Kolosi, 1992). Another explanation emphasizes the need for positive 
self-evaluation, which is a core claim in psychological research (Kruger 1999). Traditionally, self- 
enhancement tendencies have been taken as evidence of “motivated reasoning” in psychology 
(Kruger 1999: 221). People exaggerate their skills, traits and abilities to serve psychological 
needs for positive self-esteem (Alicke 1985, Brown 1986). Psychological studies have posited 
the “pancultural” nature of self-enhancement on the grounds that people in all cultures strive 
to maintain positive self-esteem (Sedikides, Gaertner and Toguchi 2002: 73). 

The universal nature of the postulated needs and cognitive mechanisms raises the 
expectation that the status-enhancement bias holds sway across a variety of contexts 
(Sedikides et al. 2002). There is indeed supporting evidence from many countries, - 
representing both individualist and collectivist cultures, such as Canada, France, The 
Netherlands, Japan and China (Kruger 1999: 221, Heine and Lehman 1997). The tendency for 
low-status individuals to claim middle status is not particular to the United States. It was 
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found in other countries, including such dissimilar cases as Australia and socialist Hungary 
(Evans et al., 1992). The lack of contravening evidence has reinforced the tendency to elevate 
the bias to the status of a universal law. 

This paper situates status enhancement in comparative light by examining manifestations 
of the opposite perceptual bias. In stark contrast to Americans, respondents in Bulgaria reveal 
a strong tendency to decrease their standing in comparative evaluation. Figure 2 presents the 
distribution of answers in Bulgaria to a question worded similarly to the question on the GSS 
mentioned earlier.* The distribution is remarkably skewed. While 68.2 percent of respondents 
report that they are “below average” on material welfare, only 1.8 percent claim “above 
average” status. These results reveal a stark contrast between the patterns characterizing the 
Bulgarian and American data. Bulgarians appear much more likely to view themselves as low- 
status on material welfare than Americans. 


Figure 2. Distribution of Answers in Bulgaria to the Question, re to others, how would 
you define your level of material welfare?” 
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| suggest that the skewness in the distribution of the responses in Bulgaria is due to the 
operation of a status-devaluation bias, which depresses self estimates in a systematic, 
sociologically meaningful manner. The bias is defined as the tendency to claim lower status 
than the status objectively possessed on a social dimension pertinent to the identity of the 
individual.4 The particular social dimension examined here is material welfare. This analysis 
investigates the conditions that lend themselves to the manifestation of status-devaluation. | 
argue that the main reason for the occurrence of the bias is the experience of relative 
deprivation as a result of the felt deterioration of life chances in postcommunism. Another 
antecedent of status-devaluation is structural in nature. It relates to the process of homo- 
social reproduction of similarity within clusters of personal ties. 

The analysis is important for several reasons. The examination of cases that do not fit the 
status-enhancement pattern enables the refinement of its scope conditions and generalizability. 
Evidence contravening prior observations provides greater analytical leverage over the set of 
conditions that facilitate the emergence of biases. These include both context-specific historical 
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and cultural forces, as well as general cognitive and social-structural factors. The thrust of the 
analysis is to uncover general trends in comparative evaluation. Therefore, it steers clear of 
historical and cultural idiosyncrasies, which are a natural contender for explaining deviations 
from a universal pattern, but have limited explanatory power beyond the local context. 

The arguments rest on the assumption that cognitive-judgmental biases are not deep- 
seated to a degree that makes them immune to social influence. In psychological research, 
there is a tendency to view biases as innate and immutable, which leaves little room for 
articulating contingency arguments in a more sociologically informed manner. However, recent 
developments have given rise to more contextual approaches to the study of cognition that 
explore cultural variation in perception. Research in cross-cultural psychology has examined 
differences in motivational and attentional orientations across countries (Triandis 1990) as well 
as cultural differences in the accessibility of mental constructs used to interpret ambiguous 
stimuli (Bender 1990, Hong, Benet-Martinez, Chiu and Morris 2008). By highlighting differences 
in cognitive orientations across countries, my work is a part of an emerging trend. However, it 
deviates from a cross-cultural research agenda by exploring within-cultural variation and 
focusing on social-structural determinants of human perception. In doing so, it seeks to apply 
sociological reasoning to cognitive processes. The spotlight is on the social forces that set the 
premises for the emergence of systematic patterns in the perception of social reality. The 
analysis proceeds in agreement with the sociological convention that cognition is enabled and 
constrained by one’s position in the complex web of social and cultural experiences (Cerulo 
2001: 3). The sociological approach to cognition emphasizes the embeddedness of 
comparative thinking in sociocultural contexts (Zerubavel 2001), making it incumbent on 
sociologically-minded students of cognition to explain how the social environment intervenes 
in self evaluation in a manner conducive to perceptual distortion. 


Theory 


The surprising discrepancy at the aggregate level in how Americans and Bulgarians evaluate 
themselves motivates the examination of status-devaluation patterns at the individual level. 
Given an overall status-devaluation trend, what are the social categories and groups most 
closely associated with it? Past research appears to be of limited help. Status-devaluation 
tendencies have rarely been documented, even though they are not unanticipated. Soon after 
their discovery, “ego-defensive” biases were challenged on both theoretical and empirical 
grounds, bringing attention to “counter-defensive” or “esteem-attenuating” biases operating 
in the opposite direction to self-enhancement. In several experimental studies, respondents 
rated their own performances and abilities as a more important determinant of failure than 
success (Ross, Bierbrauer and Polly 1974). More recently, the work of Kruger directed 
attention to the need for reexamination of the ubiquity of the BTA effect. In a series of 
experiments (Kruger and Dunning 1999, Kruger and Mueller 2002), Kruger and his colleagues 
discovered that people rate themselves as below-average in a number of judgment domains. 
These scholars explain this finding by focusing on the difficulty of the ability domain — people 
report having above-average skills in domains in which absolute skills are high and below- 
average skills in domains in which absolute skills are low (Kruger 1999: 222). In this view, the 
above-average and below-average effects stem from the same source — domain difficulty. 
However, this mechanism can account for variation between, but not within domains. In 
seeking to explain why residents of population A inflate their relative standing in a given: 
activity domain, while residents of population B display the opposite tendency in the same 
domain, one should look for a substantively different set of arguments. 
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Relative Deprivation and Status-Devaluation 


Reference-group and relative deprivation theories are a natural source of insights into the 
emergence of any negative psychic state, be it disappointment, envy (Schaubroeck and Lam 
2004) or self-deprecation. Relative deprivation theory (RDT) has long been used to understand 
how people react to situations of frustration (Feldman, Leana and Turnley 1997: 43). At the core 
of the theory is the basic principle that people evaluate their outcomes against subjective rather 
than absolute standards (Festinger 1954, Merton 1968). Subjective standards are derived from 
two principal sources: reference groups (Merton 1968) and one’s own past experience (Taylor 
1982). Comparison to reference groups engenders feelings of relative deprivation when the 
person making the comparison recognizes that she is not receiving valued resources to which 
she feels entitled. In the classic study “The American Soldier,” Stouffer, Suchman, DeVinney, 
Star and Williams (1949) discovered the contradiction between feelings of satisfaction or 
deprivation and the objective situation among groups of soldiers. It was not the absolute level 
of attainment that mattered for poor morale, but relative deprivation — the discrepancy between 
what one anticipates and what one attains. Even more advantaged soldiers felt deprived 
because they chose to compare themselves with others who were even better off. In this 
manner, a perception that an expectation has been violated engenders feelings of injustice, 
dissatisfaction, discontent and outrage at this violation (Grant and Brown 1995: 196). 

Even to the unseasoned observer of postcommunism, the turbulent aftermath of socialism 
appears as a fertile breeding ground for feelings of relative deprivation. A large part of the 
populace in the former socialist states experienced deterioration of life chances in 
postcommunism, contradicting expectations nurtured upon the commencement of reforms. 
Under conditions of high uncertainty and increasing inequality (Szelenyi and Kostello 1996, Gerber 
and Hout 1998), dissatisfaction with the present mushroomed as a powerful collective sentiment. 
For example, while real net income increased overnight in the Eastern provinces following the 
re-unification of Germany, satisfaction with living standards declined, mainly because East 
Germans increasingly compared themselves with their western counterparts (Headey, Krause 
and Habich 1995). However, unfavorable comparison to the more advantaged was not the only 
source of dissatisfaction for East Germans. Feelings of deprivation also grew out of unfulfilled 
expectations and an unfavorable comparison to the past. The discrepancies between desired and 
achieved status, and that between past and present status are important sources of stress in self- 
evaluation. The relative deprivation literature has mainly examined the former, in terms of the 
influence that high expectations and comparison standards exert on satisfaction with the present. 

With few notable exceptions (Taylor 1982), past status has escaped attention as a source 
of relative deprivation. However, under conditions of accelerated social change, the previously 
occupied social position is a key platform with which expectations for the future are formed 
and life outcomes evaluated. Generally, expectations for the future reflect the social position 
currently occupied and the continuity of the status order. Research in social psychology 
shows that status differences determine expectations for future performance, thereby 
shaping the distribution of influence and prestige (Berger, Cohen and Zelditch, 1972). Under 
conditions of historical continuity and low uncertainty, expectations for the future tend to 
reproduce current status hierarchies. However, when history is punctuated by sudden shifts 
in the principles of social organization, the correspondence between status and expectations 
is likely to diminish. Hence, it is imperative to differentiate between the social position 
occupied at the outset of reforms and the set of expectations held about the future to better 
understand the sources of deprivation in postcommunism. 

An extension of the relative deprivation framework along these lines suggests a two- 
pronged mechanism whereby perceived status is diminished in comparative evaluation. On 


one hand: 
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Higher past expectations for prosperity increase the probability of 
status-devaluation, ceteris paribus (Hypothesis 1). 


On another hand: 


Higher perceived past status increases the probability of status- 
devaluation, ceteris paribus (Hypothesis 2). 


These hypotheses formalize the expectation that the experience of relative deprivation 
amplifies the perception of status loss by reinforcing the belief that one has fallen further 
down in the status order than one actually has. This belief results in a sociologically 
meaningful gap between individual perception and reality. 

The analysis of status devaluation as an outcome of invidious comparison to reference 
points derived from past states and expectations affirms an identity-based understanding of 
perception that lies in agreement with basic tenets of congruence theory. According to this 
theory, experiences of distress and anxiety are a product of the disruption or interruption of 
the continuously adjusting identity process that underlies normal or healthy functioning (Burke 
1991: 841). The experience of status devaluation can thus be attributed to the interruption of 
the identity maintenance processes that imparted stability to everyday life under socialism. 
Postcommunism generated considerable distress and anxiety by impelling the realignment of 
identity and self-perception into a new feedback loop of identity maintenance. Congruence 
theory suggests that these transformations will be particularly stressful when individuals are 
highly committed to the interrupted identities and when these identities are more central and 
important to them than other identities. Thus, an expectation shared by both relative- 
deprivation and congruence theories is that the greater the salience attributed to the past, the 
greater distress interrupted identities will produce, with the status-devaluation bias 
representing one prominent manifestation of the experience of distress. 


Social Networks and Status Devaluation 


The processes of relative deprivation and identity interruption are context specific in the sense 
that they allude to reference points particular to an individual or a group in a given location. 
Besides such factors, more general social forces operate on human perception in important 
ways. Individuals are connected to larger social entities, such as the neighborhood, the city or 
the country, through networks of interpersonal ties that filter their perception of the social 
milieu. An inaccurate estimate of the distribution on a given dimension in a population could 
be partly due to the configuration of network ties, which determines the quality and volume 
of information accessible to the individual (Burt 1992). If individuals are mostly drawing ona 
restricted pool of experience and information, such as the one circumscribed by their kin and 
close friends, they would be ill-equipped to gauge population-wide distributions. The status- 
devaluation bias might therefore reflect the operation of “selective exposure” mechanisms 
(Marks and Miller 1987), whereby actors generalize from their personal networks to the target 
population, as a result of the availability of their personal sample (Kitts 2003: 226). 

It is a sociological axiom that a reference set of family members, friends and co-workers has 
the most profound impact on people’s judgments (Lockwood 1966, Merton 1968). People 
typically assess their social position in light of the distribution of education, occupations and - 
income among those around them (Evans et al, 1992: 465). However, proximate others are 
usually similar to the individual along many dimensions, such as income or education. As a result, 
personal networks appear homogeneous with respect to many socio-demographic, behavioral 
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and intrapersonal characteristics (for a review, see McPherson, Smith-Lovin and Cook 2001). 
Network homogeneity introduces bias in comparative evaluation by inducing undersampling on 
dissimilarity. By comparing themselves to others of objectively similar characteristics, people tend 
to overestimate the number of others who are similar to themselves. The consequences of 
overestimating similarity vary according to the categories of the status order. For low-status 
actors, it typically leads to inflated self-estimates, while for high-status actors, it leads to lower 
self-estimates than merited because the benchmarks used significantly exceed the population 
mean (Evans et al, 1992). Note that while this mechanism can help explain the status-devaluation 
tendencies of higher-status actors, it is not well-suited to account for a potential failure of low- 
status actors to enhance their self estimates, as will be shown to be the case in Bulgaria. 

The role of personal networks as factors of evaluation precision is based on both their 
socio-demographic composition and their structural properties, which determine the extent 
to which information is obtained from diverse, non-redundant sources (Burt 1992). The 
circulation of information within cohesive networks of dense ties sets the premises for 
misjudgment in comparative evaluation by virtue of the more limited network range and 
relative shortage of bridging ties to diverse external sources. Actors in more cohesive 
networks are at an informational disadvantage relative to members of sparse networks 
featuring a greater number of weak ties spanning diverse informational sources (Granovetter 
1973, Burt 1992). Weak ties tend to pull people into cosmopolitan rather than local 
interactions, casting a much wider informational net through the social world than strong ties 
(McPherson, Popielarz and Drobnic 1992: 166). 

Network cohesion is useful as a cross-sectional proxy for the degree of exposure to 
diversity, but it offers limited insights into temporal patterns. The extent to which a network is 
reconfigured over time is important in comparative evaluation. Structural inertia reduces the 
informational edge provided through weak ties, as the network congeals and the degree of tie 
redundancy and uniformity increases. | expect personal networks experiencing significant 
structural change over time to contribute to more precise self-estimates. Dynamic networks 
are more diverse and feature a greater proportion of weak ties, as documented in studies 
linking tie strength to network change (McPherson, Popielarz and Drobnic 1992). Compared 
to networks in flux, static networks are more conducive to the gestation of influence patterns 
among members, reinforcing similarity in the network. As shown in ecological research, 
structural inertia constrains exposure and adaptability to environmental change (Hannan and 
Freeman 1984), promoting an orientation of self-centeredness. 

Based on these considerations, | expect that the smaller socio-demographic variation and 
greater tie redundancy in more cohesive and more static networks increase the probability 
of status-devaluation. Formally expressed: 


More cohesive personal networks increase the probability of status- 
devaluation, ceteris paribus (Hypothesis 3) and, 


More static networks increase the probability of status-devaluation, 
ceteris paribus (Hypothesis 4). 


The hypothesized effects are driven by the homogeneity of reference points in more cohesive 
and static networks, which induce undersampling on dissimilar others. Static and cohesive 
personal networks are more likely to be used as reference groups in the first place, because the 
longer history of interaction and greater tie strength make them more central to the individual. 

In sum, two mechanisms are hypothesized to explain the status-devaluation bias 
observed in Bulgaria. The main one is the experience of relative deprivation, depressing self 
estimates as a result of invidious comparison to reference points of high past expectations 
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and high past status. In addition, selective exposure mechanisms in more cohesive and 
static networks promote status devaluation by reducing exposure to diversity and reinforcing 
reference points similar to oneself. These mechanisms are complementary to the extent that 
relative deprivation helps explain why low-status respondents fail to self-enhance in the 
manner anticipated by the homogeneity principle. 


Data and Variables 


Data for the analysis come from a multi-stage, random-probability sample of 1887 Bulgarian 
respondents. The sample is stratified regionally and within regions according to rural/urban 
divisions and settlement size. Within the primary sampling units, individual respondents were 
chosen on the basis of standard random procedures. Data were collected through face-to- 
face interviews in 2002. The survey questions cover a wide range of topics, including 
occupational history, expectations for the future, satisfaction with past and present, 
participation in collective action and similar others.° 

There are three sets of variables in the analysis. The first one is socio-demographic, consisting 
of age, years of schooling, income, settlement size, gender and ethnicity.° Basic descriptive 
statistics are presented in Table 1. The sample is representative of the gender and ethnic mix of 
the population, as revealed in census data. Income is recorded per household member, which 
has the advantage of controlling for the number of working adults in the household. 

As a measure of material welfare other than income, | created a “wealth” index that counts the 
possessed items in a “fairly new condition” from the following set — automobile, refrigerator, 
microwave oven, washing machine, color TV set, CD player, personal computer and a mobile 
phone. About 60 percent of respondents possess none of these items in a “fairly new condition’. 

The second set of variables measures the experience of relative deprivation - the first 
mechanism on which this study focuses. As a measure of perceived past status, respondents 
were asked to designate the position they occupied in the status order in 1989 on a scale 
from 1 to 10, where 10 stands for highest status. The observed pattern is for respondents to 
evaluate their past status as considerably higher than their present one. The median value in 
the past status distribution is 6 and only 3 in the current status distribution.’ In addition to the 
“perceived” measure, | include an “objective” measure of past status, which gauges the 
prestige of one’s occupation in 1989 by way of three dummy variables, indicating a position 
of administrative authority, non-administrative/non-manual labor and manual labor. The base 
category is “not working in 1989."° The second variable measuring relative deprivation 
examines expectations for the future (as of 1989). Respondents were asked to evaluate their 
life prospects as of 1989.° Nearly 90 percent report either “good” or “very good” prospects 
ahead of them at that point of time." 

The third set of variables measures structural properties of respondents’ personal 
networks — the second mechanism on which this study focuses. The limited choice of 
network measures in the survey permitted the use of only two variables to test the structural 
hypotheses. The first one measures the extent to which the composition of networks 
changes over time. The first option indicates little or no change, meaning that respondents 
have mostly interacted with the same set of contacts over time. The second option stands for 
significant change, with the third one standing for structural isolation.’ Out of these 
categories | created two dummy variables, with the isolation category as the base. About the 
same number of respondents report having dynamic and static personal networks. 

The second question is a proxy for the degree of social cohesion of the network. It asks 
respondents to categorize their level of closeness with contacts in their network. The 
categories were designed to correspond to a strong, moderate and weak level of closeness 
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Table 1: Descriptives and Frequencies of the Variables Used in the Analysis of Status-Devaluation 


Standard 
Variable Mean Deviation Minimum — Maximum 
Socio-demographic 
Female 02 48 
Ethnic minority ain oo 
Settlement size 5.21 2.71 { 9 
Age 47.71 16.43 18 91 
Education (years) 9.95 2.81 4 15 
Income 102.49 68.69 50 400 
Wealth (number of items) 1.01 1.64 0 8 
Relative Deprivation 1989 ‘ 
Perceived status 6.20 2.39 { 10 
Past expectations 3.12 oh 1 4 
Managerial work 07 .26 
Non-managerial skilled work 34 AT 
Non-managerial non-skilled work .20 39 
Not working 39 52 
Social Networks 
Social cohesion 2.15 1g 1 3 
Network inertia .408 491 
Network change 431 495 
No friends 161 .366 


or cohesion.”? Considering the apparent flaws in the wording of the question, | decided to use 
it as originally intended following reassurances by the data collection team that the question 
was explained to respondents as probing the degree of closeness to their social contacts. 
The response categories were likewise explained as standing for high, medium and low level 
of closeness. However, the question can also be interpreted as a measure of multiplexity 
rather than social cohesion and the categories as corresponding to multiplex supportive ties, 
specialized supportive ties and no supportive ties.'* Acknowledging the nature of the question 
as a mixed measure of social cohesion and multiplexity is acceptable insofar as cohesion and 
multiplexity tap into similar social processes and lead to similar substantive conclusions. 
Multiplexity and cohesion are distinct structural properties. The presence of multiplex ties 
does not mean a shortage of weak ties. Yet, multiplex ties and strong ties are both correlated 
with homogeneity or the degree to which similarity in the network is reinforced. Having strong 
multiplex ties is an indication that actors are likely to have a homogeneous network that 
strongly influences them. Whatever label is attached to that measure, the substantive 
argument it conveys is centered on the role of structural similarity in self-evaluation. It is 
reasonable to expect that networks defined by the first response category are more cohesive 
and less exposed to diversity on average than the other two types, with similar distinction in 
degree applying to the second and third categories. This interpretation receives some support 
from the consistency in responses across the two network questions. While 58 percent of 
respondents who chose the high-cohesion category also reported having static networks, 
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only 35 percent of those who chose the second category did likewise. This finding suggests 
that the cohesion and inertia categories measure similar processes that have mainly to do 
with network homogeneity. 

Considering the fact that data limitations prevent the use of sophisticated network 
measures, the available two are used as proxies for the degree of homogeneity or exposure 
to diversity. Network diversity is a function of how personal contacts are distributed across 
different areas of social space and of the strength of connections to those contacts (Reagans 
and McEvily 2008: 255). Finding significant predictive power for the two measures will lend 
some support to the capacity of networks to influence comparative evaluation by regulating 
exposure to diversity. 


Analysis 


My research strategy includes a two-stage analysis. The first stage provides evidence for the 
existence of a status-devaluation bias in self-evaluation by comparing the distributions of 
objective and perceived status on material welfare and categorizing respondents according to 
their degree of precision. The second stage uses the obtained error scores as a dependent 
variable in a multivariate analysis, testing the predictive power of the factors hypothesized to 
promote status devaluation. 


First Stage 


A key obstacle to my research strategy is the ambiguous wording of the main question of 
interest (Relative to others, how would you describe your level of material welfare?”), 
which requires several assumptions to guide the statistical modeling of status-devaluation. 
First, it is unclear whether respondents had the mean or median in mind when they 
answered the question. In recognition of the fact that the median is the more cognitively 
taxing measure and in agreement with previous research (Zuckerman and Jost 2001), | 
make the assumption that respondents use the mean as a benchmark. The second 
assumption concerns the population of “others” to whom respondents compared 
themselves when making their estimates. Did they interpret “others” to mean the country, 
the region, the city or the neighborhood? Insofar as people derive information about self- 
worth from comparison with relevant others, local reference points are more significant 
than global ones (Zuckerman and Jost 2001: 209). Considering that the data cannot support 
analysis at the neighborhood or city level, | decided on region as the primary unit of 
analysis. The intermediate position of region on the scale ranging from family, on one end, 
to country, on the other, renders it an appropriate reference point. Note, however, that the 
assumption does not hold that people resort mainly to regional benchmarks in self- 
evaluation, but simply that they have a better idea of the distribution of material welfare at 
the regional than country level. There are 31 geographic regions in the sample, averaging 
about 240,000 people. The last remaining gray area in the wording of the question concerns 
respondents’ understanding of “material welfare”. Did respondents interpret “material 
welfare” in terms of wealth or income? Because data on both are available, no assumptions 
are required here. | run separate analyses with income and wealth data and compare the 
results to confirm the validity of findings. 

The first step in the analysis is the standardization of the income variable by subtracting 
the regional mean from it. This procedure takes out a part of the unobserved regional 
variance, making income comparable across regions. The obtained income distribution is 
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heavily skewed, with 68 percent of respondents positioned below the mean." 

The next step is the comparison of perceived and objective material welfare, as measured 
by income. One way to make this comparison is to split the objective income variable in three 
parts that correspond to the categories of perceived status: “below-average,” “average” and 
“above-average.” Because no rule of thumb is available for drawing the boundaries of the. 
categories, several assumptions guide the segmentation process. First, the “average” 
category in the objective distribution should be roughly equal in size to the “average” category 
in the perceived distribution. As the ‘average’ category is of little substantive interest relative 
to the “above-" and “below-average” categories, | assume the proportion claiming to be 
average (30 percent) is the true estimate of the size of the “average” category in the objective 
distribution on material welfare. 

After experimenting with several values for the cut-off point, | settled on 20 levs." 
Respondents whose income deviates by no more than 20 levs from the regional mean in 
each direction are considered “average.” Respondents whose income exceeds the upper 
bound of the average category (equal to the value of the regional mean plus 20) are 
considered “above-average.” Those whose income is smaller than the lower bound of the 
average category are coded as"below average.” The obtained “objective” distribution of 
income is as follows: 46.03 percent of respondents are below average, 29.52 percent are 
average, 24.46 percent are above average. 

Comparing the obtained distribution to the subjective distribution reveals a significant bias 
toward status devaluation. By design, the average category is the same size in the two 
distributions, but the share of below-average respondents is much greater in the subjective 
distribution than it is in the objective distribution (68 percent vs. 46.03 percent, p < .001). 
Conversely, the share of above-average respondents is much larger in the objective 
distribution than it is in the subjective distribution. (24.46 percent vs. 1.8 percent, p < .001). 

As an additional check on the status-devaluation finding, | employed an alternative 
measure of perceived status. A survey item asks respondents to “designate their position in 
the social order on a scale from 1 to 10,” where 10 stands for highest status. The proportion 
of respondents who positioned themselves below the midpoint of the scale (values 1 through 
5) is overwhelming — 93.7 percent! | collapsed the response scale to correspond to the three 
categories of perceived welfare” and notably, the obtained distribution mirrors the distribution 
of perceived welfare (67.1 percent, 31.65 percent and 1.25 percent in the former and 68.2 
percent, 30 percent and 1.8 percent in the latter). The consistency of results obtained across 
different specifications of perceived status provides additional evidence for the biased 
perception of inequality. It allows a greater degree of confidence in the inferences made so far 
and corroborates the decision on the size of the average category. The most substantive 
implication is that in the popular view the social order in Bulgaria resembles a pyramid with a 
solid base that narrows steeply toward the top. Wealth is thought to be concentrated in a tiny 
layer at the top. However, the results suggest that the popular perception is incorrect. While 
the distribution of material welfare is indeed skewed, it is not nearly as skewed as 
respondents perceive it to be. 

Following the aggregate-level analysis, | calculated the degree of correspondence between 
perceived and objective status on material welfare for each respondent. Half of all respondents 
were accurate in their estimates, 8.96 percent devalued their status by two units (claimed 
below-average status while objectively being above-average), 34.18 percent devalued their 
status by one unit (either from high to average or from average to low), 6.63 percent increased 
their status by one unit and just .3 percent increased it by two units. The analysis reveals a 
dependency between status level and evaluation accuracy. While 85 percent of low-status 
respondents are accurate in their self-estimates, only 5 percent of high-status and 33 percent 
of middle-status respondents are similarly on target. This finding indicates that the peculiar 
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distribution of perceived status on material welfare in Bulgaria is a joint product of the status- 
devaluation tendencies of middle- and high-status actors and the failure of low-status actors to 
self-enhance in the manner Observed in many other social contexts. 


Second Stage 


To establish the degree to which the theoretical constructs are able to account for outcomes 
of status devaluation, | ran a multivariate analysis, where the dependent variable is an error 
score, measuring accuracy in self-evaluation. The dependent variable has three categories: (1) 
status devaluation (the self-estimate is lower than the objective estimate), (2) accurate self 
evaluation (the self estimate equals the objective estimate) and (3) status enhancement (the 
self estimate is higher than the objective one). In view of the polytomous nature of the 
dependent variable, | estimate a multinomial logistic regression.The correlations for the 
variables are presented in Table 2. They tend to be low, except for the one between the two 
measures of relative deprivation, whose effects on the dependent variable will be estimated 
independently. 

The results of the multinomial regression analysis are reported in Table 3. The coefficients 
are the log odds of status devaluation and status enhancement relative to making an accurate 
estimate. To better capture the independent effects of the covariates, | estimated three 
models. The relative deprivation arguments will be supported if the coefficients for past 
expectations and perceived past status are positive and significant net of the effects of 
objective past and present status. Note that the dependency the analysis seeks to establish 
is between the level of precision in self evaluation of status on material welfare, as measured 
by income, and the self estimates of general social status and life prospects at the end of 
socialism. It should be remembered that in socialism social status was based to a greater 
extent on party membership and position in the party hierarchy than on income and wealth 
(Konrad and Szelenyi 1979). 

In the first model, | added the measure of perceived past status to the control variables. 
The observed effect lends support to Hypothesis 1. Controlling for occupational prestige in 
1989, a one-unit increase on the scale of perceived past status raises the odds of status 
devaluation by 6 percent [100(e°'-1)], as compared to the baseline. The second model 
introduces the past-expectations measure. The coefficient is significant and positive, 
substantiating Hypothesis 2. A one-unit increase on the expectations for prosperity scale (as 
of 1989) is associated with a 20 percent increase in the odds of status-devaluation in the 
present, as compared to the baseline. 

The results support the arguments derived from relative deprivation and congruence 
theories. The ways in which people assess their relative standing on material welfare are 
significantly affected by the perceived gap between past and present status, controlling for 
objective status differences. The higher the perceived status position occupied in socialism, 
the more likely people are to devalue their relative status (on material welfare) in the present. 
The anchoring of self-identity in a reference set of past status and past expectations 
underlines those features of the environment that impede the alignment between past, 
present and desired statuses, enhancing dissatisfaction with the present. Even for high-status 
individuals comparison to the past may invoke negative feelings because relative advantages 
in the present are viewed as minor compared to the absolute loss in prestige or purchasing 
power over time. That is, cross-sectional differences are insufficient to offset the negative | 
perception of longitudinal (or “within-individual”) variation. As a result, only those at the apex 
of the social order consider their earnings large enough to justify an above-average self 
estimate on material welfare. 
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Turning to the social-structural part of the analysis, the coefficients for the network 
measures in all three models support Hypotheses 3 and 4. The strong, positive effect for 
cohesion in the first two models attest to the capacity of cohesive networks to set in motion 
status-devaluation dynamics. In Model 2, a one-unit increase on the social cohesion scale 
increases the odds of status-devaluation by about 35 percent. Cohesive networks usually form 
between similar others and are excellent conduits of social influence, reinforcing similarity in 
the network (Friedkin 1998). As past research suggests, a reference set of objectively similar 
peers is conducive to overestimating the share of similar others in the population, leading to 
lower self-estimates for actors of higher status (Evans et al, 1992). A similar conclusion is 
reached in the third model. Reduced exposure to diversity in static networks reinforces the 
perception of similarity in ways that promote status devaluation. Embeddedness in a static 
personal network doubles the odds of status devaluation, as compared to the base category. 
The lack of statistical significance for the network coefficients in the status-enhancement 
models indicates that network homogeneity does not lead to self enhancement among low- 
status respondents, which can be attributed to the operation of relative deprivation. 

These findings confirm the intuition that network homogeneity, a correlate of both 
structural inertia and social cohesion, helps explain status devaluation. The formation of 
cliques, where ego is surrounded by the same alters over time, leads to overemphasizing 
internal standards. Likewise, increasing cohesion curtails diversity in terms of information, 
attributes and attitudes (Burt 1992). This form of closure promotes a systematic bias in the 
perception of reality by imposing a set of similar social categories as reference points that 
homogenize the perceived environment. One should bear in mind that social networks 
crosscutting different status levels and social categories to an extent that makes them 
broadly representative are rare, given the powerful tendencies toward “focused” interaction 
and triadic network closure (Heider 1958, Granovetter 1973). Social interaction is highly 
localized and concentrated along several social dimensions, such as class, neighborhood, 
race and ethnicity (Feld 1991). However, as shown here, the structural properties of networks 
influence the ways in which people see themselves. All other things being equal, more static 
and cohesive personal networks make it more likely that people of higher status will see 
themselves in more negative light than merited. 

While the structural preconditions of status devaluation may be found in any social setting, 
the emergence of the bias in Bulgaria was likely facilitated by the process of structural 
fragmentation in postcommunism. The demise of socialism marked the onset of a period of 
considerable uncertainty, social dislocation and reconfiguration of pre-existing networks of ties 
(Stark 1996). As social identity theory suggests, a high degree of uncertainty encourages in- 
group identification and in-group biases (Hogg and Mullin 1999). Confronted with the uncertainty 
of reforms and the challenge of adjustment to new realities, people increasingly oriented 
themselves to familial and friendship networks for assistance and support. Past studies show 
that uncertainty accelerates the rate of dissolution of ties between non-similar others and the 
formation of niches of similarity in social space (McPherson, Smith-Lovin and Cook 2001). In full 
swing, these forces lead to accelerating social segmentation and increasing distance between 
dissimilar others (Popielarz and McPherson 1995: 699). This anomic tendency is associated with 
a high level of dissatisfaction in social communities (Seeman 1972). 

That status devaluation is a symptom of structural fragmentation can only be established 
with longitudinal network data of a kind that is rarely collected in postcommunist countries. As 
a working hypothesis, it stands at odds with the popular perception of postcommunist societies 
as excessively rich in network capital. For example, Sik and Wellman (1999: 225) argue that in — 
the post-communist era people rely even more on networks than they did under communism. 
The increase in network capital is attributed to the coping strategies of households in hard 
times, when loyalty to the household tends to increase. However, the increase in exchanges 
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within and between households belonging to the same kin structure does not necessarily lead 
to an increase in total network capital, considering that such gains are likely to come at the 
expense of the volume of exchange between non-kin. As past research implies, the uncertainty 
stemming from abrupt institutional changes encourages the reconfiguration of social networks 
in a manner that minimizes distances within cliques, so as to serve as a safeguard against 
uncertainty. One of the consequences of this structural shift is the decrease in the mean level of 
exposure to diversity, which creates the preconditions for distortion in the perception of reality. 
Indirect evidence in-support of this argument comes from research in Hungary, showing 
that the social-structural changes in postcommunism do not conform to a linear trend, but 
include the simultaneous contraction of friendship networks" and enhanced reliance on 
family/kin ties, especially in the lower social strata (Angelusz and Tardos 2001). In their analysis 
of panel data from 1987 and 1997, Angelusz and Tardos (2001) find that personal networks 
have become more homophilous over time and more closed with respect to key social 
markers, such as education and occupation. Increasing homogeneity results from the 
fragmentation of social space in the form of smaller personal networks and increased reliance 
on family/kin ties. In such a structural landscape one would expect weak ties to be especially 
valuable as bridges across disjointed clusters of ties. This is indeed one of the conclusions 
reached by Angelusz and Tardos (2001: 319), who emphasize the tendency toward increasing 
inequality in access to non-family, non-kin network resources in post-communist Hungary. 


Limitations of the Analysis and Future Research 


Several limitations of the analysis need to be assessed with respect to their impact on the 
validity and generalizability of findings. Most importantly, the analysis of status-devaluation is 
based on a single measure of relative standing, whose crudeness necessitated several 
assumptions to enable statistical analysis. This limitation is partly redressed by the use of an 
alternative measure of relative social status, which revealed an almost identical distribution 
and produced very similar results to the ones reported in the material-welfare analysis. 
However, a survey designed to measure self-evaluation on several dimensions will provide 
much greater analytical leverage over the antecedents of status-devaluation than hitherto 
achieved. In addition, to refine the scope conditions of findings and control for context- 
specific factors, the analysis should include data from other post-communist countries. The 
analysis focused on within-country variation, but some factors of status devaluation can only 
be assessed in comparative light. 

Data limitations prevented a more thorough analysis of the social-structural factors of status 
devaluation. However, even the rudimentary network measures used were sufficient to 
ascertain the relationship between the properties of personal networks and precision in self 
evaluation. A better survey instrument will most likely make this relationship even more salient. 
Another limitation of the analysis is the inability to control for the role of current reference 
points in estimating the effects of past ones. However, upward comparison is a universal 
human proclivity and it is not readily apparent why it should play a more significant role in this 
context than in any other. Similarly, external reference points, such as Western Europe, are no 
doubt important in self-evaluation and must be taken into account if allowed by the data, but 
their omission does not invalidate the results. Consider the change in self-evaluation patterns 
over time, as reflected in the self estimates of survey respondents."® While the distribution for 
1989 reveals the clustering of self estimates above the midpoint (63 percent), that number 
drops stepwise to 24 percent in 1996 and 6 percent in 2002. Increased travel and exposure to 
Western lifestyle since the end of communism has undoubtedly contributed to the decline in 
the share of those considering themselves high-status. Yet, the effect is probably much 


1514 © Social Forces Volume 84, Number 3 * March 2006 


stronger for the early than late 1990s and is a poor guide to the decrease in mean self 
estimates between 1996 and 2002. One should also bear in mind that internal or socially 
proximate reference points are more deeply experienced than external or remote reference 
points. Nevertheless, the role played by external reference points in the occurrence of the 
status-devaluation bias needs to be addressed in greater detail in future research. 

The cross-sectional nature of the data inevitably challenges the validity of inferences made. 
Perhaps most vulnerable in this respect is the established effect for past reference points, 
raising the question as to whether these are an antecedent or a consequence of status 
devaluation. It is not unreasonable to expect that seeing oneself in less favorable light than 
merited paints a rosier picture of past achievements.” If so, the question arises as to what does 
account for the bias observed? The breadth of the data allowed me to control for a range of 
socio-demographic characteristics, which significantly narrowed the space available for 
developing orthogonal explanations. Against a heavily controlled background, the hypothesized 
effects reached statistical significance at the accepted levels. The findings supported an 
interpretation of status devaluation that is consistent with previous research, albeit in ways that 
have not attracted much attention so far. 


Conclusion 


The main contribution of the analysis is in highlighting a previously ignored mechanism 
whereby perception drifts away from empirical reality. In asserting a distinctly sociological 
account of how perceptual biases emerge and are perpetuated over time, this paper examined 
two sets of factors. The first one, derived from relative-deprivation and identity theories, relates 
to the identity-interrupting experience of postcommunism. The use of old identities as a 
platform in the creation of new ones carries the risk of aggravating discontent with the present. 
This risk is revealed in the finding that reference points of high past expectations and high past 
status predispose to status devaluation. The second factor is structural in nature. The cohesion 
and inertia of personal networks generate pressure toward status-devaluation among higher- 
status actors by reducing exposure to diversity and encouraging oversampling on similar 
others. This finding lends support to Burt's contention that “network closure produces ignorant 
certainty” (2001: 63) as a result of the tendency for attitudinal reassurance in cohesive 
networks and the restricted inflow of non-redundant information.” 

These factors are generalizable beyond the local context and are useful in addressing 
dependencies far removed from the intricacies of postcommunism. The status-devaluation bias 
is not exclusive to a country or a region sharing a particular historical trajectory. Latent status- 
devaluation tendencies are expected to intensify under conditions of high uncertainty and rapid 
change, which encourage in-group identification and invidious comparison to the past. However, 
the emphasis on general mechanisms invites the recognition that local conditions make these 
mechanisms more or less salient. Thus, the past weighs in heavier in the assessment of the 
present in the aftermath of a historical rupture of such magnitude as the demise of socialism. 

The analytical approach advanced here, straddling the interface between micro-and macro- 
level processes, has generally been eschewed in postcommunist research, which tends to focus 
on political and economic changes, giving short shrift to the ways in which people interpret and 
respond to these changes. However, as burgeoning research on “emergence” and complexity 
makes clear, the interdependence between individual responses generates perceptual patterns 
and collective sentiments that find form in macro-level outcomes (Macy and Willer 2002). 
Postcommunism provides a good opportunity to observe how systemic change sets in motion 
processes of identity-conversion and structural transformation, whose interplay can lead to the 
establishment of social-psychological equilibria that are rarely observed elsewhere. 
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Notes 


iy 


The General Social Survey (GSS) is a survey that has been administered to Americans 
since 1972. Previously an annual survey, the GSS became biennial in 1994. The basic 
purpose of the GSS is to gather data on contemporary American society in order to 
monitor and explain trends and constants in attitudes, behaviors and attributes. The data 
was obtained from the website of the Inter-University Consortium for Political and Social 
Research (ICPSR) at the University of Michigan. 


Note that if the BTA effect applied fully to class evaluations, most people should rank 
themselves as upper-middle class. | am indebted to a reviewer for pointing this out. 


The exact wording of the question in the Bulgarian survey is “Compared with others, how 
would you define your level of material welfare?” The options are “below-average,” 
“average” and “above-average.” 


There is an important scope condition associated with this bias. Implicit in the question on 
material welfare is the traditional three-tier view of the social order (low, middle and high 
status). Therefore, the bias may apply only to middle- and high-status actors, while low- 
Status actors are either accurate or enhance their self estimates. 


The survey was commissioned by the “Open Society Foundation” and designed by the 
Center for Liberal Strategies (Sofia, Bulgaria). The data were collected by “Alpha Research 
Marketing and Social Research.” The response rate exceeds 80 percent. 


Ethnicity is a dummy variable, where the “minority” category includes Bulgarian Muslims 
and Gypsies. 


The perceived-status distribution in 1989 conforms to a status-enhancement form, rather 
than of status devaluation. This indicates that the status-devaluation bias is not a cultural 
idiosyncrasy. Although the data are subject to reliability concerns, there is little to suggest 
that a tendency for lower self estimates in the present creates a bias toward higher 
estimates of one’s past status. 


The crudeness of the occupational prestige variable is acceptable given its role as a 
control variable. It is not intended to capture the finer prestige distinctions within the 
occupational system under socialism, but simply to provide background against which to 
measure the relative predictive power of perceived status. 


The categories are “very good,” “rather good,” “rather bad” and “bad.” 


. The respective number for the present is 22 percent. The question of whether 


respondents tend to exaggerate their life prospects as of a prior period is inconsequential 
for the analysis that follows, which focuses on how the perception of the past conditions 
self-identity in the present. 


“In recent years many people changed their friends and the people they interact with. 
Which of the following applies to you?” 


1. The people | feel close to and rely on are the same ones | was close to several years ago. 


1516 © Social Forces Volume 84, Number 3 *« March 2006 


2s 


18. 


20. 


2. The people | feel close to and rely on are different from the ones | was close to years ago. 
3. There are no people outside my family whom | feel close to and rely on. 


“People feel a different level of closeness to those around them. How would you 
characterize your relationships with the people in your network?” 


1. There are people | can rely on in various cases in my professional and personal life. 
2. There are people | can rely on in specific cases only. 
3. There are no people | can rely on. 


. am thankful to a reviewer for bringing this alternative interpretation to my attention. 


. The results of the analysis are substantively similar when the income data is standardized 


by the median rather than the mean. 


. 1 Lev is equal to $.63 (U.S.) as of March 2004. 


. The spread of the average category is therefore 40 levs. The assumptions involved can 


best be illustrated by means of a comparison with the United States. A spread of 40 levs 
constitutes 40 percent of the mean value of income in the sample (102 levs) and 60 
percent of the standard variation (68 levs). Let us assume the average American 
respondent earns $40,000 a year or $3,333 a month. The spread in this case would 
amount to $16,000. This means that everyone earning between $32,000 and $48,000 a 
year or between $2,667 and $4,000 a month would be considered average. Apparently, 
this procedure allows considerable room for error to respondents and is likely 
underestimating, rather than overestimating, the share of those who are average and the 
magnitude of error in self evaluation. 


. Responses in the range of 1 to 3 were aggregated into a below-average category, those 


from 4 to 7 into an average category and those from 8 to 10 into an above-average category. 


The mean number of friends reported in the Panel Study of Hungarian Households 
decreased from 7.1 to 4.5 from 1993 to 1997 and the proportion of respondents who said 
they had no friends increased from 20 to 30 percent over the same period (Angelusz and 
Tardos 2001: 301). 


. Respondents were asked to designate the position they occupied in the status order in 


1989, 1996 and 2002 ona scale from 1 to 10, where 10 stands for highest status. 


Stated differently, the alternative hypothesis is that pessimistic attitudes lead to inflated 
estimates of past status and expectations. Yet this alternative tells us nothing about how 
pessimistic attitudes evolve in the first place. In the case of past expectations, the 
causality concerns are weaker, because it is not readily apparent why discontent in the 
present would produce a systematic upward bias in recalling past expectations. To 
address the possibility that discontent or pessimistic attitude drive much of the 
dependencies observed, | included an indicator for pessimistic disposition in the analysis: 
“Do you look forward to the future with hope or fear?” The coefficient is insignificant and 
does not change the pattern of findings, providing some evidence that psychological 
idiosyncrasies Cannot account for the observed bias toward status devaluation. These 
analyses are available from the author upon request. 
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21. While the two factors are distinct in nature, it may be argued that they exert combined 
pressure towards status devaluation. However, no interaction effects were found in the 
analysis. 
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Abstract 

Using an abbreviated version of Levenson’s (1981) locus of control scale, we examine 
change over time in the locus of control of displaced steelworkers. The first data collection 
occurred approximately six months after plant shutdown, the second occurred a year later. 
Utilizing a multidimensional measurement model, we test the major assumption that 
locus of control is a stable personality characteristic. The results of our analysis suggest that 
the measurement model is stable over time. However, we find evidence that locus of control 
changes over time. We argue that heightened perceptions of both internal and external 
control among men forced into unemployment may in part be a function of the reference 
group to which workers compare their perceived success or failure. Moreover, locus of 
control may be a function of the reality of institutional constraints in the face of 
unemployment. 


Locus of control (LOC) may be one of the most studied concepts of any personality 
characteristic theorized in psychology or social psychology (Phares 2001). The first internal- 
external locus of control research was conducted by Phares (1957), but the concept was 
made popular by Rotter (1966) who developed a scale which was subsequently incorporated 
into many national and longitudinal studies. The concept also found its way into sociology 
where external control has been conceptualized as powerlessness, a dimension of alienation 
(Seeman 1959, 1972). Internal control has been variously described as self-directedness, a 
sense of efficacy, instrumentalism or as self-mastery. A sense of control or mastery over life 
is one of the most common indicators of personal coping resources (Thoits 1995). Much of 
the stress and coping research, for example, assumes that a high internal locus of control is 
associated with greater individual competency and an ability to adapt in the face of life 
transitions or difficult situations (Pearlin et al. 1981). Most psychologists and many 
sociologists assume that locus of control is fairly stable over time (Phares 2001), but there is 
disagreement about whether locus of control is a personality characteristic indicative of one’s 
capacity for adaptation or is reflective of one’s position in the social structure (Ross and 
Mirowsky 2002). 

Using Levenson’s (1981) three dimensional LOC model, we examine the locus of control 
of a unique sample of steelworkers who were forced into unemployment. Levenson posits 
three dimensions of control. The first dimension, internal control, measures the extent people 
believe they have control over their own lives. The second measures the recognition of 
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powerful others in one’s life. The third measures perceptions of chance. In this analysis we 
test Levenson’s model on data collected in 2002 and 2003 from steelworkers who 
experienced an 18-month period of heightened stress and employment ambiguity. The 
steelworkers were employed at a large publicly owned steel company that experienced 
severe financial difficulty as a result of global competition and poor management. The 
company filed for bankruptcy in 2001 and lay-offs resulted soon thereafter. For a full year, 
between November 2001 and November 2002, workers waited while management 
attempted to secure additional financing to re-open the plant. In the meantime, the 
steelworkers lived with a high degree of ambiguity. If the plant re-opened, they might get their 
jobs back. But if the plant was not going to re-open, they needed to begin looking for long- 
term employment elsewhere. We did not collect data prior to the lay-off, and therefore, do not 
have before-and-after measures of locus of control. However, the first data collection 
occurred about seven months after the first lay-offs (June and July 2002); the second data 
collection occurred 12 months later (May and June 2003). By the second data collection, 
some workers had found new jobs or enrolled in retraining programs. Others had retired, and 
a significant number were still unemployed. 

The results are not generalizable to a broader population, but the analysis serves as a case 
study for testing long standing assumptions about the nature of locus of control and its 
relationship to re-employment. Interviews with a sample of those who responded to the first 
survey offer insight into just how difficult adjusting to the plant closure was for many. One 
respondent describes the situation and suggests the early death of one of his friends resulted 
from the stress and ambiguity caused by the lay-off. 


Even after the lay-off came you had from January to roughly November. 
The rumors [were] flying. Who's gonna do this? Who was gonna come 
in and maybe buy the plant? Who was maybe going to come in and 
loan money to start It up? That's why | say | think my friend that just 
passed away, with all of this, he was one that really worked 
conscientiously, but he also worried about his family all the time. He 
wanted to provide for them and he didn't want anybody helping him. 
And | really think that with all of this talk that kept going on with 
reopening, and there was some other stuff that floated around. And 
managers out there [were] supposedly sending out information to 
people and all of a sudden somebody would get all uplifted. Man 
there's somebody gonna come in and do something. We're gonna go 
back to work. And then two weeks later they say no, that wasn't even 
true... And I’m sure it’s taking a toll on a lot of others. 
— Male, age 42, employed in the steel industry for 19 years 


Adding to worker stress, many in the community were pleased to see the steel plant close 
because they believed the plant was an eyesore and source of pollution. Job lay-offs in the 
high tech industry had occurred in the same community a year earlier, which also lessened 
the community's concern for the economic well-being of the laid-off steelworkers. The vast 
majority of steelworkers were union members, but many of those who lived in the area were 
strongly anti-union (Utah is a right-to-work state). Steelworkers complained that state and 
community leaders had not been very supportive. In addition, more than half of the union 
members expressed indifference or dissatisfaction with union leadership. 

What might we expect to find in an analysis of locus of control in a population of 
steelworkers who suddenly lose their jobs as a result of company bankruptcy and plant 
closing? These are men who have a stable employment record, for whom working hard and 
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providing for their families is central to their identity; they face unemployment as a result of 
economic change and management failure. They are bound to their communities and face a 
tight labor market with few jobs that require the skills they learned making steel. One might 
expect some impact on locus of control in these circumstances. Using structural equation 
modeling, we first test the reliability of a multidimensional LOC measurement model and then 
test for change over time. In a second analysis, we examine patterns of change between 
various Categories of steelworkers based on their situation at time two: the employed, those 
with temporary or part-time employment, students (many workers qualified for re-training 
through the federal Trade Adjustment Assistance program), and the retired. 


Internal-External Locus of Control 


Internal-external control was initially hypothesized to be of major significance in the nature of 
learning processes across different kinds of situations. Accordingly, persons differed in their 
generalized expectancies of whether a reward or reinforcement was contingent on his or her 
behavior. External control was characterized by action “not entirely contingent” upon a 
person; rather “perceived as the result of luck, chance, fate, or under the control of powerful 
others, or as unpredictable because of the great complexity” of surrounding forces. Internal 
control was characterized by the perception that action is contingent upon one’s own 
behavior, the result of relatively permanent individual characteristics (Rotter 1966: 1). 

Pearlin et al. (1981:340) argued that mastery was a central dimension of the self-concept. 
Mastery was defined as the extent to which people “see themselves as being in control of the 
forces that importantly affect their lives.” Developing their own measure of personal control, 
Mirowsky and Ross (1991) have demonstrated that employment and household labor (Ross 
and Mirowsky 1992, 1996) as well as age and gender (Lewis, Ross and Mirowsky 1999; 
Mirowsky 1995; Ross and Mirowsky 2002) influence personal control. Personal control, they 
argue, “reflects the real constraints and opportunities of one’s ascribed and achieved 
statuses.” They find that personal control “declines with a heavy burden of domestic work, 
economic hardship, job loss, and poor health and physical impairment” (Ross and Mirowsky 
2002:127). According to their research, women, minorities, the working class and the 
unemployed report lower levels of personal control. 

In a recent study of working class men in the United States, Lamont (2000) describes an 
inherent connection between definitions of morality and expectations of self-control, self- 
mastery and discipline. For working class men who face economic insecurity, “being hard 
working and responsible is a means of ensuring a predictable environment for oneself and 
others.” (Lamont 2000:20) Consequently, Lamont argues that working class men in particular, 
“find their self-worth in their ability to discipline themselves and conduct responsible yet 
caring lives to ensure order for themselves and others.” (2000:2-3) 


Job Loss, Unemployment and Locus of Control 


Research on plant closings and job lay-offs has demonstrated a number of negative effects 
on individuals and their families, including increased conflict and tension, a decline in marital 
satisfaction, and strained parent-child relationships (Friedemann and Webb 1995; Moen, Kain 
and Elder 1983; Perrucci 1994). Job loss is associated with a decline in self-esteem and 
changes in the self-concept of workers (Avison 2001). Studies on unemployment indicate that 
displaced workers are at greater risk of depression and suicide as well as other psychological 
disorders (Wasserman 1984). 
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Research examining locus of control and unemployment has sometimes produced 
inconsistent results, as noted by Goldsmith, Veun and Darity (1995). Nonetheless, their own 
analysis of data from the National Longitudinal Survey of Youth suggests increasing levels of 
external control and feelings of helplessness among both young men and women 
experiencing extended periods of unemployment. Ross and Mirowsky (1992) found that 
employed persons reported a greater sense of control than the unemployed, and individuals 
who had had a history of interrupted employment experienced accelerated declines in 
personal control over the life course. However, they also argue that these effects are primarily 
the result of differences in social background — education, minority status, age and gender. 

Gianakos (2002) found an association between internal locus of control and an increased 
likelihood of help-seeking behavior and positive thinking among employed students from 
working-class backgrounds. They also concluded that high external control increased the 
likelihood of avoidance and resignation as well as the use of alcohol as a coping mechanism. 
In a study of unemployment and re-employment, Waters and Moore (2002) found that the 
continuously unemployed and the re-employed were more likely to report belief in powerful 
others than were the continuously employed. In addition, the continuously unemployed 
reported slightly lower internal control beliefs. They concluded that high internal control 
influenced re-employment and lack of internal control contributed to continuing 
unemployment. However, they also caution that “positive psychological health may do little to 
assist re-employment if an individual is attempting to find a job in occupational areas that are 
saturated or on the decline.” (Waters and Moore 2002:602) 


Sample 


The steelworkers in our study were employed by a steel company operating in Vineyard, 
Utah. The plant was first constructed during World War Il and was owned and operated by 
U. S. Steel for many years. U.S. Steel decided to close the plant in the 1980s, but in 1987 a 
group of investors re-opened the plant, noting the importance of manufacturing jobs for the 
economic health of the community. The changing economy of Utah Valley since the 1970s 
had presented severe obstacles to re-employment for the steelworkers. Although once 
dominated by manufacturing, agriculture and higher education (one large private university 
and a small community college), when the plant closed in 2001, the local economy had 
changed dramatically to high technology, marketing and service (call centers) and 
information management, and higher education (one large private university and a rapidly 
growing state college). 

With the support of the local steelworkers union, researchers mailed each of the displaced 
union workers an 18-page survey intended to collect personal, financial and attitudinal 
information to assess levels of distress following the lay-off. The data were collected in two 
waves. Although more than 600 individuals responded in each wave, only 306 of those who 
identified themselves in the first survey also responded to the second survey. This analysis is 
based on panel data from 293 men who responded to both surveys. We found only slight 
differences between the panel respondents and those who responded to only one survey. 
Three quarters of the panel respondents were older than 45, and 27 percent were older than 
55. Many workers were hoping the plant would re-open long enough for them to retire with full 
benefits. Median length of employment at the plant was 24 years. Twenty-five percent of the 
respondents had worked at the plant for 30 or more years. Sixty-two percent had completed at 
least one year of college although only 6 percent had a university degree. Their adjusted 
median annual income was $40,800, which was 89 percent of the 1999 median household 
income for residents of Utah. All but seven respondents reported their race as “white.” In terms 
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of marital status, 88 percent were married and 9 percent divorced or separated. Like other Utah 
residents, the median number of persons per household was three. 

The survey included an abbreviated version of Levenson’s (1981) locus of control scale, 
which appeared midway through the questionnaire following questions about employment, 
decisions to retrain, satisfaction with the response of union and community leaders, ties to 
the community, marital satisfaction and financial situation. This brief version of Levenson’s 
scale consisted of 10 of the original 28 scale items that form three subscales. The first, 
“powerful others,” is characterized by a “belief in the basic order and predictability of the 
world, coupled with the expectancy that powerful others are in control,” and the second, 
“chance” is characterized by a “belief in the basic unordered and random nature of the world.” 
(Levenson 1981:15)' The importance of the distinction, according to Levenson, is that belief 
in powerful others assumes sufficient regularity in the social order that a person can still 
“obtain reinforcements through purposeful action.” (1981:15) Levenson cautions us not to 
think of the three scales as a tool for creating typologies of individuals. Instead, individuals are 
not either internally or externally oriented, but can-score high or low on all three scales; “that 
is, a person could say he or she was personally in control yet also say that life is a random 
series of events controlled by powerful others.” (1981:18; 1974) 

Given the average education level of the population and the heightened level of stress 
following the plant closure, we were forced to keep the length of the questionnaire to a 
minimum. Furthermore, the self report format of the survey necessitated an abbreviated 
version of the scales in order to encourage survey completion. Previous analyses using 
Levenson's locus of control scale have resulted in mixed support for the use of an abbreviated 
version of the scale. Although Lumpkin (1988) concludes that brief scales derived from both 
Rotter’s I-E scale and Levenson’s multidimensional scale may be very unstable among certain 
populations, Sapp and Harrod (1993) argue that a brief Levenson scale can indeed provide a 
reliable and valid alternative to the full scale.? Because we have data from two time points, we 
are able to test the reliability of the measurement model over time. 


Analysis 


We conduct two analyses. We begin with a confirmatory factor analysis using AMOS 5.0 
(Arbuckle 2003). Then using a series of t-tests, we explore whether the patterns of change 
are similar across various groups of respondents. The path diagram (see Figure 1) represents 
the relationship between the endogenous and unobserved exogenous variables.’ A basic 
model is repeated to create a longitudinal model. The full model consists of 20 indicators 
(scale items), six first order latent variables and two second order latent variables. Each 
observed item was measured on a five-point Likert scale, one indicating “strongly disagree” 
and five “strongly agree.” The first three latent variables and associated scales items are 
constructed using time one, 2002 data. The second three latent variables and associated 
scale items are constructed using time two, 2003 data. 

We first test whether the theorized longitudinal model of locus of control fits the data. 
Then we test whether the measurement model is consistent over time; in other words, that 
the latent constructs and observed variables are stable. We then test whether there are 
significant differences in the steelworkers’ sense of internal and external control over time. 
The best way to test the hypothesis that the parameters are stable is through a hierarchy of 
invariance, where parameters are constrained in an ordered sequence and compared to the 
previous model. If the parameter values are stable, then there should be no significant 
difference in fit between the constrained model being tested and the comparison model. We 
form the following hierarchy of models (see Table 1): the baseline mode/ (Test 1, Model A) 
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with first and second order latent means constrained to zero, the equal loadings model (Test 
1, Model B) with the factor loadings restricted across time and first and second order latent 
means set to zero, and the equal loadings, variance, and covariance model (Test 1, Model C) 
with both the factor loadings and the variances and covariances of the latent variables 
restricted. 

To test whether there is a change in perceived control over time we examine a hierarchy 
of invariance similar to the method we used to establish the fit of the model. Rather than 
comparing the means to the baseline model, we create a free first order latent means 
model (Test 2, Model A) where we set the first intercept of each set of observed variables 
to zero and all other observed intercepts are set equal across time in order to freely 
estimate the latent means. By setting the first order latent variable means equal from 2002 
to 2003 (equal first order latent means model) we can test whether there is a significant 
change in the steelworkers’ perception of control during the year following the lay-off (Test 
2, Model B). 

Because this test only tells us that there has been a significant change overall, we must 
also use a forward inclusion approach in which we test each pair of means separately. In each 
model we add an additional constraint to the “ree first order latent means model by setting the 
time one and time two latent means of each LOC dimension equal (one at a time) while 
allowing the other LOC dimensions’ latent means to be freely estimated. By comparing each 
model to the free first order latent means model we can test for significant changes in latent 
means from time one to time two (Test 2, Models C through E). 

Finally, in order to test for a change in overall LOC over time, we conduct one more test. 
We test whether a model that constrains the second order latent means to be equal across 
time is a better fit of the data than a model that allows the second order latent means (an 
overall LOC score) to be freely estimated (Test 3, Models A and B). If, (1) there is no significant 
difference between the equa/ second order latent means model and the free second order 
latent means model, and (2) the forward inclusion tests of each pair of means separately 
demonstrate change in any one specific locus of control dimension, we must conclude that it 
is important to attend to the multidimensionality of locus of control. 

The second analysis utilizes a series of t-tests. Although structural equation modeling 
allows for group analysis, when we group respondents based on employment status the 
degrees of freedom are severely constrained due to small sample size. So much so that we 
are unable to adequately perform the necessary tests. In order to test for different patterns of 
change in LOC based on employment status at time two, we compute an individual average 
score for each subscale. We then conduct a paired t-test (time one and time two) for each of 
the three LOC dimensions. 


Results 


Model Fit 


The basic longitudinal model produces a chi-square of 316.28 (degrees of freedom = 153, p- 
value <.001). To preserve cases with missing data, we use maximum-likelihood estimation (N 
= 293).° Although the significant chi-square value indicates a poor fit, both the estimated 
comparative fit index (CFI) and root mean square error of approximation (RMSEA) values (.91 
and .06 respectively) indicate an acceptable fit.* The unstandardized coefficients produced in 
the model that allows the first order latent means to be freely estimated (Test 2, Model A) are 
presented in Figure 1.’ 
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Figure 1. Unstandardized Estimates for Free First Order Latent Means Model (Test 2, Model A) 
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Test of Parameter Invariance Over Time 


Table 1 displays the results of the first analysis. In test one we find no statistically significant 
decline in fit between the baseline and the two subsequent constrained models; in other 
words, the same constructs are being measured in 2002 and 2003 and the relationships 
between the latent variables are stable across time. Because the relationship between the 
locus of control latent variables and the indicators'remains stable, it is safe to test for changes 
in perceptions of control over time.® 


Testing for Change in Locus of Control 


The results of the second test (a comparison of models A and B) indicate that there is indeed 
a significant change between respondents’ sense of control from 2002, when the future of 
the plant was unknown, to 2003, when respondents knew that the plant was to be 
permanently closed. However, in test two we do not find a significant difference between 
model A, where the first order latent means are freely estimated, and models C and D, where 
the time one and time two Powerful Others and Chance latent means are set equal. This 
result suggests that estimating a common mean across each year produces a model with as 
good a fit as estimating a model with separate means. This result also indicates that Powerful 
Others and Chance do not change significantly over time.? We do find a significant increase in 
respondents’ sense of internal control, however. Model E, in which the /nterna/ Contro/ means 
are set equal to each other, is significantly different from Model A, in which the first order 
latent means are freely estimated. However, doing so does not improve the fit of the model, 
indicating that estimating a common mean across time is not appropriate. Figure 1 shows the 
means estimated for each locus of control subscale in 2002 and 2003. We find a slight 
increase in Powerful Others (from 3.06 to 3.16), but the increase is not significant as indicated 
by a critical ratio of parameter difference (p = .132). Chance also does not change 
significantly (from an estimated value of 2.89 to 2.84, p = .273). However, /nterna/ Control 
significantly increases over time from a value of 3.63 to 3.94 (p < .001). We must interpret 
these results with caution because only two Items are used. However, the two items load 
consistently over time. 

The result of test three, comparing Models A and B, indicates that there is no significant 
difference (p > .05) between the models; constraining the second order latent means does 
not improve the fit of the model, thus estimating common means across time is sufficient. 
The correlation between the second order latent means (LOC Time 1 and LOC Time 2) is .77, 
suggesting a fairly high association between time one reports and time two reports. Thus, the 
results suggest that respondents’ overall LOC scores do not change significantly over time 
although the other previously mentioned tests indicate that the separate LOC dimensions 
may change over time (i.e., /nterna/ Contro/). 


Group Differences 


Table 2 reports paired t-tests conducted across the various categories of steelworkers based 
on their employment status at time two. At the second data collection, more than half of the 
panel respondents had found some kind of employment, either fulltime, parttime or 
temporary. Twelve percent of the panel was enrolled in government-sponsored retraining 
programs and an equal proportion had retired. Fourteen percent of the panel was still 
unemployed at time two. Examining the total sample, we find a significant change in the 
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Powerful Others and /nternal Contro/ scales (both increase), although the effect is more 
significant for /nternal Contro/. Consistent with the structural equation model, we find no 
difference across time in the Chance scale. The inconsistency between the structural 
equation models and the paired t-test results is primarily a function of the borderline 
significance of both tests. That is, in the structural equation model the chi-square approaches 
significance when we test for a change in Powerful Others scores. In the paired t-tests, the 
difference is significant, but the t-value is not very large. 

We find that those employed (either in full-time jobs or in part-time/temporary jobs) reported 
higher internal control over time. We also find that retired steelworkers and students report 
higher internal control. These results seem to be consistent with previous work that emphasizes 
the importance of a sense of internal control as a coping resource. On the other hand, we find 
an increase in Powerful Others for the unemployed and students. Despite similarity on the 
Powerful Others dimension, the /nterna/ Contro/ scores significantly increase for students, but 
not for the unemployed. Also, surprisingly, belief in Chance declined among the retired 
steelworkers. We offer possible post hoc explanations for these results in the following section. 


Discussion 


The measurement model was sufficiently consistent over the two time periods to test 
whether the first and second order latent means changed over time. The lack of change in the 
second order latent means is suggestive of a construct that does not change greatly over 
time and corresponds to assumptions found in most of the LOC literature. However, the 
differential patterns that appear among the first order latent variables suggest that more is 
going on “beneath the surface.” For example, although students and the unemployed score 
similarly on the Powerful Others dimension, the /nternal Contro/ scores significantly increase 
for students over time, but not for the unemployed. One post hoc explanation: the increase in 
Internal Contro/ is an artifact of both the reference group and the attributions workers make 
about their own and others success. Furthermore, the increase in Powerful Others may be a 
reflection of institutional constraints. The students are workers who qualified for government 
retraining, but did so under very firm guidelines which restricted the type of retraining in which 
they could enroll and limited the retraining period to 24 months. The high Powerful Others 
scores for the unemployed may also be a reflection of the unique situation some of the 
workers found themselves in as they attempted to find work. The same worker who 
previously described the stress of not knowing if the plant would reopen, described his 
experience looking for work: 


I've been laid-off now for over a year. Because | had the [stee/ plant] 
label attached to me when | would apply for a job people would take 
[that] into [account]; “Well, you're just going to go back to [the stee/ 
plant] soon as they start up if they start up.” Nobody wants to look at 
you. People look at you as a [steel plant] employee and they'll say, 
“Well if you have the chance, you're going to go back, plus you're 
making a higher wage.” So they don't want to hire [youj. They don't 
even want to give [you] the opportunity to take on another job. For a 
while, | was very depressed, because everywhere | went to look for a 
Job —| know my steel, | can drive equipment, | can weld, | can do lots 
of things — but every time you went to somebody and you told them 
your prior employer was [the steel plant], they were like, “We don't 
want you.” 
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Many of the men we spoke with expressed that their abilities were not being adequately 
recognized by potential employers. Another worker described his observations about how 
finding a job requires knowing the right people. 


And you know too, | got to think, it's not what you know, it’s who you 
know: the right people for the right job. | got friends that seemed like 
they knew the right people. | wish | knew the right people... instead 
of getting rejections. But that don't happen to everybody, because 
one of my friends, he had to have his son help him, because his son 
is a contractor. He got him a job. 

— Male, age 58, employed in steelwork for 30 years 


Another interesting result that merits discussion is that belief in Chance declined among 
the retired steelworkers. We offer an additional possible post hoc explanation: the 
circumstances of the retirees. Because of the uncertainty of the bankruptcy, the lay-off and 
the possibility of the plant closing, the steel mill could not guarantee the defined contributions 
401(k) plans for the laid-off workers. For more than a year the workers waited in anticipation 
of news that the Pension Benefit Guaranty Corporation (PBGC) would guarantee their 
retirement funds. The president of the local steelworkers union describes this difficult period: 
“We've got a lot of anxious guys sitting here waiting to see what exactly is happening [with 
their pensions].” (Earl 2002) Finally, 13 months after lay-offs began, the PBGC announced that 
they would guarantee 60 percent of the pensions. The time one data collection was during 
this period of waiting. By time two, workers knew what they could plan for. Many of the 
workers who feared the loss of their pension were then able to take more control of their lives 
and begin to plan for the future. Again, it would appear that responses on the locus of control 
items are to some extent related to the amount of institutional constraints the respondents 
were experiencing. 

Researchers interested in the relationship between LOC and unemployment or between 
LOC and responses to stressful life events may want to pay more attention to the balance 
between the various dimensions. Note that the item means for the various scales differ. The 
mean score for /nterna/ Contro/ is consistently higher overall and within groups than the mean 
scores for Powerful Others and Chance — both at time one and time two. Furthermore, 
according to the structural equation analysis, even though internal control changed over time, 
the overall measure of LOC did not. Most interesting, we found no change in internal control 
for the unemployed, although a sense of powerful others did change. Could it be that the 
increased /nternal Contro/ values found among the various groups are the resu/t of having 
found a new job or enrolled in school and not the cause? 

Most scholars assume that locus of control is fairly stable over the life course, a personal 
characteristic that orients some towards greater achievement and better adaptation. Again, 
the lack of change in the second order latent means would support this assumption. 
However, we find sufficient variation in the pattern of scores across the various subscales to 
wonder whether responses to the LOC items are, at least in part, a reflection of the real 
constraints and opportunities an individual experiences. 

LOC has been utilized in both the stress and coping literature and the job loss and 
unemployment literature. The research generally assumes that LOC is a resource for coping or 
eventually finding a new job. However, these results raise questions about the direction of the 
causal relationship. Mirowsky and Ross (1991) argue that bias introduced by defensiveness 
and agreement bias may complicate LOC results. Based on these results, we would suggest 
the need to explore the reference group tapped by respondents when they assess their 
response. All workers lost their jobs. Some found work; others did not. A steelworker who is 
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employed, enrolled in school or able to retire evaluates his success relative to others. Since 
he is more successful compared to others, his successes can be attributed to his own planning 
and initiative. He has a job; he must have planned better than those who do not have a job. A 
steelworker who is unemployed evaluates himself relative to others as well. He has tried to find 
work, but has not; the difference must be because of powerful others. He does not attribute 
his lack of employment to anything he has done — certainly he must have been as diligent as 
others. Responses on both the Powerful Others and Chance scales may be a function of 
attributions made, but may also reflect real institutional constraints. Not only would this 
conclusion argue against the assumption of a stable personality characteristic, we must also 
question the causal direction of the relationship often found between LOC and unemployment 
and between LOC and coping strategies. 


Conclusions 


The findings raise a number of questions about the assumptions being made about locus of 
control and its relationship to unemployment, stress and coping strategies. As March and 
Olsen (1984:734) noted two decades ago, the “social, political, and economic institutions” of 
modernity “have become larger, considerably more complex and resourceful, and prima facie 
more important to collective life.” When we examine aggregate data following a plant closure, 
changes in LOC may appear minimal. But when we pay attention to the multidimensionality 
of LOC and respondents’ ability to find work, we find sufficient evidence of variation by life 
circumstance to encourage another look at perceptions of control among those forced into 
unemployment. The Ross and Mirowsky research has demonstrated the usefulness of 
examining how one’s personal sense of control changes over the life course or is subject to 
variation by class, race, gender and age. In addition, scholars may want to pursue analyses 
that take into account institutional constraints as well; for example, the consequences of plant 
closures, global economic trends and the loss of manufacturing jobs. 

The results also call for more extensive longitudinal research and analysis based on panel 
data. This study provides data at only two points in time beginning in a heightened period of 
stress. While the overall LOC score indicates no change, we find sufficient change in the first 
order latent variables to call for more research that uses a multidimensional measurement. 
More importantly, the different patterns of change among the various categories of workers 
raises questions about the causal direction often specified in stress research. Perhaps people 
who are coping well with a disaster or life stress are more likely to report a greater sense of 
internal control. If so, internal control is not a resource; rather it may be another indicator of 
successful coping, or given institutional constraints, access to new opportunities. 

We find some evidence that life-altering events affect a person's sense of control. Those 
tempted to assume that locus of control is a stable personality trait predicting greater capacity 
for adaptation and coping may need to reconsider whether life transitions and stressful life 
events affect locus of control. Certainly these findings suggest the need for more research 
in which locus of control is the dependent, not an independent, variable. Furthermore, we 
need more theorizing about what is being measured — a generalized expectancy that affects 
social learning, an indicator of the felt reality of institutional constraints, or-an attribution about 
one’s own ability to adjust compared to others in the same situation. 

Static models of the impact of LOC on unemployment or re-employment are inadequate. 
Among other things, research must take into account whether jobs are available, the 
employment history of the individual, and the impact of long periods of unemployment on a 
person's sense of control. Extended periods of unemployment may increase one’s sense of 
powerlessness. 
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These steelworkers, all unemployed because one company went bankrupt, may be 


unique, and thus the findings are not generalizable. But what happened to them is indicative 
of what other skilled or semi-skilled workers with a long record of regular employment might 
experience. We may find similar results in other cases where employees suddenly lose their 
jobs in a global economy. 


Notes 


ts 


Powertul Others has four observed indicators: “| feel like what happens in my life is mostly 
determined by powerful others;” “Although | might have good ability, | will not be given 
leadership responsibility without appealing to those in positions of power;” “My life is 
chiefly controlled by powerful others;” and “Getting what | want requires pleasing those 
people above me.” Chance has four observed indicators as well: “When | get what | want 
it is usually because | am lucky;” “I have often found that what is going to happen will 
happen;” “It’s not always wise for me to make plans too far ahead because many things 
turn out to be a matter of good or bad fortune;” and “Whether or not | become a leader 
depends on if | am going to be in the right place at the right time.” /nterna/ Contro/ has two 
observed indicators: “When | make plans | am almost certain to make them work;” and 
“| pretty much determine what will happen in my life.” 


Alpha reliability coefficients for the Powerful Others (.79 and .73) and Chance (.71 and .69) 
scales for time one and time two were consistent with alpha scores found in other 
analyses using the full 28 items (see Waters and Moore 2002). The /nterna/ Contro/ alpha 
scores (.43 at both time points) are extremely low, partly because the scale consists of 
only two items. We began with three internal control indicators, but the third indicator did 
not load with the other two items so we dropped it from the model. The item read, “How 
many friends | have depends on how nice a person | am.” We must exercise caution in 
interpreting conclusions regarding the /nterna/ Contro/ construct due to its low reliability. 
However, we will show the confirmatory analysis suggests a high degree of consistency 
in model measurement of /nterna/ Contro/ over time. (See Table 1.) 


The correlation matrix and other test statistics are available from the authors upon request. 


Using AMOS, latent variables are represented by ovals, the observed variables are 
represented by boxes, and error terms are represented by circles. Straight arrows 
represent a causal relationship between the two variables and the double headed arrows 
indicate covariance between two variables. (See Figure 1.) 


We also estimated the models with listwise deletion of missing values so that the results 
could be compared for extreme discrepancies (N = 270). Because the results from both 
analyses were comparable, only the results from the maximum-likelihood estimated 
model are reported. 


Chi-square estimates are particularly sensitive to sample size and the distribution of the 
observed variables. The CFI compares the hypothesized model with the independent 
model and also takes sample size into account (Kline 1998). The RMSEA considers the 
complexity of the model in that it is sensitive to the number of estimated parameters 
included. A CFI value above .90 (see Kline 1998: 131) and an RSMEA value less than .08 
(Byrne 2001: 83-85) indicate an acceptable fit. 
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7. In longitudinal models, examining unstandardized coefficients is preferable to examining 
standardized coefficients because the distribution of standardized measures sometimes 
shifts over time. 


8. This result also suggests that despite the low alpha reliability of the /nterna/ Control! scale, 
the measure is consistent from time one to time two. 


9. If we remove constraints on the observed variables and assume that each first order latent 
variable is only a function of the observed variables and their factor loadings, we find a 
significant change in Powerful Others, although the difference is not large and the model 
fit is much reduced. 
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A Common Explanation for the Changing Age Distributions 
of Suicide and Homicide in the United States, 1930 to 2000 


Robert M. O’Brien, University of Oregon 
Jean Stockard, University of Oregon 





Abstract 

A longstanding debate focuses on whether suicide and homicide rates walk hand in hand 
or whether they are reciprocally related. Much of the research on this issue investigates 
whether suicide or homicide predominates in certain geographic areas or whether they 
trend together over time. We theorize that the degree of social integration and social 
regulation associated with birth cohorts is negatively related to both of these forms of lethal 
violence. We develop a common explanation for shifts in the age distributions of homicide 
and suicide in the United States from 1930 to 2000. In this context, suicide rates and 
homicide rates walk hand in hand and their parallel movements are associated with two 
variables linked to social integration and regulation. 


Research on suicide and homicide often appears in different venues and within different 
scholarly traditions. Students and researchers of public health are likely to read Suicide and 
Life Threatening Behavior while sociologists and criminologists interested in homicide are 
more likely to read the journal Homicide Studies. The normative response to suicides and 
suicide attempts involves mental health treatment while the normative response to homicide 
involves the criminal justice system. Geographically, rates of suicide tend to be higher in 
northern states, and rates of homicide tend to be higher in southern states. Black Americans 
exhibit lower suicide rates than whites, while the rates of homicide deaths for blacks exceed 
those for whites. Further, the age patterns of homicide and suicide differ — homicide offending 
and deaths typically peak in the 20- to-29-year-old age range with a strong decrease in rates 
after that peak. Suicides in the United States, in contrast, historically reached their highest 
levels among those more than 60 years of age. 

Figure 1a illustrates the shifts in the age distributions of homicide and suicide that occurred 
over the past 70 years, as well as some of the persistent features of these distributions. 
Throughout this period homicide deaths were concentrated in younger age groups. The age 
distribution for homicides, however, was relatively more concentrated at younger ages in 2000 
than it was in 1930 or in 1965. The shift in the age distribution of suicides from 1930 to 2000 
has been dramatic. In 1930 suicide increased almost monotonically with age and the increases 
in each succeeding age group were generally large. In 1965 suicides increased monotonically 
until ages 55-59, but the increase in each succeeding age group was less strong than in 1930. 
Then the suicide rate fell until ages 70-74. By 2000 this pattern of increase was much weaker: 
the suicide rate rose quickly from the age groups 10-14 to 20-24, increased slightly for ages 25- 
29, and then dropped slightly. It reached a peak in the age group 40-44 that was not exceeded 
until ages 75-79. 


Direct correspondence to: Robert M. O'Brien: Department of Sociology; 1291 University of Oregon; 
Eugene, Oregon 97403-1291. E-mail: bobrien@uoregon.edu. Jean Stockard: Department of 
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Figure 1b. Homicide and Suicide Residuals after Controlling for Age Group 
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Despite differences in the relative rates of homicide and suicide based on factors such as 
race, age and geographic location, much discussion and theorizing posit a connection between 
suicides and homicides. These discussions range from the writings of theologians and 
philosophers as early as St. Augustine (c. 426 A.D.) and in academic debates from the writings 
of the moral statisticians in the early 19" century to the work of Freud and his followers in the 
early years of the 20" century and to American sociologists in the mid-20" century. 

Much work suggests that suicide and homicide are different expressions of the same 
phenomenon and that similar causal factors influence the total level of lethal violence within 
a society, although different variables may influence which type of violence is more common. 
Early explanatory variables included biological characteristics cited by the moral statisticians; 
forces of aggression cited by the Freudians; and levels of frustration and strength of the 
superego cited by followers of Freud. More recent sociological work often builds implicitly 
upon the neo-Freudian assumption that suicide and homicide are related to frustration, but 
translates this understanding into a macro-level analysis of rates (Unnithan et al., 1994). 

Dating from at least the 19" century, however, were claims that suicide and homicide are 
negatively related (Ferri 1883-84; Morselli 1882 [1879]). The two forms of lethal violence 
(against the self or against others) were seen as substitutable for one another. Ferri and 
Morselli argued that this could be seen throughout Europe where homicide rates tended to 
be higher in the southern areas and suicide rates higher in northern areas. The pattern of 
higher homicide rates in the South and higher suicide rates in the North (east) is still evident 
in the United States. Ferri and Morselli also cited time series data for countries in which 
suicide rates and homicide rates trended in opposite directions. Many years later Henry and 
Short (1954) found that the correlation between suicide rates and the business cycle was 
negative while the correlation between homicide rates and the business cycle was positive. A 
modern literature attempts to explain the inverse relationship between homicide and suicide 
rates (Unnithan et al 1994). 

Our analysis examines whether changes in suicide and homicide rates can be explained 
by common underlying factors and departs from this earlier work in several important 
respects. First, instead of focusing on total rates of lethal violence, we examine age-specific 
rates, thus allowing us to account for changes in the age distribution of lethal violence. 
Second, to explain changes in the age distribution of lethal violence, we rely on cohort theory 
and research. Third, rather than using the Freudian and neo-Freudian notions of aggression 
and frustration, we build on Durkheimian notions of control through social integration and 
regulation. Durkheimian approaches of various types are widely used by researchers studying 
both homicides and suicides, and provide an excellent vehicle for uniting the study of these 
phenomena. 


The Durkheimian Perspective 


While sociologists are familiar with Durkheim's analysis of suicide, his comments regarding 
homicide are less well known, in large part because they were made almost in passing in 
Suicide. Nevertheless, scholars of both suicide and homicide have drawn upon the 
Durkheimian tradition. 

In Suicide, Durkheim delineated four types of suicide, all revolving around the central 
concepts of social integration and regulation. He suggested that too much integration or too 
much regulation can lead to altruistic or fatalistic suicide, respectively. Too little regulation is 
related to anomic suicide, while too little integration is related to egoistic suicide. Durkheim 
posited that in modern society the problem is rarely too much integration (altruistic suicide) or 
too much regulation (fatalistic suicide), but instead, too little integration and too little regulation. 
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Many researchers and theorists who examine Durkheim's typology of suicides have difficulty 
differentiating social integration and regulation. Johnson (1965) posits that integration is 
necessary for regulation, and therefore, there is only one cause of suicide.’ Gibbs and Martin 
(1964) argue that integration is the concept most central to Durkheim's theory of suicide. 
Bearman (1991) states that integration and regulation “walk hand in hand” — meaning that 
persons who are strongly integrated into society (family, religious groups, work groups) tend to 
be socially regulated, and those who are not integrated into society tend not to be socially 
regulated. We follow this tradition by emphasizing the tie between social regulation and social 
integration and the rates of suicide, as well as homicide, in modern societies. 

Although Durkheim theorized that the lack of social integration in modern societies 
increases the rate of egoistic suicide, he maintained that this same condition is not conducive 
to homicides: 


“Egoistic suicide and homicide, therefore, spring from antagonistic 
causes, and consequently it is impossible for the one to develop 
readily where the other flourishes.” (Durkheim 1966 [1897], p. 356) 


He argues that egoistic suicide is: 


“characterized by a state of depression and apathy produced by 
exaggerated individuation. The individual no longer cares to live 
because he no longer cares enough for the only medium which 
attaches him to reality, that is to say, for society... Homicide depends 
on opposite conditions. It is a violent act inseparable trom passion. 
Now, whenever society Is integrated . . . the intensity of collective 
states of conscience raises the general level of the life of passions.” 
(p. 356) 


Durkheim used this argument to explain data indicating that where “homicide is very 
~ common, it confers a sort of immunity against suicide” (p. 351). On the other hand, Durkheim 
suggested that a lack of social regulation (anomy) fosters both increased homicide and 
Suicide (anomic suicide): 


“T]here exists today, especially in great centers and regions of intense 
civilization, a certain parallelism between the development of 
homicide and that of suicide. It is because anomy Is in an acute state 
there.” (Durkheim 1966 [1897]: 358) 


‘We do not share Durkheim's view regarding the divergent effects of egoism on suicides 
and homicides, and we are not alone. Others hypothesize that the lack of social integration 
(egoism) is positively associated with homicides and other violent crimes. One of the major 
theoretical traditions within criminology (control theory) cites Durkheimian roots and implies 
that lower levels of integration are related to higher rates of deviant behavior, including 
homicide. Advocates of control theory argue that low levels of integration produce ineffective 
internal and external social controls, which, in turn, promote deviant behaviors (e.g., Hirschi 
1969; Gottfredson and Hirschi 1990). Hirschi (1969: 3) in the Causes of Delinquency states: 
“Durkheim's theory is one of the purest examples of control theory.” In addition, several 
authors (see, Hagan and McCarthy 1997) focus on the ways in which social integration 
promotes the formation of social capital, which in turn deters the likelihood of deviant and 


criminal behavior. 
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The connection between lack of regulation and deviance/homicide, which Durkheim does 
not dispute, has a long tradition in sociology. Merton's classic presentation of strain theory, 
“Social Structure and Anomie,” (Merton 1938) explicitly cites Durkheim. Merton cites a lack of 
fit in the United States between means of achieving socially valued goals and the inculcation 
of those goals. In Durkheimian terms, society has failed to regulate the desires and 
expectations of its members. 


Cohort Theory and the Analysis of Lethal Violence 


A growing literature on cohort explanations of lethal violence emphasizes three major tenets 
of cohort theory: 


(1) The “life stage principle” (Elder 1974): suggesting that the 
experiences of one cohort differ from those of another because they 
experience different historical events at different ages or 
developmental periods. 


(2) The “lasting effects principle” (O’Brien, Stockard, and /saacson 
1999): positing that certain events or conditions, such as the size of 
one’s birth cohort, can produce lasting changes in the attitudes and 
behaviors of cohort members. 


(3) The “confounding principle”: pointing to a methodological and 
conceptual problem that occurs because knowing the period and 
age group determines cohort membership. 


In each period a particular age group corresponds to a particular birth cohort and in the 
next period the same age group corresponds to a different birth cohort. This suggests that 
changes in the age distributions of homicide and suicide may be related to the propensities 
of different cohorts for lethal violence. This is plausible, but in examining the relationship of 
cohorts to shifts in age distributions across periods, we are confronted with the “confounding 
principle” noted above. Recent cohort-based explanations of changing rates of lethal violence 
address this problem, as well as the “life stage” and “lasting effects” principles, by utilizing 
Age-Period-Cohort-Characteristic (APCC) models that incorporate strong controls for age and 
period and use cohort-based measures of factors that theoretically generate social integration 
and regulation (O'Brien, Stockard and Isaacson 1999; Savolainen 2000). The cohort 
characteristics used are tied to each cohort and their effects on a cohort’s propensity to lethal 
violence are measured across the lifespan of the cohorts. Research provides strong evidence 
that cohort characteristics are related to age-period-specific rates of suicide deaths, both 
within the United States and cross-culturally (Stockard and O'Brien 2002a; 2002b), and 
homicide deaths and offenses within the United States (O’Brien, Stockard and Isaacson 1999: 
O'Brien and Stockard 2002; Savolainen 2000). 

We recognize, as Durkheim did, that marriage, having children, being in a nation at war, 
etc. are not direct measures of social integration and regulation. Instead they are conditions 
that may help generate social integration and regulation. One of the two generators of cohort- 
based social integration and regulation that we examine involves the childhood family. 
structure of the cohort. Durkheim's suggestion that higher rates of disrupted family structures 
are associated with higher rates of suicide (both egoistic and anomic suicides) is supported 
by a large modern literature (e.g., Breault 1986; Danigelis and Pope 1979; Stockard and 
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O'Brien 2002a, 2002b). A similarly large literature documents the relationship of disrupted 
family patterns to higher levels of deviant and criminal behavior, including homicide. This 
literature highlights the integrating and regulating effects of the family in terms of monitoring 
and supervision (Gottfredson and Hirschi 1990), insulation from peer group influence (Warr 
2002), and providing needed financial resources for children (McLanahan and Sandefur 1994). 

A second cohort characteristic that generates cohort-based social integration and 
regulation is the relative size of the cohort. Some explanations for the effect of relative cohort 
size focus on the economic disadvantages that face larger cohorts when they enter a job 
market crowded with a large number of people seeking entry-level employment (Easterlin 
1987). This is followed by delays in marriage and family building, two prominent indicators 
associated with integration in the Durkheimian tradition. Other explanations emphasize the 
relationship between increased cohort size and the overloading of the institutions that help 
produce social integration and regulation — families, schools, organized youth groups (e.g., 
O’Brien, Stockard and Isaacson 1999; Stockard and O’Brien 2002a) or emphasize the 
increased influence of peer groups (Warr 2002) — with their distinctive youth culture that is 
insulated from the influence of older generations. 

In part, cohort effects reflect the aggregation of individual effects — the result of more 
children growing up in larger or single-parent families. Many members of a birth cohort, 
however, are affected by these characteristics, no matter the size or composition of their 
families. Using notions of social capital and social closure (Coleman, 1990), we suggest that 
when cohorts are relatively large, all children, not just those from larger families, experience 
the reduced ratio of adults to children and lower levels of adult resources within school and 
community groups. When cohorts have more single-parent families, the networks that tie a 
teacher to a parent or a parent to one or more parents tend to have less closure and peers 
become a more important influence on all children, regardless of their family backgrounds. 

Factors not specifically tied to cohorts also influence rates of lethal violence. For example, 
factors associated with different periods affect both suicide and homicide rates. These 
factors can range from improved emergency responses and emergency care room facilities 
to the effects of unemployment and recession. Factors associated with age such as church 
attendance, marrying and parenting are also strongly associated with both homicide and 
suicide rates. This calls for a method that includes factors directly tied to age and period, and 
the Age-Period-Cohort-Characteristic model does this by including dummy variables for 
periods and age-groups. 

While the differences have not been especially large in magnitude, some studies find 
different patterns of influence of cohort characteristics for race-sex-specific groups, especially 
involving relative cohort size. For example, relative cohort size appears to have a stronger 
influence on male suicide rates than female rates, but the pattern is reversed (although 
smaller in magnitude) for homicide rates. A stronger difference appears across the race 
groups with relative cohort size significantly influencing suicide rates for only whites and 
homicide rates for only non-whites (O'Brien and Stockard 2002; Stockard and O’Brien 2002a). 


Summary and Hypotheses 


The Durkheimian tradition and previous literature on cohort effects suggest that cohort- 
related factors generate social integration and regulation, specifically family structure and 
relative cohort size. These factors are related in similar ways to changes in the age-period- 
specific rates of suicide deaths and homicide deaths. Specifically, we suggest that decreases 
in integration and regulation for cohorts are associated with increased rates of both suicide and 
homicide for those cohorts (controlling for other relevant variables). We test this by examining 
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two factors that generate lower levels of social integration and regulation within cohorts. This 
is not a hypothesis about tota/ rates of homicide or suicide in societies, but explicitly refers to 
the rates of cohorts. 

Most researchers investigate homicide and suicide separately. Certainly, the age 
distributions of these two phenomena (Figure 1a) are strikingly different. In addition, rates of 
homicide and suicide are strikingly dissimilar across different segments of the population, with 
homicides more common among non-whites (asa group) and suicides more common among 
whites, and both phenomena more common among men than women. In this paper, we 
examine the association of shifts in the age-period-specific rates of homicide and suicide for 
the total population and race-sex specific subgroups. Because we posit that shifts in the age- 
period-specific rates of both homicide and suicide are negatively related to shifts in the social 
integration and regulation of cohorts (after controlling for age and period), we hypothesize that 
the residuals in the age-period-specific suicide and homicide rates in a model that only controls 
for age and period are positively correlated. \f these residuals are positively correlated, it 
suggests that some common factor or factors, besides age and period, are affecting the 
cohort’s suicide and homicide rates.’ 

To examine whether part of the correlation between these residuals for homicide and 
suicide can be accounted for by factors that generate cohort-based integration and regulation, 
we add the cohort family structure indicator and relative cohort size to the model that 
contains the age and period dummy variables. We hypothesize that adding these cohort 
characteristics to the equations for suicides and homicides will reduce the correlation between 
the residuals. \n all of our analyses, we analyze data on the total population and disaggregated 
data on four race-sex groups. This provides a “pseudo-replication” and some indication of the 
generalizability of our results. 


On the Use of Homicide Deaths 


In Durkheim's era researchers used data on the death of individuals from the vital statistics 
of nations to examine the relationship of social factors to crime rates. Yet much of the theory 
relating social integration and regulation to homicide rates involves homicide offenders. In this 
paper, we use the data on homicide deaths (victims), because it is available broken down by 
age for the period 1930 to the present. Homicide offender data for the United States (based 
on those arrested) broken down by age is available only from 1958 to the present. 

Researchers, however, consistently find that homicide offenders and victims tend to have 
similar demographic characteristics in terms of sex, race/ethnicity and age (Cook and Laub 
1998; Wolfgang 1958). Given these similarities, it is not surprising that macro-level data on 
victims and offenders are similar. Cook and Laub (1998) note that trends in the age of both 
homicide offenders and homicide victims are similar over time for blacks and whites. The 
rates for both victims and offenders increased dramatically for younger age groups from the 
mid-1980s to the 1990s (Fox and Zawitz 1998). It is important to note, however, that these 
distributions are not identical. The age distribution for victims tends to peak at a slightly older 
age than that for offenders — but the shape of the age distributions are quite similar.? 


Methodology 


We use an Age Period Cohort Characteristic model to analyze the data. The data are grouped 
in five-year intervals (the data on age-period-specific rates for homicide deaths and suicides 
are available only for five-year age groupings over much of the time span covered in our 
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analyses). Typically, in APCC models with five-year age groupings, the periods are spaced five 
years apart, and the cohorts are based on five-year groupings. Thus, the age groups in any 
one period do not contain people who were in that age group during another period, and age 
groups in any one period are based on only a single specified cohort. In our analyses there are 
14 age groups (10-14, 15-19,..., 75-79); 15 periods (1930, 1935,..., 2000); and 15 cohorts 
(beginning with those born between 1915 and 1919 who were 10-14 in 1930 and ending with 
those born between 1985 and 1989 who were 10-14 in 2000).* Our measures of the age- 
period-specific homicide death rates and suicide rates for the total population and for white 
males, white females, non-white males and non-white females come from the Annual Vital 
Health Statistics Reports (U.S. Department of Health, Education and Welfare various years).° 

Consistent with previous work in this area (O'Brien and Stockard 2002; Stockard and 
O'Brien 2002a), we use the percentage of the birth cohort born to unwed mothers, which we 
label as the percentage of non-marital births (NMB), to gauge one important aspect of 
cohorts’ childhood family structure. This measure comes from Vital Statistics of the United 
States (United States Bureau of the Census 1946, 1990). This is a proxy measure for some 
important shifts in the family structure of birth cohorts. Our measure of NMB is, for example, 
highly associated with the estimated percentage of the birth cohort who lived in single-parent 
families when they were 5 to 9 years old,® and those growing up in single-parent families are 
five times more likely to grow up in poverty (McLanahan and Sandefur 1994). The measure 
of relative cohort size (RCS) is the percentage of the population ages 15-64 who were 15-19 
in the period when the cohort was 15-19 and is computed from data available in the Current 
Population Surveys: Series P-25 (United States Bureau of the Census, various dates). Both 
NMB and RCS can, and do, reflect historical shifts such as the Great Depression (associated 
with a decrease in the relative cohort size of cohorts born in that period) or the unavailability 
of marriageable men due to imprisonment or war (associated with increases in the 
percentage of non-marital births for a cohort). Although these two measures are demographic 
ones, they are certainly not due solely to demographic factors. They provide important 
measures of theoretically relevant shifts in the characteristics of cohorts that are available 
over an extended period of time broken down by race. They can reasonably be viewed as 
generators of social integration and regulation in their own right. Other measures of cohort 
related social integration and regulation or measures of factors that generate social 
integration and regulation, whether demographic or historical in nature, would be appropriate 
for testing our theory. Measures of NMB and RCS were calculated for the total population and 
for whites and non-whites separately. These characteristics of cohorts are hypothesized to 
generate lower levels of social integration and regulation within cohorts. 

In our regression analyses, we log the age-period-specific death rates and the measures 
of relative cohort size and non-marital births. This is consistent with our assumption that 
proportionate increases in each of these independent variables are related to proportionate 
increases in the rate of lethal violence. This assures, for example, that a 20 percent shift in the 
age-period-specific homicide rate for those 45-49 is weighted the same as a 20 percent shift 
for those who are 20-24 (controlling for other variables in the model). This transformation is 
important, because we are interested in shifts in homicide across all ages and periods and not 
only those with the highest rates. The coefficients associated with each of these logged 
independent variables are, thus, interpretable as the percentage change in the rate of lethal 
violence associated with a percentage change in the cohort characteristic controlling for the 
other independent variables in the model. 

Recognizing the dramatic differences in the age distributions of homicide deaths and 
suicides, we examine the relationship between suicide and homicide conditioned on both age 
group and period by regressing the logged measures of the age-period-specific homicide and 
suicide rates on the age group and period dummy variables. Then we examine the 
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relationship between the residuals for homicide and suicide. Conceptually, entering age group 
and period dummy variables into the model predicting homicide (or suicide) allows us to 
examine the average age curve for each period. We could do this using the coefficients from 
the age-period model that contained just those dummy variables. The age curves for each 
period would all have the same shapes (proportionately), but would be higher or lower 
depending upon the average rates for the particular period. 

Central to our claim of a common explanation for shifts in the age distributions of both 
suicide and homicide is our expectation that the residuals for the age-period-specific rates for 
homicide and suicide are positively correlated after controlling for age and period. Our theory 
predicts that cohorts lower in social integration and regulation will be more prone to both 
homicide and suicide than predicted on the basis of age and period alone while those higher 
on social integration and regulation will be less prone to both homicide and suicide. 

Comparing the line graphs in Figure 1 explicates our strategy and indicates at least one 
reason why examining the relationship between homicide death rates and suicide rates, 
without controlling for shifts in the age distribution, is likely to be misleading. As shown in 
Figure 1a, the age distributions of suicide and homicide rates for the total population differ 
dramatically with the rates for homicide deaths being relatively higher for young people in 
comparison to suicides. Figure 1b, based on the same data and years, depicts the residuals 
for homicide and suicide when the rates are predicted on the basis of age and period.’ The 
distributions of these residuals track one another closely, indicating that some other factor(s) 
are likely affecting both of these variables in a similar manner. The residuals for 2000 show a 
dramatic pattern with the younger age groups having much higher rates than was typical 
across the entire data set for both suicides and homicides. We have hypothesized that two 
characteristics of cohorts (RCS and NMB) are likely to explain part of the relationships 
depicted in these residuals. 

To directly examine these expectations, we analyze the age-period-specific data for suicide 
and homicide using a Seemingly Unrelated Regression (SUR) model that employs Feasible 
Generalized Least Squares estimation. The SUR model allows us to run the APCC models for 
homicide deaths and suicides simultaneously while taking into consideration the potentially 
correlated residuals produced by the two models. The output from the program provides 
efficient estimates for the effects of each of the dummy variables and cohort characteristics 
on the suicide and homicide rates and for the correlation between the residuals for these two 
models. 

The basic APCC model that we use in this SUR framework appears in equation (1), where 
the dependent variable is either the log of the age-period-specific suicide or homicide rates. 


In [APSR =p +a; +a) +p In(RCSp) + v In(NMBy)+ ei (1) 


where wp is the intercept, a i is the age effect for the ith age group, m/ is the period effect for 
the /" period, p is the regression coefficient for the logged values of RCS, /n(RCS,) is the 
logged relative cohort size for the k" cohort, n is the regression coefficient for the logged 
values of NMB, /n(NMB,/ is the logged percentage of non-marital births for the k" cohort, and 
e,; is the error term for the /" observation. We use the youngest age group (10 to14) and the 
earliest period (1930) as reference groups for the dummy variable sets for age groups and 
periods. When running models for the total population, white populations, or non-white 
populations, we use values of RCS and NMB that are based on the corresponding total or — 
race-specific populations. 

The APCC model controls for the main effects of age and period by using dummy 
variables. Controlling for period adjusts for different average logged lethal violence rates 
(suicide or homicide) in different periods. Controlling for age adjusts for different average 
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logged lethal violence rates across different age groups. Controlling for period also controls 
for variables associated with period whose effects are constant across age groups. Thus, to 
the extent that the effects of medical technology or a particularly warm year are constant 
across age groups, these effects are controlled for by the inclusion of the period dummy 
variables. Similar statements may be made about the inclusion of dummy variables for age. 
Additionally, including the age group and period dummy variables in the model controls for 
the linear effect of cohort year of birth. Because this trend is controlled for, any effect of the 
cohort characteristics that is linearly related to cohort time of birth is controlled (O'Brien, 
Stockard and Isaacson 1999). 

We first run this model for homicide deaths and suicides separately (for the total 
population, white males, white females, non-white males and non-white females) without the 
cohort characteristics and report the correlation between the residuals. We then include RCS 
and NMB as independent variables. We expect these cohort-related measures will be 
positively related to both age-period-specific homicide and suicide rates, and, that the 
inclusion of these two variables will reduce the positive correlation between the residuals. 
These residuals may still be positively related, since any factors that make cohorts more or 
less prone to lethal violence and are not included in the model would produce such a result. 


Results 


Descriptive Results 


Table 1: Descriptive Statistics for Age-Period-Specific Homicide Death Rates, Suicide 





White Non-White 
Variables Total Male Female Male Female 
Homicide Rate 
Mean 8.88 7.66 2.43 58.60 12.31 
SD 5.60 4.92 1.20 40.10 7.81 
Range 50-2242 .36-19.89 22-542 1.99-149.90 .74-28.50 
Suicide Rate 
Mean 11.76 19.32 6.02 11.80 Oakn 
SD 5.82 9.92 3.56 6.21 1.48 
Range 28-20.70 .41-42.95 10-1401 20-2766 .10-6.82 
Non-Marital Birth Rate 
Mean 4.81 2.54 18.76 
SD 3.85 Do2 9.08 
Range 2.10 — 24.5 1.30 -16.94 11.86 — 54.74 
Relative Cohort Size 
Mean 12.87 12.58 14.99 
SD 1.60 1.61 1.80 
Range 10.41 - 15. 10.04 -14.90 12.05 — 18.60 
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Rates, Non-Marital Births and Relative Cohort Size 


Table 1 presents descriptive statistics for the measures of lethal violence for the total group 
and each race-sex subgroup and descriptive measures for NMB and RCS for the total group 
and for each race subgroup. Each of the measures shows substantial variation. For the total 
population, homicide death rates range from .50 for those 10-14 in 1960 to 22.4 for those 20- 
24 in 1990; and suicide deaths range from .28 for those 10-14 in 1950 to 20.70 for those 55- 
59 in 1975. The percentage of non-marital births among the total population ranges from 2.1 
to 24.5 and relative cohort size varies from 10.41 to 15.33. Both the average values of NVB 
and RCS, as well as the measures of their variability, are higher for non-whites than for whites. 
Patterns of fluctuation for both measures, however, are similar for the race groups over time. 
Importantly, RCS has a somewhat cyclical time trend: for the cohort born between 1915 and 
1919, the total population RCS is 13.89 percent and then drops to 10.80 percent for the 1930- 
1934 cohort, it then increases to 15.33 percent for the cohort born between 1955 and 1959 
and then declines to 10.41 percent for the 1985-89 birth cohort. The total population NMB is 
2.10 percent for the cohort born between 1915 and 1919 and nearly doubles to 4.08 percent 
for the cohort born between 1935 and 1939, it then decreases and does not exceed 4.08 
percent until the cohort born between 1955 and 1959 when it reaches 4.82 percent. This is 
followed by a dramatic increase to 24.54 percent for the cohort born between 1985 and 1989. 


Explaining the Joint Distributions of Suicide and Homicide 


Table 2 presents results from the SUR models for the total population and for each genderrace 
(white/non-white) group. The first two sections report the results for suicide and homicide, 
respectively. These are, in a sense, replications of the research reported in Stockard and O’Brien 
(2002a) on suicides and O'Brien and Stockard (2002) on homicide deaths: however, (1) our 
analysis includes data for the period 2000, (2) we log the cohort characteristics as well as the 
rates of suicide and homicide to specify that proportional changes in these characteristics are 
associated with proportional changes in suicide and homicide rates, and (3) we use a SUR 
model which produces more efficient estimates of the regression coefficients given that the 
residuals from these two equations correlated. These changes result in only minor differences. 
In Stockard and O'Brien (2002a), for example, the effects of RCS and NMB on suicide rates are 
positive in all of the models (total, white males, white females, non-white males and non-white 
females) and are statistically significant in all of the models except for RCS in the models for non- 
white males and females. This is identical to the results reported in Table 2 for LNRCS and 
LNNMB. In O'Brien and Stockard (2002) the associations of both cohort characteristics and 
homicide rates for each of the models are positive and are highly statistically significant except 
for the effect of RCS in the models for white males and females. In Table 2 the association of 
LNRCS with homicide death rates is positive for each of the models and is also statistically 
significant (p < .05).* 

For suicide the R-squares range from .904 to .973 in the models that contain only the age 
and period dummy variables. When the cohort characteristics are added to the model, the R- 
squares range from .915 to .988. As with Stockard and O’Brien (2002a) the smallest variance 
explained is for non-white females. The homicide models have R-squares that range from 
.943 to .964 for the models that contain the age and period dummy variables and from .966 to 
.987 for the models that add the cohort characteristics. , 

With R-squares above .90 for all of these analyses when only the age group and period 
dummy variables are in the equation, it is reasonable to ask if substantively important variation 
remains in the residual. The age group and period dummy variables provide a baseline model 
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Table 2: Seemingly Unrelated Regression Models for Age-Period-Specific Suicide and Homicide Rates 
(Logged) Regressed on Age and Period Dummy Variables and Logged Values of RCS and NMB 


White Non-white 
Total Male Female Male Female 
Suicide Equations 
Riau “AP 970 973 942 937 904 
R“insuicide *A,P,CC 988 988 969 956 915 
Die A.B3or* qi2527¢ 1:108*** 583 .430 
(.208) (.157) (.258) (.333) (.352) 
Dit 4.567""% .188*** 1.061*** 1.979*** 1.190** 
(.139) (.077) (.127) (.367) (.409) 
Homicide Equations 
R roms “AP .962 .964 .959 .943 .946 
R* homicide *A,P,CC .987 .980 .966 .982 .970 
Dies 1:426°" .437* O43" 1:043"* 1:038°"* 
(.121) (.171) (.161) (.209) (.202) 
Die 127 6278s .309*** 2.829*** 1.866*** 
(.180) (.084) (.079) (.229) (.234) 
Residual Correlations 
homtail 242 oe .639*** .406*** o54e 236" 
Thom,sui #4,P,CC 504"** .456*** .202* A215" 032 
N= 119 119 119 118 109 


Note: The analyses for the total population and the race-sex specific analyses are based on 

number of cases with no missing data on either homicide or suicide for the SUR model. 

*R-squared for the dependent variable regressed on the age and period dummy variables. 
R-squared for the dependent variable regressed on the age and period dummy variables 

and the two cohort characteristics. 

oe 0S ap <01ls  p'<.001 


in which the shape of the age distribution is the same (proportional, since the dependent 
variable is logged) across all periods and the period coefficients move these age distributions 
“up or down.” Thus, for example, the dramatic shifts in the shapes of the age distributions for 
suicides from 1930 to 2000 in Figure 1a are based on these residuals. The shifts in the age 
distributions for homicide and suicide are important and have been labeled as epidemics of 
youth homicide and suicide. 

The correlations between the residuals for the age-period-specific rates from the homicide 
and suicide equations appear in the bottom section of Table 2 For the equations that contain 
only the age and period dummy variables, the correlations are all positive and statistically 
significant and, except for non-white females, at least moderately strong. For example, the 
correlation between the residuals for homicide and suicide for the model based on the total 
population is .779. These residuals share 61% (= .779? x 100%) of their variance in common. 
The shared variances for the race-sex subgroups range from 6 percent for non-white females 
to 41 percent for white males. In each case these correlations are statistically significant (p 
< 05). This indicates that shifts in the age distributions of homicide rates and suicide rates 
moved together from 1930 to 2000 in the United States.® 
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When we add LNRCS and LNNMB to the models (second row in the residual correlations 
section of Table 2) the correlation between residuals drops to .504 for the total sample. Thus, 
after including the two cohort characteristics the variance shared by the homicide and suicide 
residuals is 25% (= .504? x 100%). This remains a highly significant amount of shared variance 
(p < .001). Some of this variance may be due to social integration and regulation inherent in the 
cohorts that are not captured by (or associated with) LNRCS and LNNMB, or to other common 
factors that affect both suicides and homicides that are not included in our model. Including the 
cohort characteristics in the model for the total population reduced the shared variance by 59 
percent [=((61 - 25)/61) x 100%]. For each of the analyses of residuals reported in Table 2, the 
two cohort characteristics explain more than 50 percent of the shared residual variance. 


Summary and Discussion 


Our results indicate that changes in the age distributions of suicide and homicide rates 
throughout much of the 20" century in the United States are strongly related. We posit that 
this relationship is due, at least in part, to the dependence of both forms of lethal violence on 
the degree of social integration and regulation that characterize cohorts. On theoretical and 
empirical grounds, we suggest that two characteristics of cohorts, relative cohort size and 
non-marital births, are negatively related to their degree of social integration and regulation 
and positively related to the age-period-specific rates of suicide and homicide deaths for the 
total population and for the rates disaggregated by race and sex. 

A careful examination of Figure 1a shows that over time the age distributions of the suicide 
rates have shifted such that in 2000 there were relatively more suicides committed by younger 
age groups than in 1930 or 1965. This is also the case for homicides. Using Seemingly Unrelated 
Regression, we have shown that for the period from 1930 to 2000 these age shifts are related. 
Further two cohort characteristics can explain much of the communality in these shifts in the 
age distributions for suicide and homicide rates (more than half of the shared variance in these 
shifts). The remaining common variance in these shifts may be due to social integration and 
regulation in these cohorts that is not associated with these two measures, or it may be due to 
other common factors that affect shifts in these distributions over time. 


Theoretical Implications 


Durkheim maintained that two conditions prevalent in modern society led to suicides (egoism 
and anomie), but that one of these conditions (egoism or lack of integration) was negatively 
related to homicide while the other (anomie or lack of regulation) was positively related to 
homicide. His hypothesis of the “mixed. effects” of integration on suicide and homicide rates 
is perhaps best viewed in the context of the conflicting research findings of his day. Some 
researchers claimed that homicide and suicide were reciprocally related (Ferri 1883-84; 
Morselli 1882 [1879]). If one predominated, the other was reduced. Others claimed that they 
often grew together (Tarde 1886). This dispute has continued to the present. (For a 
comprehensive review, see Unnithan, et al. 1994.) Unnithan, et al. (1994) suggest that there 
are common “forces of production” that produce both suicides and homicide, but there also 
are “forces of direction” that come into play to make suicide or homicide predominate in _ 
terms of growth or in terms of the dominant mode of a particular society or group. 

Our hypotheses are limited to the effects of social integration/regulation. We hypothesize 
that insufficient social integration/regulation is a “force of production” that increases both 
suicide and homicide. This view differs from Durkheim who saw lack of regulation as a “force 
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of production” for both suicide and homicide, but viewed a lack of integration as increasing 
suicide and decreasing homicide in modern societies. Most modern criminologists would 
agree with our position that a lack of social integration is positively related to homicide rates. 
That position is based on both theoretical reasoning and empirical evidence: 


Policy Implications 


Extent of the Problem 

Not surprisingly, the level of lethal violence varies greatly by race and sex. We present just one 
example (albeit a cogent one) of the impact of youthful lethal violence on one subgroup. 
Using data from 1999, the rate of lethal violence (suicides and homicides) is 130 per 100,000 
for African American males in the age range 20-24. This rate is nearly four times higher than 
the rate for white males of that age. (The rates for African American women and white 
women are even lower than that for white men). Thus, the loss of life due to lethal violence 
is particularly staggering among young black males. Even though black males comprised only 
7 percent of the population under age 25 in 1999 (U.S. Bureau of the Census various years), 
they accounted for 30 percent of all deaths due to lethal violence occurring to this age group: 
11 percent.of the suicides and 44 percent of the homicides (National Center for Health 
Statistics, 2001). In addition, black men comprise a large proportion of people arrested for 
homicide, which is a fairly direct result of one form of lethal violence. Such arrests removed 
over 5,000 people (most of them young males) from black communities in 1999 (Federal 
Bureau of Investigation, 2000), a loss close to two-thirds the loss derived from lethal deaths 
for black males (7,865 for black males in all age groups in 1999). 

Lethal violence and the impact of arrests for homicide cumulate over time. Over a 10-year 
period the removal of young men from black communities due to lethal violence are 
permanent, and for homicide arrests and convictions, typically long lasting. Such cumulative 
losses are especially severe for communities where these effects are concentrated. These 
losses affect the social structural and institutional foundations of communities through a 
number of mechanisms including: larger proportions of single-parent families, increased 
poverty, decreased social capital and generally lower levels of social organization (Wilson 
1987). The shift of lethal violence to younger age groups is not confined to the black 
community; for example, from 1960 to 2000 the rate of lethal violence doubled for white men 
in the age group 10-24. 


Providing for a Common Direction 
The current policies directed at suicide and homicide prevention have strikingly different 
orientations, with suicide prevention plans originating in departments of public health and 
focusing on mental health diagnoses and treatments, and homicide prevention policies 
occurring within criminal justice systems and focusing largely on punishment as a deterrent. 
Our analyses indicate that common factors underlie changes in the age distribution of both 
suicide and homicide and that these common factors can account for commonalities in 
changing rates of lethal violence across race-sex groups. If both forms of lethal violence have 
common roots in variations in social integration and regulation, a sensible approach would 
deal with these common roots and find ways to promote greater integration and regulation. 
Focusing on suicide and homicide together can help bring issues regarding lethal violence into 
better focus. First, the true magnitude of the problem may become more apparent with rates that 
are strikingly high. Second, the problem cannot be easily dismissed as one that belongs to 
“someone else.” Instead, the issue of lethal violence, when seen in its broader format, faces all 
demographic groups: rich and poor, white and non-white. Third, focusing on the common roots 
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and the need for greater cohort integration and regulation builds on the findings and policy 
suggestions of generations of researchers within the fields of both suicidology and criminology. 


Needed Policies 

We have found that two characteristics of cohorts are highly associated with the dramatic 
increase in lethal violence among the young and have suggested that cohort size and disrupted 
families affect the degree of social integration and regulation that cohorts experience. Thus, 
policies that focus on the provision of alternative sources of social integration and regulation 
should help decrease the amount of lethal violence experienced by cohorts. 

Recent analyses of cohort-related variations in suicide rates throughout a range of modern 
western countries (Stockard and O’Brien 2002b) show that cohorts characterized by higher 
levels of RCS and NMB exhibit higher suicide rates. Those analyses, however, indicate that 
these effects are moderated within societies that provide greater support for families, 
mothers and children. Such policies can take a variety of formats such as child allowances, 
higher welfare payments, extensive programs of paid maternity leaves and subsidized child 
care (Pampel 2001), but the important point is that they provide ways to help families 
supplement or replace losses in income and supervision and counter the stronger roles of 
peer groups that result from higher levels of RCS or NMB. Within the criminological literature 
a number of studies have documented the strong role of social support and relationships, 
whether they come from employment or family ties, in stemming and even reversing criminal 
careers (Sampson and Laub 1993). Policies designed to ensure entry-level jobs for those 15- 
29 should be especially useful in providing support during the crucial young adult periods and 
in helping to launch young people on supportive career and family trajectories. 


Notes 


1. Durkheim (1966 [1987]:288) writes “Two factors of suicide, especially, have a peculiar 
affinity for one another: namely egoism and anomy. We know that they are usually merely 
two different aspects of one social state; thus it is not surprising that they should be 
found in the same individual. It is indeed almost inevitable that the egoist should have 
some tendency to non-regulation; for, since he is detached from society, it has not 
sufficient hold on him to regulate him.” 


2. The incidence of homicides followed by suicides is relatively rare in the United States. 
Stack (1997) reports 245 such incidents out of 16,245 homicides in Chicago. He notes that 
this percentage of homicides followed by suicides (1.63 percent) is not unusual for studies 
conducted in the United States, but that in many other “developed” countries the 
phenomenon is much more common. This low rate of joint occurrences (given the strong 
shifts in the age distribution shown in Figure 1a) should have little effect on our analyses. 


3. Given our method of analysis that controls for age groups this consistent lag in the peak 
age of homicide victimization compared to homicide offending matters little in terms of 
the direction and magnitude of our results (compare, O'Brien, Stockard and Isaacson 
1999; O'Brien and Stockard 2002). 


4. There are other reasons for using 10 to 14 year olds as our youngest age group. Suicides 
are so infrequent at younger ages that the rates tend to be unstable and zeros are difficult 
to handle in our analysis (since we log rates). Further, the correspondence between rates 
of homicide victimization and homicide offending is not good for the very young. 
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5. Beginning our series in 1930 reflects the time when Vital Statistics data were available 
from the vast majority of states and the point at which data on non-marital births, one of 
our measures of cohort characteristics, was first available for the ages included in the 
analysis. We use non-white rates rather than black rates because non-marital births are 
not broken down by white and black until the 1970s. 


6. Savolainen (2000) obtained Public Use Micro Sample (PUMS) census data on the 
percentage of those in five-year cohorts who lived in single-parent families when they 
were ages 5-9 for the years 1910, 1940, 1960, 1980 and 1990 (the only available years). He 
then estimated the percentage in single-parent families for each year from 1911 through 
1939 and from 1941 through 1959, and so on, then aggregated these into estimates for 
five-year cohorts. The interpolated data were not broken down by race. The correlation of 
our measure of NMB for the total population with Savolainen’s is .98. The correlation 
between the first differences of those two measures is .90. We acknowledge that we use 
only one measure of family structure, but it is reassuring that it is highly correlated with 
the percentage of those in five-year birth cohorts who lived in single parent families when 
they were ages 5-9. Although the reasons for non-marital births may have varied over time 
— the percentage of non-marital births for cohorts exhibits a strong relationship to growing 
up in single-parent families. Our focus is on such aggregate-level relationships. 


7. The age and period coefficients were based on the OLS regressions of both homicide 
deaths and suicide deaths using a data set that includes five-year age groups from 10-14 
to 70-74 for the periods 1930 and 1935 and from 10-14 to 75-79 for periods from 1940 to 
2000 (1940, 1945,..., 2000). 


8. In the analyses in Table 2 we treat zeros as missing data because the log of zero is 
undefined. This resulted in 10 missing cases for non-white females and one missing case 
for non-white males. We also ran the analysis substituting for the zero values the lowest 
observed rate for the race-sex group and then taking the log. This did not substantively 
change the results for the non-white males, but did produce a coefficient for LNNMB in 
the suicide equation for non-white females that was statistically insignificant (t = .869) and 
a correlation between the residuals in the model that contained only age and period 
dummy variables that was positive, but not statistically significant. - 


9. The results reported in Table 2 are based on the cases for which we have cohort characteristic 
data. This allows us to compare the ability of the cohort characteristics to explain the 
relationship between the residuals (based on the same set of cases). When we do not use 
the cohort characteristics, it is possible to analyze data from each of the age groups for each 
of the periods, for example, data for each of the age groups in 1930 rather than just the age- 
period-specific rate for those 10-14 in 1930. For this larger set of data, the correlations 
between the residuals for homicide rates and suicide (after controlling for age and period) are: 
total sample, r = .71; white males, r = .82; white females, r = 38; non-white males, r = .20; 
non-white females, r = .17. Each of these correlations is statistically significant at the .05 level. 
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Abstract 

Wilson’s deindustrialization thesis has been the focus of much recent research. This study is 
the first to empirically test his thesis as it relates to suicide among young black males, which 
has increased dramatically over the past two decades. Using 1998-2001 Mortality Multiple 
Cause-of-Death Records and 2000 census data, we examine the influence of concentrated 
disadvantage on suicide among young black males across U.S. cities. After establishing its 
role in shaping suicide rates, we explore the extent to which industrial composition (the 
outcome of deindustrialization) affects concentrated disadvantage in urban communities. 
We perform similar analyses for whites to compare and contrast explanatory processes. 
Our findings show that while disadvantage is related to suicide for young black and white 
males, industrial composition only influences the structural covariates of suicide among 
blacks. These findings demonstrate the ability of Wilson’s thesis to help explain a pressing 
social problem — rising rates of young black male suicide. 


Wilson's (1987, 1996) deindustrialization thesis has been the focus of much recent research 
(Crutchfield 1989; Ousey 2000; Sampson and Wilson 1995; Wadsworth 2004). Wilson argues 
that widespread economic transitions since the 1960s have altered the socioeconomic structure 
of American inner cities. Significant declines in manufacturing work, and its replacement with less 
stable and poorly compensated service sector employment, have led to racially segregated 
communities often characterized by acute poverty, joblessness, and a sense of alienation from 
mainstream society. These worsening conditions, in turn, have produced a variety of social ills. 
Most frequently documented is the strong association between structural disadvantage and 
crime in urban environments — typically poor black communities (Krivo and Peterson 1996; Kubrin 
and Wadsworth 2003; Kubrin and Weitzer 2003; Sampson 1987). Very few studies, however, 
have assessed the relationship between structural disadvantage and another social ill — suicide. 
Yet a significant increase in the number of suicides committed by young African American males 
since the 1980s indicates that greater attention to possible connections between macro-social 
characteristics and suicide among this population is warranted. From 1980-1995 suicide rates 
among males ages 15-19 increased by 146 percent for blacks but only 22 percent for whites 
(CDC 1998:194) and from 1980-1992, the percentage increase in suicide among young black 
males was more than six times greater than the increase for young black females (CDC 1995). 
These statistics suggest that causal processes may be gender- and race-specific. The present 
study examines these issues in a number of ways. First, we discuss the structural and cultural 
implications of acute disadvantage on the black community, with special attention paid to the 
group with the highest suicide rate — young black males. In this discussion we focus on how 
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social integration, a major inhibitor to suicide, may be affected by concentrated disadvantage and 
related characteristics. Second, we empirically examine the association between concentrated 
disadvantage and young black male suicide across U.S. cities. And third, we explore the extent to 
which the industrial composition of a city, as well as changes in industrial composition over the 
past two decades, affect concentrated disadvantage in urban black communities. We perform 
similar analyses for whites to compare and contrast explanatory processes. Ours is the first study 
to empirically test Wilson's thesis with respect to Suicide. 


Integration and Suicide 


In Suicide (1897) Durkheim theorized that “suicide varies inversely with the degree of 
integration of the social groups of which the individual forms a part.” (1951, p. 209) 
Accordingly, suicide is hypothesized to be lower in communities that feature a high degree 
of marital, religious or familial integration, compared to communities characterized by 
excessive individualism. Central to Durkheim's theory is the notion that social integration acts 
as a form of social control or regulation over individuals. When individuals drift without a 
sense of belonging or community they become isolated and the social ties that might 
otherwise inhibit suicidal tendencies are weakened. This idea forms the basis for the social 
integration-regulation thesis. 

Scholars have examined the relationship between aggregate rates of suicide and various 
measures of social and economic cohesion including family integration (Stafford and Gibbs 
1988; Stockard and O’Brien 2002), religious homogeneity (Ellison et al. 1997), economic 
convergence and inequality between races (Burr et al. 1999; South 1984), and cohort size 
(Stockard and O'Brien 2002). Regardless of measure, integration is consistently found to 
affect suicide rates as the social integration-regulation thesis predicts. ' 

For example, Stockard and O'Brien (2002) find that suicide varies inversely with family 
stability; members of cohorts with more non-marital births are at greater risk for suicide 
throughout their lives. More importantly, according to the authors, increases in non-marital births 
are most responsible for recent spikes in youthful suicide. Burr et al. (1999) report similar 
findings specifically for black males. Suicide for this population is associated with high rates of 
marital disruption and greater percentages of female-headed households. And findings from 
Stafford and Gibbs (1988) lend further support for the effects of marital integration, with one 
important caveat — after controlling for occupational integration, marital integration is less 
relevant for suicide. While this may call into question the marital integration-suicide relationship, 
Wilson (1987) asserts that both types of integration are interrelated within the social and 
economic structure of a community. Employment has been viewed as a precursor for marriage 
and as such, communities with fewer job opportunities may also have lower marriage rates. 

Ellison et al. (1997) focus on religious homogeneity defined as “the extent to which 
community residents adhere to a single religion or a small number of faiths.” (1p.76) They find that 
religious homogeneity is negatively associated with suicide, even after controlling for family and 
economic integration. Other aggregate measures of religious integration, such as church 
membership, are also negatively related to suicide, at least for black males (Burr et al. 1999:1061). 

Another approach researchers have used to test the social integration-regulation thesis 
considers the role of economics as an integrating (or isolating) force. These researchers study 
the effect of economic convergence or its opposite, inequality, on suicide rates. With respect 
to socioeconomic status inequalities between blacks and whites, Burr et al. (1999) find that 
higher levels of inequality result in greater suicide risk among black males. Consistent with 
Durkheim's concept of anomie, the authors suggest that “blocked opportunities within a 
climate of higher expectations about increased opportunity lead to a greater probability of 
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violence.” (p. 1054) South (1984), on the other hand, examined whether economic 
convergence between whites and non-whites from 1949-1979 influenced each group's 
suicide rates. In contrast to Burr et al. (1999), he found that “diminishing racial income 
inequality fails to impact either the white male or the white female suicide rates, but as 
expected, tends to raise the suicide rates of both non-white males and non-white females.” 
(p. 178) Although South (1984) maintains that this finding supports Jedlicka et al.'s (1977) 
claim that “the racial convergence in suicide rates is another and less welcome indication of 
black assimilation in the general stream of American society,” (1984:173) three 
methodological issues must be noted. (1) Few structural characteristics were controlled for in 
the study, potentially resulting in missing variable bias. (2) Aggregating to the national level 
may disguise important regional or city-specific heterogeneity. (3) Using the measure of 
median income may oversimplify the role of convergence. Still, findings from both studies 
point towards the importance of economic factors as integrative forces. 

While integration has been conceptualized in a variety of different ways in studies that 
explore Durkheim's propositions, collectively, this body of research offers support to the 
integration-suicide thesis. This is important when one considers that over the past 40 years, 
industrial restructuring has resulted in significant changes within black urban communities in 
the United States, with potential consequences for community integration. We turn now to a 
discussion of such changes, their consequences for community integration, and the potential 
effect such forces may have on suicide among young black males. 


Deindustrialization and Concentrated Disadvantage in the Inner City 


Beginning in the 1960s, a general decline in urban manufacturing industries led to a significant 
reduction in labor market opportunities available to less educated urban residents. African 
American city dwellers, who had fewer resources and faced discrimination in both job and 
housing markets, were especially affected. Individuals that once worked in well-paid, semi- 
skilled manufacturing jobs found themselves unemployed or working parttime in retail stores, 
restaurants and other service sector jobs that paid less, offered few benefits or promotional 
opportunities, and were less secure. 

In addition to the “individual-level effects” of unemployment and employment in less 
desirable jobs, labor market shifts within some communities created “concentration effects” 
by altering the economic well-being of the community as well as the distribution of human 
activity, According to Wilson (1987, 1996) and Sampson and Wilson (1995), these changes 
were detrimental to family formation, community participation and the support of local 
institutions such as schools, churches and community centers. The negative consequences 
of the growing deprivation of many residents encouraged those who could afford to leave to 
do so. The exodus of working- and middle-class black families into the suburbs and 
surrounding areas excelled this downward spiral into joblessness, economic despair and the 
subsequent disorganization of the inner city. As Wilson (1987:57) explains “in ghetto 
neighborhoods that have experienced a steady out-migration of middle- and working-class 
families — communities...that lack a social buffer — a sudden and/or prolonged increase in 
joblessness creates a ripple effect resulting in an exponential increase in related forms of 
social dislocation.” The social and economic implications of these shifts were calamitous for 
the already disadvantaged members of the community. Prior to the 1970s, middle- and 
working-class residents acted not only as the economic backbone of inner-city businesses 
and organizations but also as role models and mentors for younger generations. Their 
presence in disadvantaged neighborhoods ensured that educational, religious and civic 
organizations were maintained, and supportive networks were formed within the community. 
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Perhaps one of the most perceptible indications of the diminishing social organization in 
the inner cities is the breakdown of the black family unit. Prior to the 1970s, two-parent 
households were far more prevalent. In 1965, about 25 percent of all black households were 
headed by women; this figure grew to 45 percent in 2000 (U.S. Census 2002). The correlation 
between a diminished structure of opportunity and an increase in single-parent households is 
not difficult to comprehend. With so few legitimate jobs available, the pool of marriageable or 
employed men declined significantly, making marriage a less viable option for many women. 
Between 1970 and 1980, the number of black children growing up in fatherless families 
increased by 41 percent (Wilson 1987:70). Some scholars argue that family disruption has 
resulted in weakened community networks and informal social control (Sampson 1987). 


Suicide and the Cultural Implications of Structural Disadvantage 


A concentration of single family-households, poverty and unemployment, coupled with the 
social isolation that characterizes many inner-city neighborhoods has created an environment 
in which the tendency toward deviant, anti-social behavior is high (Almgren et al. 1998; 
Anderson 1999; Sampson and Wilson 1995). It is not hard to imagine that the same 
environmental factors found to be associated with externalized violence (Krivo and Peterson 
1996; Kubrin and Wadsworth 2003; Kubrin and Weitzer 2008; Sampson 1987; Wadsworth and 
Kubrin 2004) might correspond to violence turned toward the self. To understand the 
relationship between structural disadvantage and suicide, it is necessary to consider the 
structural and cultural implications of these dire conditions, particularly among the younger 
generation of blacks who have been faced with persistent disadvantage throughout their lives. 

Scholars assert that acute structural disadvantage leads to social isolation (Anderson 1999; 
Bruce et al. 1998; Krivo and Peterson 1996) — defined by Wilson (1987:60) as “the lack of contact 
or of sustained interaction with individuals and institutions that represent mainstream society.” 
The interaction between structural disorganization and cultural social isolation in the 
determination of deviant behavior is emphasized by Sampson and Wilson (1995) who claim that, 
“Structurally disorganized communities are conducive to the emergence of cultural value 
systems and attitudes that seem to legitimize... crime and deviance.” (p. 50) 

Along these lines, scholars have found that the alienation of the black poor from mainstream 
society has resulted in the emergence of a subculture within urban ghettos, often characterized 
by hostility, aggression and a general devaluation of human life (Anderson 1999; Kubrin and 
Weitzer 2003; Sampson and Wilson 1995). It is not argued that the oppositional subculture is a 
consequence of the racial make-up of a given locality, but rather as Bruce et al. (1998:32) 
articulate, it is a “manifestation of local structural conditions and general levels of opportunity.” 

Social isolation and the experience of living within the confines of extremely 
disadvantaged environments have been shown to create an acute sense of hopelessness 
among inner-city youth (Anderson 1999; Bruce et al. 1998; Sampson and Wilson 1995). With 
fewer organizational institutions such as churches, businesses or community-based 
programs, youths coming of age in this environment frequently lack the structure or 
opportunities that give them a sense of purpose in their lives. Specifically for young black men 
living in the inner city, the frustration of being unable to assert their manhood through 
legitimate structures, coupled with the persistent threat of interpersonal violence, can create 
a profound sense of despair. Growing up in an environment filled with violence and _ 
aggression can lead to psychological states such as nihilism (Bruce et al. 1999). Nihilism and 
a lack of hope for the future are key determinants of the relationship between structural 
disadvantage and suicide for young black males. Often, children living in acutely 
disadvantaged, isolated communities acquiesce to their bleak futures with a sort of grim 
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acceptance. Of their sense of mortality, Anderson (1999:135) writes, “One must understand 
that some young people bereft of hope for the future have made their peace with death and 
talk about planning their own funerals... The high death rate among their peers keeps many 
from expecting to live beyond age twenty-five.” Not surprisingly, residents living in these 
environments are more prone toward deviant behavior, including drug use, teenage 
pregnancy and most notably, interpersonal violence. We suggest that these environmental 
forces can also lead to higher rates of suicide. 

The lack of opportunity within disadvantaged areas has resulted in a sort of adaptive 
strategy on the part of its inhabitants. The adaptations employed by men and women are key 
to understanding why structural conditions may particularly influence black male suicide rates. 
For the generation of young men living in urban ghettos, the implications of structural 
disadvantage are especially hard to rectify. Whereas women may use motherhood as a means 
of developing supportive social networks, the pressure on young men to assert their 
independence and strength often leaves them unable to seek emotional support from others. 
Young men fail to establish the same social bonds or sense of purpose through fatherhood 
(Anderson 1999). And poverty, joblessness and the lack of social capital deprive young men of 
the “traditional way of proving their manhood — by supporting a family.” (Anderson 1999:177) 
Moreover, studies suggest that black families and other institutions provide differential levels 
of support and positive reinforcement for black male and female adolescents with 
consequences for deviant behavior, ideas in line with social learning theory (Akers 1985). For 
example, studies of black families show that boys are disciplined more harshly, trained for 
independence earlier, and positively reinforced for adolescent aggression and sexuality (Gibbs 
1988:84). In addition, differential reinforcement by teachers in schools and differential 
treatment by police, employers and other gatekeepers of society factor in. This pervasive and 
persistent differential treatment results in less nurturance, fewer social supports, lower levels 
of positive reinforcement and fewer opportunities for black males than black females. 

While community organization and strong neighborhood role models once helped to 
establish a sense of cohesion and normative values that served to deter anti-social behavior, 
their presence in urban ghettos has faded in the past several decades (Anderson 1999; 
Wilson 1987). For the younger generation growing up in these environments today, there is 
little in the community that offers them a sense of purpose or a connection to mainstream 
society; consequently, nihilism has become a side effect of living against a backdrop of 
persistent socioeconomic despair. 

The relationship between structural disadvantage, cultural counian and externalized 
violence is one that has been demonstrated clearly in previous research but fewer studies 
exist which examine how these factors may relate to violence turned toward the self in the 
form of suicide. Yet today suicide among young black males is at an all-time high. The present 
study determines whether and to what extent an association exists between structural 
disadvantage and young black male suicide. That a connection may exist is suggested by 
results from one recent study of Chicago neighborhoods. Almgren et al. (1998) find that 
suicide rates for young adult black males are positively associated with a secular decline in 
the economic opportunity structure. 

In addition to testing a disadvantage-suicide relationship, the study contributes to the 
literature in other ways. There are some noted limitations in the suicide literature: (1) Race has 
been seriously understudied (Stack 1996:405). (2) Little sociological research has dealt with 
the greater incidence of youth suicide (Stockard and O'Brien 2002:607). (3) There is limited 
knowledge as to geographic variations in suicide rates (Willis et al. 2002:917). This research 
addresses each of these points by examining the effect of structural characteristics on city- 
level suicide counts disaggregated by race, gender and age. In particular, we argue that 
variation across metropolitan areas in the suicide levels of young black males can be 
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explained, in large part, by structural characteristics associated with acute disadvantage. This 
argument leads to the following hypothesis: 


(1) Cities with greater levels of disadvantage will have higher suicide 
rates among young black males, controlling for other socio-economic 
characteristics. 


Although social isolation is not empirically measured in the analysis, it serves as the 
rationale behind the hypothesized disadvantage-suicide relationship, as discussed above. We 
also argue that industrial composition and changes in industrial composition are directly 
related to concentrated disadvantage, which leads to second and third hypotheses: 


(2) Cities dominated by individuals employed in manufacturing or 
“good” jobs, compared to less-desirable service sector jobs, will 
experience lower rates of black economic disadvantage. 


(3) Cities that experienced an increase between 1980 and 2000 in the 
percentage of employed individuals working in manufacturing jobs 
will witness lower levels of concentrated disadvantage among 
blacks. 


To test these hypotheses we present two sets of models. The first set measures the 
effects of economic disadvantage on suicide. The second examines whether static levels and 
changes in industrial composition influence disadvantage. We perform similar analyses for 
whites to compare explanatory processes. 


Data and Methods 


Dependent Variables 


We collected data on suicide from the Mortality Multiple Cause-of-Death Records provided by 
the Centers for Disease Control and Prevention, National Center for Health Statistics (NCHS). 
Included in the file are data records indicating a cause of death and location for each death 
that occurred in the United States. We selected out those cases where the cause of death 
was listed as a suicide. In the study we focus on suicide among young black males ages 15 
to 34. We focus on young black males for two important reasons. First, Wilson's thesis is 
particularly relevant for this population (Wilson 1996:30). Second, researchers studying 
suicide argue that this group's relatively high rates are of particular concern among public 
health officials (Burr et al. 1999:1056; Gibbs 1997:69). 

We aggregated the suicide data to the city-level, the unit of analysis of the study. The city 
is the most appropriate level of aggregation for four reasons. (1) Given the process by which 
MSAs (a larger level of aggregation) are designated, these units can be very heterogeneous. 
Indeed, in their analysis, Burr et al. (1999) include a percentage urban variable to control for 
the wide variation in urban and rural territories that exists in many of the MSAs. Cities, on the 
other hand, are more homogeneous. (2) Suicide, and particularly black suicide, is more likely _ 
to occur in urban areas (Stack 1997). For example, in 2000, the rate of suicide for young black 
males living in large cities (greater than 100,000 population) was 31 percent higher than for 
young black males living in other areas. (3) Kowalski, Faupel and Starr (1987:85) find that 
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sociological variables in urban counties tend to have much stronger explanatory power than 
in rural or middle-urban counties and conclude that sociological explanations for suicide apply 
primarily to urban environments. (4) Wilson’s argument applies to blacks in urban settings. 
To ensure reliable estimates, we included only those cities with 100,000 population or more 
(n = 219). Twenty-four cities did not report any mortality data during the study time period and 
were eliminated from the analyses, resulting in a final sample size of 195 cities.” 

One potential concern is the validity of official suicide statistics, in particular, underreporting 
due to possible misclassification (Warshauer and Monk 1978). However, Pescosolido and 
Mendelsohn (1986) analyzed sets of independently collected suicide data for county groups, 
evaluating the impact of underreporting suicide on standard suicide correlates, and they found 
that the relationships among these variables and official suicide data were not appreciably 
altered in terms of direction and magnitude of effects as a result of underreporting. Thus, we 
argue that official suicide data are sufficiently accurate to permit analysis (Burr et al. 1999: 1069; 
Cutchin and Churchill 1999:103; Gibbs and Martin 1964; Kowalski, Faupel and Starr 1987:89; 
Marshall 1981). Following common practice (Burr et al. 1999:1056; Cutchin and Churchill 
1999:102; Kowalski, Faupel and Starr 1987:90), we use total suicide counts computed over a 
four-year period (1998-2001) to account for the relatively rare nature of suicide and to minimize 
the impact of annual fluctuations. 

Even after aggregating the data over a four-year period, suicide is a rare event and many 
cities have few incidents. Including cities with no or few suicides results in a heavily skewed 
distribution, which violates the assumptions of ordinary least squares regression. For this 
reason we employ an alternative approach discussed in Osgood (2000), which has been used 
in recent suicide (Burr et al. 1999) and homicide research (Kubrin 2003; Kubrin and 
Wadsworth 2003; Wadsworth and Kubrin 2004). We use suicide counts instead of rates and 
employ Poisson regression models. Given over dispersion in the data, the negative binomial 
variant of Poisson regression is used. As recommended by Osgood (2000:33), we include the 
log of the city’s young black and young white male populations as exposure variables 
(populations at risk) and constrain these coefficients to equal 1. Controlling for population size 
in this way is comparable to analyzing rates. 


Independent Variables 


We regress suicide counts on the structural characteristics of U.S. cities. Information on 
structural characteristics comes from the 2000 census. The characteristics chosen reflect 
both our interest in modeling Wilson's deindustrialization thesis and our attempt to 
incorporate key variables from the aggregate suicide literature. Structural variables specific to 
the black analyses include the number of blacks, the number of young black males ages 15- 
34, black mobility (percentage of blacks ages 5 and older who have moved in the past five 
years), racial segregation (Index of Dissimilarity), percentage of black male joblessness 
(percentage of black males ages 16 and older not working), black median family income, 
percentage of black poverty, percentage of black high school graduates, and percentage of 
black single female households with children under 18 years. There are also five measures 
of racial inequality that reflect black-white differences in education (ratio of percentage of 
whites to blacks who have graduated from high school for their respective populations aged 
25 and older), income (ratio of white to black median family income), unemployment (ratio of 
black to white employment rates), joblessness (ratio of black to white joblessness rates), and 
poverty (ratio of black to white poverty rates). Incorporating multiple forms of inequality 
represents an improvement over previous research that uses more limited measures (Burr et 
al. 1999: South 1984). Although Wilson’s argument underscores the role of absolute 
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deprivation, we include the inequality measures because they have been found to influence 
black suicide rates in previous research (Burr et al. 1999; South 1984). 

Structural variables specific to the white analyses include the number of whites, the 
number of young white males ages 15-34, white mobility (percentage of whites ages 5 and 
older who have moved in the past five years), percentage of white single female households 
with children under 18 years, percentage of white male joblessness (percentage of white 
males ages 16 and older not working), white median family income, percentage of white 
poverty, and percentage of white high school graduates. 

Finally, we also include non-race specific measures in the analyses: total population size, 
region of country (Northeast, Central and West with South as the reference category), 
percentage black, percentage Hispanic, and industrial composition (defined as the number of 
employed individuals working in durable and non-durable manufacturing industries divided by 
the number of employed individuals working in retail, wholesale and the manufacturing 
industries) (Wadsworth 2004). This measure reflects the proportion of jobs available to young 
males that are more likely to be stable, well-paid and that offer opportunities for 
advancement. We also include a measure that captures the change in industrial composition 
from 1980 to 2000 - the increase or decrease in stable, better paying jobs. These measures 
represent a key component of both the deindustrialization thesis and dual labor market theory. 

After running collinearity diagnostics, we determined that including many of these 
variables as independent predictors in the models would be problematic due to the high 
correlations between them. Guided by previous research (Burr et al. 1999) we performed 
factor analysis with varimax rotation separately for blacks and whites. For blacks, the factor 
analysis generated two indices that capture black disadvantage and racial inequality between 
blacks and whites. The following variables loaded strongly on the African American 
disadvantage index (factor loadings follow in parenthesis): percentage black male joblessness 
(.81), black median family income (-.88), percentage black poverty (.90), percentage black high 
school graduates (-.84), and percentage black single female households (.75). This factor had 
an eigen value of 5.42. The second index, racial inequality, with an eigen value of 1.61, is 
comprised of the ratio of white to black high school graduation rates (.61), ratio of white to 
black median family income (.66), ratio of black to white poverty rates (.80), ratio of black to 
white unemployment rates (.82), and the ratio of black to white joblessness rates (.78). Black 
disadvantage and racial inequality are used with total population, blacks, black mobility, white- 
black dissimilarity, northeast, central and west to predict young black male suicide.’ 

Factor analysis for whites produced similar results. The white disadvantage index had an 
eigen value of 3.32 with the following variables loading on the index: percentage white male 
joblessness (.83), white median family income (-.86), percentage white poverty (.85), 
percentage white high school graduates (-.85), and percentage white single female 
households (.67). This factor is used with total population, whites, white mobility, northeast, 
central and west to predict young white male suicide. 


Findings 


Descriptive Statistics 


Means and standard deviations for all variables are presented in Table 1. Average four-year 
suicide counts for young males were 8.8 for blacks and 23.2 for whites.’ Mean city population 
was 357,596 with an average of 20 percent blacks and 19 percent Hispanics. About 34 
percent of the cities were located in the western, 20 percent in the central, 34 percent in the 
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Table 1: Variable Names, Descriptions and Descriptive Statistics 





Mean S.D. 
Race-Specific Young Male Suicide Counts 
Young Black Male Suicide Counts, Ages 15-34 8.8 16.4 
Young White Male Suicide Counts, Ages 15-34 eae 24.5 
Non-Race-Specific Structural Variables 
Population of City: 357,596 681,005 
% Northeast (0 = No, 1 = Yes) 12 33 
% Central (0 =No, 1 = Yes) 20 40 
% West (0 = No, 1 = Yes) 34 48 
% South (0 = No, 1 = Yes) 34 47 
% Black in Population 20 19 
% Hispanic in Population 19 19 
Industrial Composition: percentage of workers in 
manufacturing relative to the number of retail, wholesale 
and manufacturing employees 43 11 
1980-2000 Percentage Change in Industrial Composition -3 
Structural Variables for Black Analyses 
Black Population 78,853 183,873 
Young Black Male Population 11,265 25,882 
Black Mobility: % that moved in the past 5 years 57 10 
Racial Segregation: white/black index of dissimilarity 46.45 17.14 
Variables Loading on Black Disadvantage Factor 
% Black Male Joblessness 42 10 
Black Median Family Income $35,589 $11,638 
% Black Poverty 24 9 
% Black High School Graduation 77 9 
% Black Single Female Households with Children 31 8 
Variables Loading on Racial Inequality Factor 
White/Black HS Graduation Rate Ts rio 
White/Black Median Family Income Nee 44 
Black/White Poverty Rate 2.70 84 
Black/White Unemployment Rate 2.49 83 
Black/White Joblessness Rate 1735 .28 
Structural Variables for White Analyses 
White Population 164,996 239,868 
Young White Male Population 24,495 34,593 
White Mobility: % that moved in the past 5 years 48 7 
Variables Loading on White Disadvantage Factor 
% White Male Joblessness 31 6 
White Median Family Income $58,503 $14,181 
% White Poverty 9 4 
% White High School Graduation 87 7 


% White Single Female Households 9 2 
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southern, and 12 percent in the northeastern states. The average percentage of employed 
individuals working in manufacturing industries was 43 percent and the average change in 
this measure from 1980 to 2000 was a decrease of 3 percent. 

Examining the variables for the black suicide analyses, cities had on average 78,853 blacks 
and 11,265 young black males. Forty-two percent of the black males were jobless, 77 percent 
of the black population had graduated from high school, 31 percent of black families with 
children under 18 were headed by a single female, and 57 percent of the black population had 
moved within the previous five years. Average median black family income was $35,589, and 
24 percent of black families were living in poverty. The average index of dissimilarity score 
was 46.45. Finally, the average values for the racial inequality variables were as follows: 
white/black high school graduation rate (1.13), white/black median family income (1.72), 
black/white poverty rate (2.70), black/white unemployment rate (2.49), and black/white 
joblessness rate (1.35). 

Turning to variables for the white analyses, the average city had 164,996 whites and 
24,495 young white males, a 31 percent rate of white male joblessness, and a white family 
poverty rate of 9 percent. On average, 87 percent of whites had graduated from high school, 
48 percent of the white population had moved in the past five years, and 9 percent of the 
families with children under 18 were headed by single females. The average median income 
for white families was $58,503. 


Structural Characteristics and Suicide among Young Black Males 


Table 2 displays the negative binomial regression results revealing the structural factors 
associated with young black male suicide. Other than being located in the central states, 
black disadvantage — comprised of joblessness, median family income, poverty, high school 
graduates and single female households — is the only significant predictor. Cities with greater 
levels of concentrated disadvantage among blacks experience greater numbers of young 
black male suicides, a finding consistent with Wilson's argument. While disadvantage and 
segregation often have similar effects on black urban ills, the latter was not significant in this 
analysis, consistent with Almgren et al. (1998:1489) and Burr et al. (1999:1065). Racial 
inequality and mobility also are not related to young black male suicides in large U.S. cities. 

As discussed above, concentrated disadvantage is not randomly distributed — it is the result 
of macro-economic forces, such as industrial composition, and the influence these forces have 
on local economies. To explore static and dynamic effects of industrial composition on more 
proximate predictors of suicide, we regressed the black disadvantage factor on the two 
industrial composition measures as well as other exogenous structural characteristics. Table 
3 displays the results from this analysis. The findings show that industrial composition and 
changes in industrial composition significantly influence black concentrated disadvantage. 
Cities dominated by individuals employed in manufacturing or “good” jobs, compared to less- 
desirable service sector jobs, experienced lower rates of poverty, joblessness, female-headed 
households and higher income and educational attainment, which in turn, decreased the 
number of suicides among young black males. Moreover, cities that experienced an increase 
in the percentage of employed individuals working in manufacturing jobs from 1980-2000 also 
enjoyed lower levels of concentrated disadvantage.® Black disadvantage was additionally 
influenced by percentage of the population that was black and Hispanic, as well as geographic - 
location, with the northeast and central cities experiencing greater disadvantage levels and the 
West experiencing lower levels. 

For comparison purposes we performed similar analyses for whites. First we regressed 
young white male suicide counts on city-level variables and then we examined the influence 
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Table 2: Predictors of Young Black Male Suicide Counts" 








Variables Regression Results 
138* 
Black Disadvantage (.070) 
031 
Black/White Inequality (.045) 
-.006 
White/Black Dissimilarity (.005) 
-.291 
Black Mobility (.680) 
.063 
Population (LN) (.076) 
-.023 
Blacks (LN) (.071) 
-.137 
Northeast (.141) 
.205* 
Central (.094) 
.148 
West (.127) 
-7.269*** 
Constant (.734) 
# Young Black Males (exposure) 
X’ 
20.60 
P 000 
-2 LL 412.132 


opo.0s  *p<.0l'-” **p<.00l 
* Entries are unstandardized coefficients followed by 
standard errors in parentheses. 
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Table 3: Predictors of Black Disadvantage* 





Variables Regression Results 


Industrial Composition 


1980-2000 Change in 
Industrial Composition 


Population (LN) 


% Hispanic (LN) 


% Black 


Northeast 


Central 


West 


Constant 


Adjusted R square = .39 


-.207** 
1.977 
(583) 


-166* 
-2.578 
(1.173) 


-039 
-.05 
(.084) 


259" 
216 
(.067) 


009" 
2.677 
(.431) 


373" 
1.107 
(.224) 


211% 
515 
(.186) 


-.184* 
-385 
(170) 


1.428 
(1.143) 





es p<.0l 
n 


tries are eee coefficients lowed by 
unstandardized coefficients and standard error’: in 


parentheses. 
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Table 4: Predictors of Young White Male Suicide Counts’ 








Variables Regression Results 
et 
White Disadvantage (.034) 
-.853 
White Mobility (.495) 
-.047 
Population (LN) (.072) 
077 
Whites (.076) 
~o24*** 
Northeast (.116) 
-.170* 
Central (.081) 
054 
West (.064) 
-6.701*™ 
Constant (.482) 
# Young Black Males (exposure) 
2 
S 157.65 
P 000 
saLk 618.743 
< .05 p<.01 ae 


ntries are tian Es soefilcients f followed by 
lass errors in parentheses. 
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Table 5: Predictors of White Disadvantage* 


Variables Regression Results 


.026 
241. 
Industrial Composition (.623) 


«096 


1980-2000 Change in -1.502 
Industrial Composition (1.253) 


-.186** 
-.254 
Population (LN) (.090) 


.266 
.223 
% Hispanic (LN) (.072) 


439" 
2.325 
% Black (461) 


.458*** 
1.367 
Northeast (.239) 


eo. 
198 
Central (.199) 


.089 
187 
West (.182) 


2 05e: 
Constant (1.222) 





Adjusted R square = .31 





SP<05 | p<ol, + p< .001 

‘Entries are standardized coefficients followed by 
unstandardized coefficients and standard errors in 
parentheses. 
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of industrial composition and other exogenous variables on disadvantage.° The results for the 
first regression are displayed in Table 4. For the most part, the findings for whites are identical 
to those for blacks. In particular, disadvantage — comprised of joblessness, median family 
income, poverty, high school grads and single female households — significantly increased 
young white male suicides.’ 

While the findings for blacks and whites look quite similar when suicide counts are 
regressed on the structural characteristics of cities, when we examine the influence of 
industrial composition on white disadvantage the findings point to an important divergence. 
The results of this analysis are displayed in Table 5. Unlike for blacks, neither industrial 
composition measure has an effect on white disadvantage and thus cannot be linked to 
patterns of suicide among young white males. To confirm the divergent effects of the 
industrial composition measures on race-specific deprivation we used the equation t = b:— bz 
/ + (SEb? + SEb#) (Paternoster et al. 1998) to test for significant differences between the 
coefficients across the samples. The results demonstrate that the observed difference is 
statistically significant for the static (t = -2.48, p < .01)-but not the dynamic measure. With the 
exception of population, which significantly decreases white, but has no effect on black 
disadvantage, all other variables have similar effects across the race-specific samples. 


Discussion and Conclusion 


Collectively, these findings suggest that the deindustrialization thesis, as outlined by Wilson 
(1987, 1996) and others (Anderson 1999; Crutchfield 1987; Wadsworth 2000; 2004), is quite 
useful for explaining city-level variation in young black male suicide. The results demonstrate 
that just as black concentrated disadvantage has been shown to increase violent crime and 
other social ills, it also significantly increases suicide among young black males. As a lack of 
social integration may be at the root of these social pathologies this, perhaps, should not be 
a surprising finding. Over a century ago, Durkheim (1897) argued that integration was 
inversely related to the suicide rates of European countries and states. Building upon Wilson's 
thesis, we suggest that the same can be said for contemporary American cities. In the current 
work, we propose that joblessness, poverty, educational failure and family disruption are 
important indicators of social integration (or a lack thereof) in African American urban 
communities. These factors influence, and are in part the result of, the degree to which young 
black males are able to successfully establish age-appropriate pro-social roles and positions. 
When conditions hinder their ability to do so, research finds that violent behavior, drug abuse 
and now suicide, are more likely. 

Apart from regional location, none of the other variables we examined influenced young 
black male suicide. Previous research suggested the potential importance of racial inequality 
(Burr et al. 1999) yet it was not a significant predictor in our models. The competing relevance 
of inequality vs. absolute deprivation has a long history in criminology with some evidence that 
both are important covariates of violent crime (Blau and Blau 1982; Jacobs and Wood 1999; 
Messner and Golden 1992: Wadsworth and Kubrin 2004). While we do not claim to have 
resolved the issue for the study of suicide, our analysis uses well-established measures to 
represent both concepts. The results suggest that concentrated deprivation, reflecting 
economic and educational disadvantage as well as family disruption, matters while inequality 
does not. Why might this be? Drawing on the ideas of Durkheim as well as others who stress 
the importance of inequality (Blau and Blau 1982; Jacobs and Wood 1999), race-based 
disparity in resource allocation may act to unify an ill-treated group in the face of a common 
enemy and thus act as an integrating, rather than alienating, force. If this is the case, we would 
expect inequality to have an inhibitive, not aggravating, effect on young black male suicide — 


1574 © Social Forces Volume 84, Number 3 * March 2006 


similar to the explanation Durkheim offered for the low suicide rate among European Jews at 
the end of the 19" century. Along these lines, previous research reporting inequality as a 
significant predictor of young black male suicide may have suffered from missing variable bias. 
As high levels of inequality correspond with disadvantage (the two indicators are related), 
inequality may appear to be influential when key indicators of cumulative disadvantage are 
excluded from the model. 

In extending Wilson's thesis we also Suggested that residential segregation may 
contribute to the isolation of black communities and thus increase suicide. The results did not 
support this proposition, a finding consistent with Almgren et al. (1998) and Burr et al. (1999). 
However, we believe this is understandable given a more nuanced reading of the 
deindustrialization thesis. Residential segregation is an important result of shifts in labor 
market opportunities. Neighborhoods that are segregated — and thus more homogenous with 
respect to employment opportunities and educational achievement — offer fewer informal 
social networks that can be utilized to access jobs, housing and other resources. In other 
words, residential segregation contributes to the cycle of concentrated poverty and 
deprivation by limiting access to tools for social mobility. However, once poverty and 
disadvantage are considered, segregation may not have a direct effect on levels of 
community integration, and therefore would not influence suicide. 

While Wilson's thesis focused specifically on black communities, it is important to note his 
argument that the primary causes of urban social ills are racially invariant and can be traced to 
the structural conditions of localities. (See also Sampson and Wilson 1995.) In their study of 
extreme disadvantage and violent crime, Krivo and Peterson (1995) found support for this 
argument and our findings also support this contention as it relates to internalized violence. 
The structural characteristics that influenced suicide among young white males were almost 
identical to those that influenced suicide among young black males — concentrated 
disadvantage and location in the central states. 

Wilson’s thesis, however, goes beyond the role of deprivation in determining the 
development and persistence of social and psychological pathologies. He identified 
deindustrialization as the root cause of concentrated poverty, joblessness, educational failure 
and single-parent households, all of which give rise to, and maintain a variety of, social ills. 
Wilson argued that the process of deindustrialization was especially detrimental to black 
communities due to their historical involvement in manufacturing industries, lower rates of 
educational achievement, higher levels of poverty and residential concentration in central 
cities. While it is not possible to measure the process of deindustrialization with cross- 
sectional data, following previous research (Crutchfield 1989; Wadsworth 2004) we measured 
the main outcome of the process — industrial composition and changes in industrial 
composition. When examining black deprivation, Wilson's claim is supported. Cities with 
higher percentages of employed individuals working in manufacturing industries (compared 
to retail and wholesale industries) and cities that experienced an increase in percentage from 
1980 to 2000 had significantly lower levels of concentrated black disadvantage, which, in turn, 
resulted in fewer suicides among young black males. Also consistent with Wilson's thesis, 
urban industrial composition was not found to affect white disadvantage. 

This study makes a number of important contributions. Foremost, while much of the 
suicide literature has focused on individual-level predictors, we have demonstrated the 
importance of concentrated disadvantage in explaining suicide across a large sample of cities. 
In so doing, we have linked the social integration/regulation thesis with the work of those who 
focus on contemporary issues in disadvantaged, urban, and often minority neighborhoods. 
This connection both theoretically and empirically extends our understanding of the structural 
correlates of suicide. Second, consistent with the arguments of criminologists, we have 
demonstrated that the impact of these structural forces is racially invariant. The macro-level 
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covariates of suicide rates are virtually identical for black and white young males. Lastly, by 
extending our analysis to focus on how industrial composition influences the structural 
covariates of suicide, we have demonstrated the ability of Wilson's thesis to help explain a 
pressing social problem — young black male suicide. 

The current study also raises some important issues to be addressed in sire research. 
One issue concerns the dynamic process of deindustrialization. With this process the 
economic base and labor market structure of the U.S. economy has shifted significantly. The 
outcomes of deindustrialization, such as concentrated disadvantage, are also dynamic as they 
continue to respond to a variety of social forces. The most appropriate methodological 
approach for studying the effects of this process is longitudinal in nature. Our study takes a 
step in this direction by modeling changes in industrial composition, but a time series analysis 
would help specify the exact nature of these relationships. A more thorough examination of 
the dynamic relationship between industrial composition, disadvantage and suicide is 
necessary for furthering our understanding of the social forces of suicide. 

An interesting question has to do with overall levels of suicide for blacks and whites. If 
disadvantage increases young male suicide, and blacks experience higher levels of 
disadvantage, why are white suicide rates, despite a drastic increase in black rates over the 
past two decades, still higher? Researchers have offered a variety of responses to this 
question, many of which point back to the role of integration. Aside from disadvantage, there 
are other factors that may encourage community integration and create or reinforce norms 
against suicide. Scholars have suggested that extended family networks, church participation 
and community involvement historically have mitigated the anomic effect of disadvantage in 
black communities (Gibbs 1997). More than 20 years ago, Davis (1980:228) wrote: 


For blacks, the stresses and anxieties that might lead to suicide have 
often been offset by strong family and communal ties. Effectively 
denied all other mechanisms to compensate for rejection and abuse, 
blacks have in the past used their families, communities, and 
institutions (i.e., churches, social clubs, fraternal organizations, etc.) to 
develop positive and functional forms of response to recurrent 
stressful social situations. 


In their interviews with black church leaders in a southeastern city, Early and Akers (1993) 
found that many black pastors viewed suicide as a “white thing” foreign to black culture. 
Interviewees suggested that this cultural message stemmed from a history of religion, 
oppression, hardship and moral values. Some went so far as to suggest that blacks who did 
commit suicide were those more integrated into white culture — a view taken by some 
scholars as well (Davis 1980; Jedlicka et al. 1977; South 1984). Consistent with these 
findings, Ellis and Range (1991) discovered that black college students scored significantly 
higher in certain sections of the Reason For Living Inventory (Linehan et al. 1983); specifically, 
blacks more strongly endorsed the importance of “survival and coping” and “moral values” as 
reasons to live. 

While Wilson suggests that integrative institutions such as church, extended family and 
social organizations are declining as a result of deindustrialization, it may be possible that 
their influence is still stronger in black communities than in white communities, or that the 
cultural messages they have created among blacks have survived (or have declined more 
slowly) resulting in lower suicide rates. Examining these inhibiting factors, and the degree 
to which they are influenced by structural forces, is clearly an important avenue for future 
research. 


1576 ° Social Forces Volume 84, Number 3 * March 2006 


Notes 


1. It is important to recognize the comprehensive literature on integration and suicide that 
exists at the individual level. Studies by Stack (1990; 1996), Watt and Sharp (2002), and 
Perez-Smith et al. (2002) — to name just a few — all show support in varying ways for the 
social integration-regulation thesis. These and other individual-level studies are not 
reviewed in the paper given its aggregate focus. 


2. We checked for outliers in the regression analyses and deleted two cases (cities) with 
standardized residuals more than three standard deviations from the mean: El Monte, CA 
and Overland Park, KS. We reran the analyses without those cases and the results did not 
change. We also examined other influence statistics (e.g., Cook's D and Standardized 
DFBETAs), which did not indicate any problematic outliers. We therefore retain all cases 
for the analyses. 


3. A remaining issue has to do with potential collinearity among the racial inequality and 
disadvantage factors — two economic measures. Correlations between the components 
that make up each factor, however, are not high suggesting that collinearity is not a 
problem. To further verify this, we ran the regressions and obtained collinearity 
diagnostics. Based on VIF scores for the racial inequality and disadvantage measures, 
there is no evidence of collinearity (no VIF score was above 2.0). 


4. Based on the four years of data, the average annual rate of young male suicide per 
100,000 is 21.75 for blacks and 25.75 for whites. 


5. We also examined whether industrial composition had a direct effect on suicide counts, 
or an indirect effect that was mediated by disadvantage. The findings did not support 
either of these possibilities. 


6. As there are no cities in the United States where whites are more disadvantaged than 
blacks (Sampson and Wilson 1995:42) and no theoretical rationale for why racial inequality 
would influence suicide among the more advantaged group, the racial inequality variable 
is not included in the white analyses. 


7. Since Wilson's argument stresses the role of acute disadvantage, we determined whether 
the relationship between disadvantage and suicide was nonlinear for blacks and whites. 
First, we created a multiplicative term for disadvantage (disadvantage* disadvantage) and 
included it in the regression analyses along with the disadvantage factor. Second, we 
created a dummy variable that captured whether each city was at the extreme of 
disadvantage (i.e., in the top 20%). Adding either variable did not significantly improve the 
model for blacks or whites. 


References 
Akers, Ronald L. 1985. Deviant Behavior: A Social Learning Approach. Wadsworth Publishing. 


Almgren, Gunnar, Avery Guest, George Immerwahr and Michael Spittel. 1998. ‘Joblessness, Family 
Disruption, and Violent Death in Chicago, 1970-90." Social Forces 76:1465-93. 


Anderson, Elijah. 1999. Code of the Street: Decency, Violence, and the Moral Life of the Inner City. WW. 
Norton and Company. 


Disadvantage and Suicide among Young Black Males * 1577 
Anderson, Robert N., and Betty L. Smith. 2003. Deaths: Leading Causes for 2001 52:1-86. Centers for 
Disease Control, Division of Vital Statistics. 


Blau, Judith R., and Peter M. Blau. 1982. “Metropolitan Structure and Violent Crime.” American Sociological 
Review 47:114-128. : 


Bruce, Marino, Vincent Roscigno and Patricia L. McCall. 1998. “Structure, Context, and Agency in the 
Reproduction of Black-on-Black Violence.” Theoretical Criminology 2:29-55. 


Burr, Jeffrey A., John T. Hartman and Donald W. Matteson. 1999. “Black Suicide in U.S. Metropolitan Areas: 
An Examination of the Racial Inequality and Social Integration-Regulation Hypotheses.” Socia/ 
Forces 77:1049-81. 


Centers for Disease Control (CDC). 1998. Suicide Among Black Youths — United States, 1980-1995. MMWR 
1998 47(10):193-206. 


Crutchfield, Robert D. 1989. “Labor Stratification and Violent Crime.” Socia/ Forces 68:489-512. 


Cutchin, Malcolm P, and Robert R. Churchill. 1999. “Scale, Context, and Causes of Suicide in the United 
States.” Social Science Quarterly 80:97-114. 


Davis, Robert. 1980. “Suicide Among Young Blacks: Trends and Perspectives.” Phylon 41:223-229. 
Durkheim, Emile. 1897. Suicide: A Study in Sociology. 1951 Reprint. The Free Press. 


Early, Kevin E., and Ronald L. Akers. 1998. “It's a White Thing’: An Exploration of Beliefs About Suicide in the 
African-American Community.” Deviant Behavior 14:277-296. 


Ellis, Jon B., and Lillian M. Range. 1991. “Differences Between Blacks and Whites, Women and Men, In 
Reasons for Living.” Journal of Black Studies 21:341-347. 


Ellison, Christopher G., Jeffrey A. Burr and Patricia L. McCall. 1997. “Religious Homogeneity and 
Metropolitan Suicide Rates.” Social Forces 76:273-299. 


Gibbs, Jewelle Taylor. 1988. “Conceptual, Methodological, and Sociocultural Issues in Black Youth 
Suicide: Implications for Assessment and Early Intervention.” Su/cide and Life-Threatening 
Behavior 18:73-89. 


Gibbs, Jewelle Taylor. 1997. “African-American Suicide: A Cultural Paradox.” Suicide and Life-Threatening 
Behavior 27:68-79. 


Gibbs, J.F, and WT. Martin. 1964. Status /ntegration and Suicide. University of Oregon Press. 


Jacbos, David, and Katherine Wood. 1999. “Interracial Conflict and Interracial Homicide: Do Political and 
Economic Rivalries Explain White Killings of Blacks or Black Killings of Whites? American Journal 
of Sociology 105:157-190. 


Jedlicka, Davor, Yongsock Shin and Everett S. Lee. 1977. “Suicide Among Blacks.” Phylon 38:398-407. 


Kowalski, Gregory S., Charles E. Faupel and Paul D. Starr. 1987. “Urbanism and Suicide: A Study of American 
Counties.” Social Forces 66:85-101. 


Krivo, Lauren J., and Ruth Peterson. 1996. “Extremely Disadvantaged Neighborhoods and Urban Crime.” 
Social Forces 75:619-48. 


Kubrin, Charis E. Kubrin. 2003. “Structural Covariates of Homicide Rates: Does Type of Homicide Matter?” 
Journal of Research in Crime and Delinquency 40:139-170. 


Kubrin, Charis E., and Tim Wadsworth. 2003. “Identifying the Structural Correlates of African-American 
Killings: What Can We Learn from Data Disaggregation?” Homicide Studies 7:3-35. 


1578 © Social Forces Volume 84, Number 3 * March 2006 


Kubrin, Charis E., and Ronald Weitzer. 2003. “Retaliatory Homicide: Concentrated Disadvantage and 
Neighborhood Culture.” Socia/ Problems 50:157-180. 


Linehan, M., J. Goodstein, S. Nielsen and J. Chiles. 1983. “Reasons for Staying Alive When You are Thinking 
of Killing Yourself: The Reasons for Living Inventory.” Journal of Consulting and Clinical Psychology 
51:276-286. 


Maris, Ronald W. 1969. Socia/ Forces in Urban Suicide. Dorsey Press. 


Marshall, J. 1981. “Political Integration and the Effect of War on Suicide in the United States, 1933-76." Socia/ 
Forces 59:771-785. ; 


Messner, Steven F, and Reid M. Golden. 1992. “Racial Inequality and Racially Disaggregated Homicide 
Rates: An Assessment of Alternative Theoretical Explanations. Criminology 30:421-447. 


Osgood, D. Wayne. 2000. “Poisson-Based Regression Analysis of Aggregate Crime Rates." Journal of 
Quantitative Criminology 16:21-43. 


Ousey, Graham C. 2000. “Deindustrialization, Female-Headed Families, and Black and White Juvenile 
Homicide Rates, 1970-1990.” Sociological Inquiry 70:391-419. 


Paternoster, Raymond, Robert Brame, Paul Mazerolle and Alex Piquero. 1996. “Using the Correct Statistical 
Test for the Equality of Regression Coefficients.” Criminology 36:859-866. 


Pattillo-McCoy, Mary. 1999. Black Picket Fences: Privilege and Peril Among the Black Middle Class. 
University of Chicago Press. 


Perez-Smith, Alina, Anthony Spirito and Julie Boergers. 2002. “Neighborhood Predictors of Hopelessness 
Among Adolescent Suicide Attempters: Preliminary Investigation.” Suicide and Life-Threatening 
Behavior 32:139-145. 


Pescosolido, Bernice A., and Robert Mendelsohn. 1986. “Social Causation or Social Construction of 
Suicide? An Investigation into the Social Organization of Official Rates.” American Sociological 
Review 51:80-101. 


Sampson, Robert J. 1987. “Urban Black Violence: The Effect of Male Joblessness and Family Disruption.” 
American Journal of Sociology 93:348-82. 


Sampson, Robert J., and William Julius Wilson. 1995. “Toward a Theory of Race, Crime and Urban 
Inequality.” Pp. 37-54. Crime and Inequality. John Hagan and Ruth D. Peterson, editors. Stanford 
University Press. 


Shihadeh, Edward, and Graham Ousey. 1998. “Industrial Restructuring and Violence: The Link Between 
Entry-Level Jobs, Economic Deprivation, and Black and White Homicide.” Social Forces 77:185-206. 


South, Scott J. 1984. “Racial Differences in Suicide: The Effect of Economic Convergence.” Social Science 
Quarterly 65:172-180. 


Stack, Steven. 1990. “New Micro-Level Data on the Impact of Divorce on Suicide, 1959-1980: A Test of Two 
Theories.” Journal of Marriage and the Family 52:119-127. 


Stack, Steven. 1996. “The Effect of Marital Integration on African American Suicide.” Suicide and Life- 
Threatening Behavior 26:405-414. 


. 1997. “Suicide in Memphis.” Suicide ‘97. J. Mcintosh, editor. Washington, D.C.; AAS:150-153. 


Stafford, M.C., and Jewelle PR Gibbs. 1985. “A Major Problem with the Theory of Status Integration and’ 
Suicide.” Socia/ Forces 63:643-660. 


Stockard, Jean, and Robert M. O'Brien. 2002. “Cohort Variations and Changes in Age-Specific Suicide Rates 
over Time: Explaining Variations in Youth Suicide.” Socia/ Forces 81:605-642. 


Disadvantage and Suicide among Young Black Males + 1579 
U.S. Bureau of the Census. 2002. Historical Poverty Tables, 1959-2000. Washington, D.C.: Government 
Printing Office. [http:/Avww.census.gov/hhes/poverty/histpov/ hstpov2.html] 


Wadsworth, Tim. 2000. “Labor Markets, Delinquency, and Social Control Theory: An Empirical Assessment 
of the Mediating Process.” Socia/ Forces 78:1041-1066. 


- 2004. “Industrial Composition, Labor Markets, and Crime.” Sociological Focus 37: 1-24. 


Wadsworth, Tim, and Charis E. Kubrin. 2004. “Structural Factors and Black Interracial Homicide: A New 
Examination of the Causal Process.” Criminology 42: 641-666. 


Warshauer, Ellen M., and Mary Monk. 1978. “Problems in Suicide Statistics for Whites and Blacks.” 
American Journal of Public Health 68:383-88. 


Watt, Toni Terling, and Susan F Sharp. 2002. “Race Differences in Strains Associated with Suicidal Behavior 
Among Adolescents.” Youth and Society 34:232-256. 


Willis, Leigh A., David W. Coombs, William C. Cockerham and Sonja L. Frison. 2002. “Ready to Die: A 
Postmodern Interpretation of the Increase of African-American Adolescent Male Suicide.” Socia/ 
Science and Medicine 55:907-920. 


Wilson, William Julius. 1987. The Truly Disadvantaged: The Inner City, the Underclass, and Public Policy. 
University of Chicago Press. 


. 1996. When Work Disappears: The World of the New Urban Poor. Vintage Books. 




























eyes foe ae enmaee Aja aes ok ata neat ad 
Ppa Ait ci pL pO 


Neate. mariage wanes 1 SS : 
Chie aye. yale 2 er 


, Bar 
a>,» mye ncnryaeanee "sant ben ce a iO NA i ; 
: Pe tm) eres i 
et a sient al Testi Pome tine att eh ncatee’ We NR Lenn son cane 
ok ty OAS TPA itT, Morne: Vn pa ene, 
weg Or in 
garenh brett! nd ferteeiel wise ‘A t eee: wih aren YON 0G, ‘avi senna 
vary yt CANT, APOE GL DN » ga Rea ea Ra Gi 
Sea. (tA itive = ‘magia il Pra Wate CPT eas, <A 
OiETF i eae SONe £220 ohn Seo brett) dog? SOG aut A 
Deed *. vee 2e a hle Gang SOAR a 
net rs: i ee ease) ai 
Katte oat OO? | dred ienoetetnt J Ne scored te roa ‘ates 
tans. WR WET Las bead Sea eCemuaah be Seat mel = 


et 


_ 






Ke ao, ' me oa | PMO err tee eater? 5 oa 
neantianatiowre josey haeh eA ae 7 moet : 
Wats nits 271d Meee eae ) Aang hy geval . et 


Ha, te Nea See Ca A Pa ae tein saa) 


erie tiy SL pay 


~ 


Send pene dee coat ad 1 Sal pos (oe. OD ince Perea ‘ ' 
Rey, ius _ ‘ tO CLA OA ae Smee: Garb oe ’ _ 
Shue 19s) tee vs ene 


‘tashaG are . oe ir. ? Mag a@ "as \Ete? » 
ached to oils. e ae beta Ong eeeihar er Oras 


Wine Yee . 

Atypon i 255.5 OS Vie Gerk ee ~ vee Nice in ee = 
an rh ae ye Nes ur of) a4 

a vt oo ee Kies “bere sie: Veet & Hearse 7 Th 


Sie te re JO ie ond Ganeet Nia er iieaee i) 
Pan ss th 

Ares, ides 2 Liteon ity 
(yn pe Vie Pe 


bet ap! he Va: 18 aha ins ies 
. set Y, i? — ie 
PRS ene TH “OS Nee ae coum 
: Peck ' Rima eae 


fa A 


daa iy O a al A 
, ie nae if ay a 
et, ae oan eae 


“= a 7 


Racial Differences in Congregation-based Political Activism 


R. Khari Brown, Wayne State University 





Abstract 

This study employs a resource mobilization model to explain racial differences in congregation- 
based political activism. The fewer resources (i.e., members, income, clergy leadership, civic ties) 
that black congregations possess relative to white congregations largely accounts for racial 
differences in congregation-based lobbying and protest politics — forms of political activism that 
exact relatively high resource costs upon political activists. That is, black congregations become 
more heavily involved than white congregations in lobbying and protest politics when they have 
a resource capacity similar to that of white congregations. Despite their relatively few resources, 
black congregations are, on average, more likely than are white congregations to involve 
themselves in voter registration efforts. This finding has much to do with the heightened social- 
political expectations of African American congregants concomitant with the relatively low 
resource cost of voter registration programs. 


Introduction 


This study assesses the applicability of resource mobilization theory to explain racial 
differences in the level and type of political activism in which a national representative sample 
of predominantly black and predominantly white American religious congregations engage. 
For the purpose of this paper, political activism refers to any activity that has the intent to 
influence government action. This may occur directly in attempts to affect legislative decision- 
making such as lobbying or protesting, or it may-occur indirectly by influencing the selection 
of people who make policy such as in voter-registration efforts (Verba, Scholzman and Brady 
1995). To be clear, Christian, Jewish and Islamic religious clergy in the United States are 
primarily involved in employing their institutional resources to reinforce three interrelated and 
important goals: a belief in God, a search for truth and a spiritual connectedness to one’s 
creator through collective worship experiences (Marty 2004). Nonetheless, as the voluntary 
association in which Americans are most likely to hold membership (Hodgkinson et al. 1996), 
congregations play a key role in providing spaces for concerned citizens to plan and act upon 
political strategies aimed at influencing public policy (Billingsley 1999; Lincoln and Mamiya 
1990; McAdam 1982; Morris 1984; Smith and Smidt 2003; Wald 2004). To the extent that 
congregational bodies express a conviction that it is right and necessary to involve 
themselves in what may be designated as worldly affairs and if they have the resources to 
do so, congregations tend to act on their collective political passions. 

In assessing the applicability of resource mobilization theory to the political behavior of black 
and white religious congregations, the current study first provides a cursory description of 
resource mobilization theory. It then discusses the importance of expanding the application of this 
theory to our understanding of the political behavior of a broad section of religious congregations 
in this country. This paper then moves to summarize the religion and politics and the resource and 
congregation-based civic activism literatures. And finally, this study relies upon the 1998 National 
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Congregational Study to empirically test the relevance of congregational resources to the 
electoral, lobbying, and protest behavior of black and white religious congregations. 


Theories 


Resource Mobilization Theory 


Resource mobilization theory suggests that the emergence and success of social movements 
are contingent upon aggrieved groups possessing a variety of resources, their linkages with 
other groups and their dependence upon external support (Cuzan 1990; McAdams 1982; 
McCarthy and Zald 1977; Morris 1984; Williams 2002). By empirically maintaining the 
importance of resources to social movement emergence, resource mobilization theorists 
refute classical social movement theories, which maintain that movement emergence is 
linked to the accumulation of psychological strain brought on by social discontent (McAdam 
1982; Morris 1984). To the contrary, resource mobilization theory suggests that discontent is 
more or less constant among aggrieved groups. It is the recognition of political opportunities 
and the availability of politically relevant resources to mobilize for insurgent activities that 
allow social movements to take form (McAdam 1982; Morris 1984). As such, social groups 
with well-developed internal social institutions and organizations that provide them with 
effective communication networks, experienced leadership and social resources are in the 
best position to formulate demands and coordinate political strategies to influence public 
opinion and legislative decision-making (McAdams 1982; McCarthy and Zald 1977; Morris 
1984; Williams 2002). The mobilization of social resources, such as money, labor, and civic 
skills are critical to the emergence and goal attainment of social movements (McAdams 1982; 
McCarthy and Zald 1977; Morris 1984; Williams 2002). 

Resource mobilization theory also maintains that movement success is contingent upon 
the involvement of out-group individuals and organizations, perhaps even splintering factions 
within governments that possess inside access to policy elites, influence over policy 
decisions and money to finance the movement itself (Cuzan 1990; McAdams 1982; McCarthy 
and Zald 1977; Morris 1984). Aldon Morris (1984), however, parts company with resource 
mobilization theorists in maintaining that the organizational resources of indigenous groups 
are critically more important to the early stages of social movements than are their out-group 
coalitional partners. This is largely because coalitions with out-groups are often established 
only after the movement has accrued some level of success in influencing public opinion, 
debate and policy. Whether or not out-group external resources are more or less important 
at the beginning of social movements arguably depends upon the special circumstances that 
give rise to a given movement (Cuzan 1990). Nonetheless, having more indigenous 
organizational and external resources places aggrieved groups in a better position to launch 
and sustain movements as well as to achieve their political goals (Cuzan 1990; McAdams 
1982; McCarthy and Zald 1977; Morris 1984; Williams 2002). 

Aldon Morris’ (1984) and Doug McAdam’s (1982) usage of a resource mobilization 
framework to explain the emergence of the black Protestant-led Civil Rights Movement of the 
mid-20" century is arguably the best example of the importance of resources to the 
emergence and success of religious-based social movements. They maintain that economic 
modernization during the 1910s through the 1960s contributed to blacks emigrating from 
Southern rural towns to urban metropolises where they increasingly joined larger churches 
with greater volunteer reserves and financial resources (Morris 1984). This reality contributed 
to the emergence of black congregations increasingly becoming financially independent from 


Racial Differences in Congregation-based Activism * 1583 


white churches and civic elites (McAdam 1982; Morris 1984). In turn, Southern blacks were 
able to groom their own indigenous leaders and form race-based organizations, such as the 
Southern Christian Leadership Conference, to coordinate their activities as they attempted to 
tear down the walls of oppression (McAdam 1982; Morris 1984). These developments 
positioned Southern black religious communities in a better structural position to form 
Coalitions with like-minded white Christian and Jewish organizations as they acted on their 
grievances. Ultimately, Morris (1984) argues that the Civil Rights Movement was successful 
because it relied on pre-existing churches with established leaders, volunteers and 
organization that were able to engage in rational planning and to establish strategic alliances 
while fostering a religious culture of social activism among participants. 


Expanding the Breadth of Resource Mobilization Theory 


The application of a resource mobilization theoretical model to the current study, however, is 
not a clean fit. This is largely because this investigation employs a survey methodological 
approach to assess the effect of congregational resources on the political activism of black 
and white religious congregations at one point in time. Therefore, this study is unable to 
measure congregation-based resource gains over time, how political challengers have 
influenced political strategies, or the goal attainment of politically active religious 
congregations as Morris (1984), McAdams (1982), and Williams (2002) have cogently been 
able to do. Even so, there is room for expanding the application of resource mobilization 
theory to explain the political behavior of a broader cross-section of religious congregations. 
To date, the empirical work utilizing resource mobilization theories to explain the political 
behavior of religious congregations have primarily been regionally specified to the South, 
time-bound to the modern Civil Rights Movement, racially specified to black congregations, 
and focused on protest-oriented political strategies (McAdams 1982; Morris 1984; 1993; 
Williams 2002). These studies have also largely employed archival and qualitative research 
designs. The substantive focus and methodological approach of these studies are sensible, 
given that the Civil Rights Movement was arguably the single most important insurgency 
movement of the 20" century. The lessons learned from this movement largely informed the 
political strategies used in the subsequent Poor People’s Movement (Piven and Cloward 
1977), AntiWar Movement (Shapiro 2000), Women’s Movement (Freeman 1971) and others. 
The impact of a congregation's organizational resources (e.g., money, competent leadership, 
labor, civic skills) and its ties to other civic organizations on its capacity to engage in political 
activity is unclear. The current study attempts to make a positive contribution to the resource 
mobilization literature by empirically clarifying the relevance of resources to a broad cross- 
section of black and white American religious congregations. This study's reliance on 
multivariate analytical techniques allows for the investigation of the independent effects of 
varying congregation-based organizational and external resources on the electoral, lobbying and 
protest activities of a national representative sample of predominantly black and predominantly 
white religious congregations. This analytical strategy may allow for a clearer understanding of 
why black and white religious congregations choose to participate in certain political acts rather 
than others. It may also point to the challenges that prevent congregations from acting on their 
social-political missions. That is, this study's analytical strategy may help explain why black 
congregational bodies express a greater interest in political activism (Pew 2001; Newport, 
Ludwig and Kearney 2001), but may not be any more involved than white congregations. The 
answer may be found in the unequal distribution of resources among black and white 
congregations. This study now turns to the literature review on religion and politics and on 
congregational resources and church-based civic activism. 
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Religion and Political Activism 


Few national studies have assessed factors that influence the political behavior of black and 
white religious congregations. The vast majority of national studies assessing the relationship 
between religion and politics have focused on the individual as the unit of analysis. These 
studies largely suggest that frequent church attendance increases the chances of exposure 
to political discussions and opportunities to participate in political activities (Brown and Brown 
2003; Calhoun-Brown 1996; Harris 1999). To the extent that congregants are exposed to 
social-political messages in church by clergy and fellow congregants, there is an increased 
chance that they will participate in voting and non-voting political activities such as campaign 
work, petition drives and demonstrations (Brown and Brown 2003; Calhoun-Brown 1996; 
Harris 1999; Verba, Scholzman and Brady 1995). The relationship between exposure to 
church-based political discourse and political activism is linked to the social capital resources 
of trust, mutual obligation and group identity that congregations provide their congregants. 
Attending church regularly should increase trust and obligations because individuals are in 
settings where they share a common world view and religious perspective. As congregants 
grow closer and develop a deeper sense of mutual obligation as members of a common 
religious community, they are more likely to involve and support each other in political 
activities as well. This is particularly the case among congregations that perceive their political 
activism as theologically grounded (Harris 1999; Morris 1984; Wald 2004). 

Frequent church attendance also increases the chance that congregants will become 
involved in church committees and ministries (Brown and Brown 2003; Harris 1999; 
Peterson 1990; Verba, Scholzman and Brady 1995). Honing these civic skills will further 
enable them to act on the inclination to influence their communities (Peterson 1990; Verba, 
Scholzman and Brady 1995). Being involved in church committees and ministries provides 
individuals with opportunities to engage in public speaking, to write letters, and/or to 
mobilize others around some collective concern, all of which are key skills for political 
activism (Brown and Brown 2003; Harris 1999; Verba, Scholzman and Brady 1995). In sum, 
this body of work suggests that congregations are able to influence the political behavior of 
their flocks by providing them with opportunities to become involved in social-political 
activities and to strengthen their civic skills. 

Nonetheless, the aforementioned studies provide very little information on the factors that 
influence a congregation's potential for political mobilization. It is also difficult to generalize 
about the political activity of congregations based on previous national studies. This is largely 
because these studies tend to focus on distinctive populations (e.g., Cavendish 2000 study on 
Roman Catholic congregations; Smith and Smidt's 2003 study on predominantly black 
congregations) or to under-represent minority congregations (Chaves and Higgins 1992). 

Beyerlein and Chaves’ (2003) study is the sole exception. Consistent with a resource-based 
perspective on congregation-based political activity, this study suggests that the more 
members that congregations have, the more likely they are to involve themselves in voting, 
lobbying and protest political activities. This study also suggests that black Protestant 
congregations are, on average, less likely than Roman Catholic congregations to participate in 
lobbying activities and protests efforts. However, black Protestant congregations are more 
likely than Roman Catholic, Mainline Protestant and marginally more likely than Evangelical 
Protestant congregations to involve themselves in voter-registration efforts. While this study 
increases our understanding of the roles played by resources and denominational affiliation in 
the political behavior of religious congregations, it does not provide empirical evidence as to 
why differences exist in the political activities chosen by black Protestant, Roman Catholic, 
Mainline Protestant and Evangelical Protestant congregations. Furthermore, by excluding a 
measure on the racial composition of congregations, this study excludes predominantly black 
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Roman Catholic, black Evangelical Protestant and black Mainline Protestant congregations 
from its analysis. Therefore, it is unclear as to how congregational resources and denomination 
affect differences in the political behavior of predominantly black or predominantly white 
congregations. 


Political Resource Costs and Public Policy Influence 


It is particularly important to explore how resources affect the political activities chosen by black 
and white congregations because some political activities require more resources than do 
others. This is important because resource costs are related to effectiveness in influencing 
public policy. For example, voter registration requires minimal resources. It requires relatively 
little money, volunteers and staff commitment if set up at church. Comparatively, congregations 
require less assistance from outside sources to participate in voter registration efforts than they 
would to engage in the more resource-intensive lobbying and protest efforts. Also, while 
lobbying and demonstrations may be ongoing, voter-registration efforts often occur during 
election seasons only. 

With regard to influencing public policy, election outcomes often reveal voters’ support, or 
lack thereof, for candidates’ broad policy agendas. Nonetheless, votes rarely provide cues 
about the position of voters on specific policy issues (Verba, Scholzman and Brady 1995). To 
the contrary, marches, boycotts and information pickets are a much clearer means of 
conveying positions on issues. Similarly, elected officials are quite clear where lobbyists stand 
on policy issues based on the face-to-face meetings, letters, e-mails and phone calls they 
receive. And, to the extent that protestors and lobbyists represent a broad constituency, 
elected officials are inclined to acknowledge their demands. 


Social-Political Aspirations of Black and White Religious Congregations 


The various resource costs required to implement various political activities and the impact 
these activities have on public policy makes It particularly important to assess racial 
differences in the political behavior of religious congregations. First, blacks and whites have 
different social-political expectations of their religious congregations. Findings from the 
American Citizen Participation Study (Verba et. al. 1990) indicate that black church volunteers 
are more likely than are white church volunteers to believe that their church work should 
involve improving conditions in their communities and their country, and that contacting 
public officials with their concerns is part of that belief. Blacks also believe more strongly than 
whites that religious institutions should work to influence public policy and that their pastors 
should take stands on social-political issues, talk about controversial social issues from the 
pulpit, and endorse political candidates (Pew 2001; Newport, Ludwig and Kearney 2001). The 
political centrality of black congregations to black communities is relative to the racial and 
social-economic inequalities these congregants experience (Billingsley 1999; Brown 2004; 
Lincoln and Mamiya 1990). That is, relative to white congregations, black congregations tend 
to be located within predominantly minority communities characterized by heightened levels 
of poverty, political alienation and social isolation (Brown 2004). And, unlike white 
congregations, black congregations are often one of the few civic organizations within their 
communities dedicated to community building and social change (Billingsley 1999; Lincoln 
and Mamiya 1990). Therefore, it is likely that black congregations face greater pressure than 
do white congregations to politically respond to social concerns that affect their congregants, 
their communities and society at large. 
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Organizational and External Resource Mobilization of Religious Congregations 


Despite the heightened desire and need for predominantly black congregations to operate as 
a social and political vehicle within black communities, black congregations tend to have 
fewer resources than white congregations. This reality is linked to the voluntary nature of 
religious congregations. As voluntary associations, the most important resource that 
congregations have is their membership. The members are relied upon to accomplish church 
objectives such as facilitating worship services, maintaining buildings and supporting social- 
political activities. This is made evident by the fact that 70 percent of the social programs that 
churches sponsor are, in some degree, operated by church volunteers (Chaves 1998). 
Additionally, 87 percent of congregations pay to support their own social programs; almost all 
of this financial support comes from tithes and offerings (National Congregational Study 
1998). That said, congregations with more of these resources —i.e., members, money, Civic 
skills and social ties — are in a better position to act on their values in the public arena. 


Race, Organizational Resources and Political Activism 

The fact that blacks trail whites on all socio-economic indicators means that black congregations 
do not have as many middle class members as do white congregations (Brown 2004). As a 
consequence, black congregations attract congregants with less political efficacy, a lack of 
interest in politics and fewer civic skills when compared to white congregations (Verba, 
Scholzman and Brady 1995). This also explains why black congregations are less wealthy than 
white congregations (Brown 2004) and have a more difficult time than white congregations in 
funding their social-political endeavors, paying staff persons to help oversee these efforts, hiring 
fulltime pastors, and hiring consultants for expert advice in developing and acting on their 
social-political interests. Furthermore, the plethora of religious congregations within urban 
communities likely contributes to black congregations relying on a smaller number of volunteers 
than do white congregations (Brown 2004). Thus, black congregations have a more difficult time 
spreading program responsibilities to a greater number of persons, thereby increasing the risks 
of volunteer burnout and shortening the longevity of their programs. 

Black congregations also tend to be led by less educated pastors. By having less access to 
higher education than white clergy, particularly at elite educational institutions, black clergy may 
have less experience with bureaucracies (organizations that rely upon formal procedures to 
accomplish complex tasks) (Chang et. al., 1994; Billingsley 1999). Thus, to the extent that 
education increases interest in politics, political efficacy, civic skills and ties to policy elites, clergy 
in predominantly black congregations may be less capable of involving their congregations in 
politics than white clergy (Verba, Scholzman and Brady 1995). Black pastors are also more likely 
than white pastors to be employed outside of their churches due to financial necessity, so 
perhaps they have less time to involve their congregations in political efforts. In sum, black 
congregations tend to be led by less educated pastors and possess fewer organizational 
resources in the form of middle class members, volunteers, income and staff, relative to white 
congregations. To that end, black congregations face greater constraints than white 
congregations in pursuing political activities. 


Race, External Resources and Political Activism 

The resource constraints that confront black congregations, relative to white congregations, 
make it more difficult for black congregations to build ties with other organizations and/or rely on 
consultants to assist them in various political activities (Brown 2004). Black congregations simply 
have a relatively smaller cadre of volunteers, paid staff, financial resources and full-time clergy to 
build and maintain such relationships (Brown 2004). As evident during the Civil Rights Movement, 
coalitions are particularly important to congregation-based political activism because they allow 


Racial Differences in Congregation-based Activism * 1587 


congregations to share resources such as volunteers, staff, facilities and money in order to meet 
shared civic goals (McAdam 1982; Morris 1984; Williams 2002). These arrangements often 
facilitate long-term efforts to research issues of importance to community members and to 
recruit participants (Warren 2001; Wood 2002). Church collaboration also pushes church leaders 
to dedicate more time and energy in determining what political strategies work best as well as 
how to replicate these practices for future efforts (Trulear 2001; Warren 2001; Wood 2002). 
Coalitions also raise the visibility of social causes and increase the potential for public support 
(McAdam 1982; Morris 1984; Warren 2001; Wood 2002). The relative lack of collaborative ties 
further contributes to the difficulty black congregations have in mobilizing critical resources for 
political involvement. That said, it is unlikely that black congregations’ greater social-political 
expectations translate into greater involvement in political activities that require long-term 
planning, research on policy issues, political mobilization and political activism. 


Denominational Affiliation 


The relatively weak political infrastructure of traditionally black denominations represents a 
further external resource constraint. Baer and Singer (1992) state that black denominational 
leaders often “refrain from making public statements on social issues, endorsing political 
candidates or lending financial or even moral support to political movements (p. 97).” This is 
largely because, unlike the Roman Catholic Church, Liberal Protestant and Evangelical 
Protestant denominations, few traditionally black denominations have instituted national 
parent organizations or even a board of trustees at the national or state level (Baer and Singer 
1992). This makes it more difficult for traditionally black denominations to reach a consensus 
on socio-political policies and offer their perspective in a collective fashion to leaders at the 
state and national levels. That lack of consensus at the state or national level also makes it 
difficult for traditionally black denominations to reach a position on the issues as a 
denomination and inform local congregations. 

Contrast that with the abilities of denominations whose national and/or regional political 
offices mobilize local congregations around national issues by coordinating the political strategies 
in which the membership engages (Fowler, Hertzke and Olson 1999; Olson 2002; Wald 2004). For 
example, the local campaigns led by Evangelical Protestants to constitutionally ban gay marriage 
during the 2004 Presidential campaign season likely had much to do with the mass mobilization 
of local congregations by national and regional Evangelical Protestant denominational 
organizations. The same can be said of the ongoing involvement of Roman Catholic clergy and 
laity:in influencing abortion policy and of Liberal Protestant congregations in their attempts to 
dissuade the U.S. government from waging war against Iraq in 2002 and 2003. 

The fact that black congregations tend to be affiliated with denominations that lack state 
or national lobbyists or political committees further suggests that black congregations 
possess fewer resources than white congregations to mobilize for political purposes. The 
recent work of Ammerman (2002) and Beyerlein and Chaves (2003) exemplify this point. 
Ammerman’s (2002) six-city study of religious congregations found that black Protestant 
congregations are, on average, less likely than are Mainline Protestant and Roman Catholic 
congregations to engage in policy advocacy. Similarly, and as articulated earlier, Beyerlein and 
Chaves’ (2003) national study of religious congregations suggests that black Protestant 
congregations, on average, tend to be less likely than Roman Catholic congregations to 
engage in lobbying and protest politics. These studies may point to the fact that 
denominations with strong political infrastructures are in the best position to research and to 
take positions on social and moral issues as well as to mobilize local congregations around 
these issues. The lack of political infrastructure within black Protestant denominations likely 
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places greater constraints upon the capacity of these congregations to involve themselves 
in political activities and costs them a greater voice in public policy. 


Hypotheses 


There is a greater desire and need for black congregations, relative to white congregations, to 
act as a political force within their communities. Black congregations are, on average, more 
likely than white congregations to involve themselves in voter registration efforts, largely due 
to its relatively low resource costs. Resource-costly forms of political activism, such as 
lobbying and protests, are held to be the domain of white congregations, according to a 
resource mobilization model. However, expecting black congregations to be more heavily 
involved than white congregations in lobbying and protests is consistent with classical social 
movement theories. What factors might make a difference? In sum, this study poses the 
following hypotheses: 


H:1 Race and High Resource Cost Hypothesis: With equal 
organizational (e.g., educated clergy, staff volunteers, financial 
resources) and external resources (e.g., Civic ties, consultants) as well 
as an affiliation with traditionally white denominations, black 
congregations are more likely than white congregations to involve 
themselves in high resource cost political activities such as lobbying 
and demonstrations. 


H:2 Race and Low Resource Cost Hypothesis: Black congregations 
are, on average, more likely than white congregations to involve 
themselves in low resource cost political activities such as voter 
registration. 


H:3 Organizational Resource Hypothesis: The more organizational 
resources that congregations possess, the more likely they are to 
involve themselves in high (e.g., lobbying and protest) and low 
resource cost political activities (e.g., voter registration). 


H:4 External Resource Hypothesis: The more external resources 
that congregations possess, the more likely they are to involve 
themselves in high (e.g., lobbying and protest) and low resource cost 
political activities (e.g., voter registration). 


H:5 Denominational Affiliation Hypothesis: Congregations affiliated 
with traditionally black denominations are less likely than 
congregations affiliated with non-black denominations to involve 
themselves in high (e.g., lobbying and protest) and low resource cost 
political activities (e.g., voter registration). 


Sample 


This study relies on the 1998 National Congregational Study (NCS), a national representative 
sample of congregations, to test the above hypotheses. The NCS gathered data via a 60- 
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minute telephone survey interview with a key respondent from a congregation (e.g., minister, 
priest, rabbi or other leader). Three-quarters of NCS survey interviews were with clergy; 91 
percent were pastoral staff of and the remaining 9 percent were non-staff congregational 
leaders (Chaves et. al. 1999). The data collection employed a hypernetwork sampling 
technique. This entailed relying upon the 1998 General Social Survey (GSS) to generate a 
representative sample of congregations. The GSS, a representative sample of English- 
speaking adults in the United States, included an item that asked respondents if they had 
attended religious services during the past year. Those who reported doing so were asked to 
report the name and location of their congregations. This procedure generated a nationally 
representative sample of 1,456 congregations and collected data from 1,236 of the 
nominated congregations, a cooperation rate of 85 percent (Chaves et. al. 1999). For the 
purpose of this study, however, only predominantly black (e.g., 75 percent or more of the 
congregation is of this racial group) and predominantly white congregations are included in 
the sample. In total, there are 1,020 respondents in this study. 


Table 1: Mean Differences in the Demographic Profile of a National Representative Sample of 
Predominantly Black and Predominantly White Religious Congregations: Chi-Square Analyses 





Black White 
Total Congregation _ Congregation 


Organizational Resources 


Church Attendees 363.3 339.5 307.5. 
Church Relig. Participants 937.8 631.0 992.8 
Poor Relig. Participants (% < $25k) 24.8 37.1 22.67* 
Affluent Relig. Participants (% > $100k) 10.5 D2 (ei 
Church Staff 12.5 7.6 13.4 
Church Income $590,452 $446,934 $616,167** 
Graduate School 
Clergy Education 63.2 45.2 66.5** 
External Resources 
Religious Collaborator 49.2 ae os 
Secular Collaborator 49.0 42.6 50.2 # 
Social Service 
Representative 39.4 39.77 37.42 
Consultant 44.6 29.7 AT.3t 
Community Demographics 
Poverty > 30% 10.3 31.6 6.5" 
Unemployment > 5% 24.5 69.0 16.55 
Some College > 50% 37.0 12.3 41.4 
N= 1,020 155 865 





#<.1 *p<.05 **p <.01 (two-tailed tests) 


Table 1 provides a demographic profile of the congregations presented in this study. At 
an average worship service, congregations have an attendance of 363 persons. Far more 
persons, approximately 938, are involved in the religious life of the congregation in some 
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form, whether or not they are members. Approximately, a quarter of these religious 
participants make less than $25,000 per year, and roughly one in ten makes more than 
$100,000 a year. As a body, congregants generate, on average, more than $595,000 a year 
and employ roughly 13 part- and full-time staff. Almost two-thirds of the senior pastors in this 
study report having at least a graduate education. 

In regards to external resources, approximately half of congregations have collaborated 
with a religious and a secular organization during the past 12 months to provide social service 
programs, economic development programs and/or to participate in political advocacy efforts. 
Close to half of congregations have had a representative from a social service agency speak 
to their congregation, and about 40 percent have relied on consultants for religious and/or 
secular efforts at some point during the past 12 months. 

American religious congregations tend to be located in middle class communities. Roughly 
10 percent are located in census tracts with a poverty rate of at least 30 percent, a quarter are 
in census tracts with an unemployment rate of at least 5 percent, and a little more than a third 
are census tracts in which individuals over the age of 25 have at least some level of college 
education. This table also suggests that black congregations, relative to white congregations, 
tend to possess fewer resources and are located in more impoverished communities. 


Measures 


Dependent Variables 


Political Activism 

Lobbying and Protest Demonstrations/Marches are the high resource cost political activities 
this study seeks to investigate and Voter Registration Efforts is the low resource cost political 
activity. All of these measures are dichotomous and assess if, during the past 12 months, 
congregations have participated in these activities." 


Independent Variables 


Congregational Racial Composition 

As alluded to above, the Racia/ Composition of congregations is a dichotomous measure that 
assigns congregations a zero if at least 75 percent of their members are white and a one if at 
least 75 percent of their members are black Americans. 


Organizational Resources 

The congregation-based Organizational Resources of this study are akin to the basic 
resources of “experienced leaders and social resources such as, money, labor, and charisma” 
(p. 284) that Morris (1984) identifies as being essential to the goal attainment of social groups 
engaged in social movement activity. For the purpose of this study, the Organizational 
Resource variables consist of the following measures: Congregant Human Capital, Internal 
Resources and Clergy Education. Congregant Human Capital is an additive index of 
congregants’ estimated income and education levels. /nterna/ Resources is an additive index 
of congregations’ estimated income, number of regular attending persons, number of 
persons that regularly volunteer for church programs or initiatives, and the number of part- 
and full-time staff. Clergy Education \s a dichotomous variable that assesses whether or not 
the senior pastor has a graduate degree. While clergy education alone does not get at the 
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senior pastor's willingness or capacity to involve his or her congregation in public affairs, it is 
the closest leadership variable within this data set. Furthermore, clergy education may serve 
as a proxy for political leadership to the extent that more highly educated persons express 
greater political interests, civic skills, political efficacy and are more likely than are less highly 
educated persons to have contact with political officials (Verba, Scholzman and Brady 1995). 


External Resources 

The congregation-based External Resources attempts to assess political coalition potential. It 
does so by assessing the involvement of congregations in collaborative arrangements with 
civic and other religious organizations to provide social programs, as well as through their 
reliance on consultants. Co/laboration is an additive index that consists of the following 
dichotomous variables: involvement in a work arrangement with a secular organization to 
provide varying social programs, involvement in a work arrangement with a religious 
organization to do the same, and having a representative from a social service agency speak 
to their congregation during the past 12 months about a pressing social issue. The Consultant 
variable assesses if congregations have utilized the services of a consultant for any purpose 
during the past 12 months. 


Denominational Affiliation 

The Denominational Affiliation variable essentially serves as an additional external resource 
variable. Because congregations affiliated with traditionally black denominations are 
hypothesized as being the least politically active, traditionally black denominations serve as 
the comparison category in this study's analyses. Therefore, Roman Catholic, Liberal 
Protestant, Evangelical Protestant and Non-Christian religious traditions. This variable is 
measured in the standard fashion of previous studies on religious affiliation (Beyerlein and 
Chaves 2003; Kellstedt et al. 1996; Melton 1993). 


Control Variables 


The measurement of the control variables (e.g., congregations’ theology/political ideology, the 
region of the country in which congregations are located, the urbanicity of the census tract 
in which congregations are located, and the social economic context in which congregations 
are located) and dependent and independent variables are more fully specified in the 
appendix. 

Missing values for all of the above scales were replaced with an imputed regression score. 
This procedure organizes its calculations by patterns of missing data so the missing-value 
regressions can be conducted more efficiently. The results reported below were not 
substantively or significantly altered by this analysis technique. 


Statistical Methods 


Logit analyses were employed to test the above hypotheses because all of the dependent 
variables are dichotomously ordered. The statistical package STATA was used to estimate this 
equation through maximum likelihood. Logit analyses yield beta weights that estimate the 
effect of the independent variables on the likelihood of observing the dependent variable. In 
this case, it is the effect of race, congregational resources, denominational affiliation and the 
specified controls on the likelihood of congregations engaging in voter-registration, lobbying 
activities and protest politics. 
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Results 


Effect of Race, Resources and Denominational Affiliation on Lobbying Efforts 


Table 2 assesses the effect of race, organizational and external resources, and denominational 
affiliation on congregation-based lobbying efforts. While not displayed, the full model of this 
and all subsequent analyses also controls for theology/ideology, region, urbanicity and social- 
economic context. These analyses provide support for the Race and High Resource Cost 
Hypothesis (H:1). That is, when black and white congregations have a similar level of 
organizational and external resources and are affiliated with similar denominations, black 
congregations are more likely than white congregations to engage in lobbying efforts. It 
should also be noted that denominational affiliation serves as the main catalyst for the 
suppression of the race effect on lobbying. These analyses also provide some support for the 
Organizational Resource Hypothesis (H.3) and strong support for the Externa/ Resource 
Hypothesis (H:4). There is limited support for the Denominational Affiliation Hypothesis (H:5). 
These relationships are specified in greater detail below. 

The bivariate model of these analyses indicates that black congregations are, on average, 
no more likely than are white congregations to engage in lobbying activities. The 
organizational resource model indicates that accounting for the organizational resources of 
congregations strengthens the relationship between race and lobbying. Relative to white 
congregations, black congregations become marginally more involved in this form of political 
activism (sig.=.096). This analysis also suggests that congregations with more internal 
resources are more likely to lobby. However, the external resource model indicates that 
accounting for external resources does not strengthen the relationship between race and 
lobbying to any statistical or marginal significance. Finally, collaborating with other 
organizations and relying on consultants increases the chance that congregations will 
participate in lobbying efforts. 

Relative to the bivariate model, the denominational model suggests that the involvement 
of black congregations in lobbying efforts, relative to white congregations, is strengthened to 
the point of statistical significance. As such, it follows that congregations affiliated with the 
Roman Catholic Church and non-Christian denominations are more likely than are 
congregations affiliated with traditionally black denominations to participate in lobbying 
efforts. Similarly, congregations affiliated with Liberal Protestant denominations are marginally 
more likely (sig. = .076) than are congregations affiliated with traditionally black 
denominations to engage in this resource costly form of political activism. 

Relative to the previous model, the denominational, organizational and external resource 
model suggests that the involvement of black congregations in lobbying efforts, relative to 
white congregations, is further strengthened. This model also suggests that congregations 
with more internal resources, those that maintain more collaborative ties and rely on 
consultants, tend to be more heavily involved in lobbying. Further, congregations affiliated 
with the Roman Catholic Church and non-Christian denominations tend to be more heavily 
involved than congregations affiliated with traditionally black denominations in this resource 
costly form of political activism. 

The race effect is weakened from the previous model to the full model. Nonetheless, in 
support of the Race and High Resource Cost Hypothesis (H.1), black congregations remain 
statistically more likely than white congregations to engage in lobbying. In partial support of the 
Organizational Resource Hypothesis (H.3), this model also indicates that congregations with more 
internal resources are more likely to involve themselves in lobbying. In full support of the External 
Resource Hypothesis (H:4), collaborative relationships and reliance on consultants increase the 
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likelihood of congregations participating in lobbying. Lastly, and in limited support of the 
Denominational Affiliation Hypothesis (H:5), congregations affiliated with the Roman Catholic 
Church and non-Christian denominations are statistically and marginally (sig. =.088) more likely 
than are congregations affiliated with traditionally black denominations to engage in lobbying. 


Effect of Race, Resources and Denominational Affiliation on Protest 
Demonstrations 


Table 3 assesses the effect of race, organizational and external resources, and denominational 
affiliation on congregation-based protest demonstration efforts. As in the lobbying analyses, 
these analyses provide support for the Race and High Resource Cost Hypothesis (H:1). That is, 
when black and white congregations have a similar level of organizational and external 
resources and are affiliated with similar denominational bodies, black congregations are, on 
average, more likely than white congregations to engage in protests. As in the lobbying 
analyses, these analyses also suggest that denominational affiliation serves as the main catalyst 
for the suppression of the race effect on protest and demonstration efforts. These analyses also 
provide some support for the Organizational Resource Hypothesis (H:3) and strong support for 
the External Resource Hypothesis (H:4). There is also partial support for the Denominational 
Affiliation Hypothesis (H:5). These relationships are specified in greater detail below. 

The bivariate model of these analyses indicates that black congregations are, on average, no 
more likely than white congregations to engage in protest activities. The organizational resource 
model indicates that accounting for the organizational resources of congregations does little to 
strengthen the relationship between race and protest efforts. This analysis also suggests that 
congregations with more internal resources are more likely to participate in protest activities. As 
in the organizational resource model, the external resource model suggests that taking into 
account the external resources of religious congregations does little to strengthen the 
relationship between race and protest politics. This model does suggest, however, that 
collaborating with other civic organizations and relying on consultants significantly and marginally 
(sig. = .060) increases the likelihood of congregations involving themselves in protest efforts. 

The denominational model suggests that the involvement of black congregations in protest 
efforts, relative to white congregations, is strengthened to the point of marginal significance (sig. 
= 083) when denominational affiliation is introduced into the model. As such, it follows that 
congregations affiliated with the Roman Catholic Church are more likely than congregations 
affiliated with traditionally black denominations to participate in protest demonstrations. In 
relation to the previous model, the denominational, organizational and external resource model 
suggests that the involvement of black congregations in protests, relative to white 
congregations, is strengthened to the point of statistical significance. This model also suggests 
that congregations with more internal resources, those that maintain more collaborative ties and 
rely on consultants, tend to be more heavily involved in protest activities. Further, relative to 
congregations affiliated with black denominations, affiliation with the Roman Catholic Church 
and Evangelical Protestant denominations increases the likelihood of congregations involving 
themselves in this resource costly form of political activism. 

In support of the Race and High Resource Cost Hypothesis (H:1), black congregations 
remain more likely than white congregations to participate in protest activities in the full model. 
In partial support of the Organizational Resource Hypotheses (H:3), this model also indicates 
that congregations with more internal resources are more likely to involve themselves in 
protest politics. In full support of the Externa/ Resource Hypotheses (H:4), collaborative 
relationships and reliance on consultants increase the likelihood of congregations participating 
in protest politics. Lastly, and in partial support of the Denominational Affiliation Hypothesis 
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(H:5), congregations affiliated with the Roman Catholic Church and Evangelical Protestant 
denominations are more likely than are congregations affiliated with traditionally black 
denominations to participate in protest demonstrations. 


Effect of Race, Resources and Denominational Affiliation on Voter 
Registration Efforts 


Table 4 assesses the effect of race, organizational and external resources, and denominational 
affiliation on congregation-based voter registration efforts. In support of the Race and Low 
Resource Cost Hypothesis (H.:2), these results suggest that black congregations are more 
likely than white congregations to adopt this low resource cost form of political activism. The 
bivarite model suggests that, on average, black congregations are more likely than white 
congregations to participate in voter registration efforts. The race effect also remains strong 
in the full model. However, these analyses provide conflicting support for the Organizational 
Resource Hypothesis (H:3), and limited support for the Externa/ Resource Hypothesis (H:4). 
These analyses do not support the Denominational Affiliation Hypotheses (H:5). These 
relationships are specified in greater detail below. 


Table 4: Effect of Race of Congregation, Organizational and External Resources and 
Denominational Affiliation on Congregation-based Voter Registration Efforts (Logit Analyses) 


Bivariate Full 

Race 

Black Congregation 20185. 6213) 2.447** (.363) 
Org. Resources 

Human Capital wa -.135# (.071) 

Internal Resources — .234™ (.042) 

Clergy Education a -.577* (.245) 
Ext. Resources 

Collaboration — .203# (.118) 

Consultant — .416# (.240) 
Denomination 

Roman Catholic — -.668 (.470) 

Evan. Protestant — -.330 (.393) 

Liberal Protestant — .672 (.457) 

Other/Non-Christian — .436 (.650) 

Log likelihood -320.235 -280.387 

N = 1,017 


eS SS SY 


#p<.1 *p<.05 **p S01 (two-tailed tests) Standard errors are in parentheses. 


The relationship between organizational resources and the involvement of religious 
congregations in the low resource cost political activity of voter-registration efforts is not a 
straightforward one. On the one hand, the more internal resources (e.g., congregational income, 
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number of congregants and paid staff) that congregations possess, the more likely they are to 
participate in voter registration efforts. However, it is also the case that human capital (e.g., 
congregant education and income) and clergy education marginally (sig. = .059) and significantly 
reduce the likelihood that congregations will participate in voter registration efforts. While the data 
seem to conflict, these analyses suggest that all other things being equal (i.e., controlling for 
resources, denominational affiliation and control variables), congregations with less affluent 
members and less educated pastors are more heavily involved than their more affluent 
counterparts in encouraging their congregants and wider community to vote. This may suggest 
that if congregations with less affluent and politically aware clergy and members are to involve 
themselves in politics, that they are far more likely to involve themselves in those activities that 
require fewer information costs and risks. In fact, they are actually more likely to do so than are 
their more affluent and politically aware brethren. These analyses provide limited support for the 
External Resource Hypothesis (H:4). That is, congregations involved in collaborative arrangements 
with other religious and secular organizations (sig. = .086) and rely on consultants (sig. = .083) are 
only marginally more likely than others to engage in voter registration efforts (sig. = .086). Finally, 
there is a lack of support for the Denominational Affiliation Hypothesis (H:5). 

In sum, this study provides support for the Race and High Resource Cost Hypothesis (H:1). 
That is, when the organizational and external resources and denominational affiliation of black 
and white congregations are taken into account, black congregations are more likely than are 
white congregations to engage in lobbying and protest politics. Denominational affiliation 
serves as the main suppressing agent that explains black congregations’ relative involvement 
leap in these resource costly forms of political activism. This study also provides support for 
the Race and Low Resource Cost Hypothesis (H:7), suggesting that predominantly black 
congregations are, on average, more likely than white congregations to participate in voter 
registration efforts. There is limited support for the Organizational Resource Hypothesis (H:3), 
with the internal resources of congregations being the only organizational resource predictor 
of all forms of political activism in the hypothesized direction. There is partial support for the 
External Resource Hypotheses (H:4); collaborative ties and reliance on consultants are 
strongly related to the resource-costly lobbying and protest efforts, but weakly related to the 
resource-inexpensive political act of voter registration. And, finally, there is limited support 
for the Denominational Affiliation Hypothesis (H:5). 


Discussion 


This study largely suggests that resource mobilization theory does well in explaining racial 
differences in the political strategies adopted by a national representative sample of 
predominantly black and predominantly white religious congregations. Resource mobilization 
theory largely suggests that social movements emerge when social groups possess the 
requisite resources to politically act upon recognized political opportunities (Cuzén 1990; 
McCarthy and Zald 1977; McAdam 1982; Morris 1984; Williams 2002). This theory has been 
used to explain the role that resources play in accounting for the political mobilization 
potential of aggrieved groups affiliated with religious congregations as well as the strategies 
they employ to influence policy makers (McAdam 1982; Morris 1984; Williams 2002). 
However, this is the first study that has employed multivariate analytical techniques to assess 
the importance of resources to the various political strategies employed by black and white 
congregations. As such, this study contributes to the understanding of the role that varying 
congregational resources play in the political strategies adopted by congregations seeking to 
influence public policy. This study also makes a positive contribution to the application of 
resource mobilization theory by specifying how resource disparities account for differences in 
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the levels and types of political strategies chosen by black and white congregations. 

The fact that black congregations are, on average, no more likely than are white 
congregations to involve themselves in lobbying and protest politics has much to do with the 
greater resource constraints confronting black congregations. Despite the fact that black 
Americans are more likely than are white Americans to look to their religious institutions for 
political leadership (Newport, Ludwig and Kearney 2001; Pew 2001), black congregations 
have greater difficulty acting on their political inclinations. This is largely because black 
congregations tend to be saddled with many of the same socio-economic concerns that 
confront their congregants and wider community. It is only when black and white 
congregations have access to an equal supply of organizational and external resources and 
are affiliated with the same denominational bodies that black congregations are more likely to 
adopt lobbying and protests politics as viable political strategies. 

Black congregations are, however, more likely than their white counterparts to participate 
in voter registration efforts. This is linked to the match between the relatively low resource 
capacity of black congregations with the relatively low resource cost required to implement 
this form of political activism. It is also likely that the historical memory of the Civil Rights 
Movement, which informs the political decisions of more than two-thirds of black pastors, 
plays a large role in the heightened involvement of black congregations in voter registration 
efforts (Lincoln and Mamiya 1990). Nonetheless, there is little reason to believe that such 
social and political expectations would not carry over to other forms of activism that tend to 
have a greater impact on political debates, legislative decision making and the appropriation 
of public resources. 

The principal resource constraint that limits the capacity of black congregations to involve 
themselves in more resource costly and politically effective political strategies, relative to white 
congregations, is denominational affiliation. Black congregations tend to lack affiliations with 
denominations that have strong political infrastructures (Baer and Singer 1992). The greater 
importance of denominational affiliation in explaining race differences in congregation-based 
lobbying and protest efforts, relative to the organizational and external resources of 
congregations, is likely linked to the bureaucratic infrastructure of denominational bodies. 
Unlike local congregations, the staff and volunteers within the political offices of 
denominations are often experts in the political issues in which they are working (Fowler, 
Hertzke and Olson 1999; Olson 2002; Wald 2004). Volunteers and staff of denomination offices 
also differ from those of local congregations in that they are able to devote all of their time to 
political issues (Fowler, Hertzke and Olson 1999; Olson 2002; Wald 2004). Further, local 
congregations tend to spend only 20 percent of their income on social programs (Chaves 
1999). In comparison, the political offices of denominational bodies are able to devote all of the 
money appropriated to them on socio-political issues (Fowler, Hertzke and Olson 1999; Olson 
2002; Wald 2004). In the same vein, if congregations are aligning themselves with other 
voluntary associations, it is unlikely that they are able to avoid the reality of other non-political 
responsibilities commanding their time and money. To that end, it is sensible that denomination 
affiliation serves as the principal resource constraint explaining racial differences in the 
involvement of religious congregations in the high resource costly political activities of lobbying 
and protest politics. 

It is plausible that affiliation with the Roman Catholic Church provides black congregations 
with the greatest boost in their likelihood of participating in more resource costly forms of 
political activism. This is largely because congregations affiliated with the Roman Catholic 
Church are consistently more likely than are congregations affiliated with traditionally black 
denominations to involve themselves in lobbying and protest efforts. The strength of this 
effect is indicative of the fact that the Roman Catholic Church is the most hierarchical of all 
religious denominations (Fowler, Hertzke and Olson 1999; Olson 2002; Wald 2004). The 


Racial Differences in Congregation-based Activism * 1599 


hierarchical structure of the Roman Catholic Church enables it to send directives on its 
positions on social issues down a well-established chain of command to local parishes. This 
structure may also contribute to the church's greater capacity to offer encouragement and 
training in the political strategies that local parishes should adopt, relative to Liberal Protestant 
and Evangelical Protestant churches. Indeed, Cavendish’s (2000) study on the community 
activism of black and white Catholic congregations suggests that black Catholic 
congregations are more heavily involved in social activism for social change than are their 
white counterparts. Taken together, Cavendish’s (2000) study and the current study suggest 
that being connected to denominations with the resources to assist and coordinate socio- 
political efforts puts black congregations in a relatively good position to act on their social and 
religious interests. 

In addition to specifying the role that resources play in explaining racial differences in 
congregation-based political activism, this study also makes clear the independent effect of 
varying resource types on the political strategies that religious congregations adopt. The most 
important organizational resource is a congregation's internal resource capacity. That is, if 
congregations are to participate in voting related efforts, lobbying or protest politics, they 
must have the volunteer base, financial resources and staff to do so. These resources 
essentially represent a refined version of the “basic resources” that Morris (1984) contends 
are necessary for congregational bodies to involve themselves in sustained political 
mobilization concomitant with protest demonstrations and intense lobbying efforts. This 
finding is also consistent with other studies that suggest that larger and better financed 
congregations are more capable of involving themselves in voter registration, lobbying and 
protest efforts (Cavendish 2000; Beyerlein and Chaves 2003; Smith and Smidt 2008). In short, 
this study suggests that the internal resources of congregations are essential to their capacity 
to involve themselves in any number of political activities. 

Other findings demonstrate that the external resources of congregations in the form of 
potential coalitional partners and reliance on consultants are seemingly most important to 
their involvement in high resource cost political activities such as lobbying and protest 
politics. These resources are less important to congregational involvement in the low 
resource cost political activity of voter registration efforts. In practical terms, this makes 
sense. Because of the fewer resources necessary to participate in short-term efforts that 
require relatively few volunteers and financial resources, there is less need for congregations 
to combine their resources to get involved in these activities. In fact, it may require more 
resources in terms of time commitment on the part of staff and volunteers to coordinate 
these joint events than it does to simply set up a voter registration booth in the lobby of one’s 
church. In contrast, lobbying and long-term public demonstration campaigns often require 
more professionalism, money, volunteers and staff than any one congregation possesses. It 
is, therefore, often necessary to combine resources with other congregations and civic 
organizations and even to seek the expert advice of consultants to initiate and maintain the 
desired effect of such campaigns. 

In short, this study contributes to the understanding of the application of resource 
mobilization theory to the political behavior of American congregations. It does so by 
empirically verifying the relative saliency of congregations’ internal resources to their capacity 
to adopt any number of political strategies to influence public policy. This study also suggests 
that the greater resource constraints facing black congregations relative to white 
congregations, principally denominational affiliation, largely explains the involvement of black 
and white congregations in the resource intensive activities of lobbying and protests politics. 
Finally, the low resource costs of voter registration concomitant with black Americans’ greater 
interests in church-based political activism, relative to white Americans, likely explains black 
congregations’ heightened involvement in voter registration efforts. 
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Conclusion 


As voluntary associations, the political capacity of congregations is almost wholly based upon 
the social ties, human capital, civic skills and other political resources of their congregants. 
Thus, the political capacity of congregations is symptomatic of the racially unequal 
distribution of socio-economic resources that exists within society at large. Consequently, 
predominantly black congregations have fewer resources to mobilize for political purposes 
than do white congregations. This directly affects the strategies that black and white 
congregations employ when demanding increased public funding for after-school programs, 
AIDS research and prevention, increased economic development within their communities, 
etc. So, while African Americans express a greater desire for their congregations to be 
politically active, black congregations face a relatively greater challenge in acting on their 
social and religious interests. 
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Appendix: Measurement of Variables 
Independent Variables: Organizational Resources 


The Human Capital variable is based on the personal report of the senior pastor or some other 
church leader. This variable was measured by an index that includes the percentage of 
congregants who are college graduates, have less than a high-school diploma, make more than 
$100,000 per year, and make less than $25,000 per year. Originally, all of the constituent variables 
in this index were measured as continuous variables. However, partially due to the skewed 
distribution of the percentage of persons making more than $100,000 a year, and those who 
lacked a high school diploma, these two variables were dichotomized. For both variables, 75 
percent or more of congregations were reported as having no more than 10 percent of 
congregants making such an income or lacking a high school diploma. For this reason, both of 
these variables were split at the 75 percent percentile mark and made into dichotomous variables. 
In regards to the high-income variable, congregations were given a zero if no more than 10 percent 
of congregants earned no less than $100,000 a year, they were given a 1 if at least 11 percent of 
their congregants earned this much. With regard to the high school education variable, 
congregations were given a zero if at least 11 percent of their congregants lacked a high school 
diploma and a 1 if no more than 10 percent of congregants lacked a high-school diploma. 

Both the percentage of congregants who had college degrees and earned less than $25,000 a 
year were recoded and split into quartiles. With regard to the college education variable, 
congregations were given a zero if no more than 12 percent of their congregants earned a college 
degree, a 1 if 13 to 30 percent had earned a degree, a 2 if 31 to 60 percent had earned a degree, and 
a3 if 61 percent or more of congregants had earned a degree. With regard to the poor member 
variable, respondents were given a zero if at least 36 percent of their congregants earned less than 
$25,000 a year, a 1 if 21 to 35 percent earned this much, a 2 if 11 to 20 percent earned this much, 
and a 3 if no more than 10 percent of their congregants earned this much. While this additive 
measure is not the most precise estimate of congregant human capital as evinced by its moderate 
reliability estimate of .590, it is the closest tnat can be ascertained of this concept by this data set. 

Originally, all of the constituent measures of the Internal Resource index were measured as 
continuous variables. However, partially due to the skewed distribution of these variables, these 
measures were recoded and split into quartiles. Regarding church income, congregations were 
given a zero if their income from all sources during their most recent fiscal year amounted to no 
more than $97,000, a 1 if they earned $97,001 to $250,000, a 2 if they earned $250,001 to 
$575,854, and a 3 if they earned in excess of $575,854. The church attendance variable was 
recoded such that congregations were given a zero if the number of persons that attended church 
in the previous week amounted to no more than 100 persons. Congregations were given a 1 if it 
was 101 to 213 persons, a 2 if it was 214 to 400 persons, and a 3 if it was in excess of 400 persons. 
The church volunteer variable was recoded such that congregations were given a zero if the 
number of church volunteers amounted to no more than 150 persons. Congregations were given 
a 1 if it was 151 to 360 persons, a 2 if it was 361 to 1000 persons, and a 3 if it was in excess of 
1,000 persons. The church staff variable was recoded such that congregations were given a zero 
if the number of part- and full-time staff amounted to no more than three persons. Congregations 
were given a 1 if they employed 4 to 6 persons, a 2 if they employed 7 to 13 persons, and a 3 if 
they had more than 13 persons on staff. The index has reliability estimate of .917. 


External Resources: Collaboration and Denominational Affiliation 


The Collaboration index has a reliability index 0.614. Denominational membership status in national 
religious organizations, such as the National Council of Churches or the National Association of 
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Evangelicals, were employed to classify various Baptist, Methodist, Lutheran, Presbyterian and 
Episcopalian denominations into mainline and evangelical Protestant traditions (Kellstedt et. al. 
1996; Melton 1993). Black Protestant congregations are affiliated with historically black 
denominations, such as African American Episcopal Church, the National Baptist Convention, USA, 
the Progressive National Baptist Convention, Church of God in Christ, to name a few, as indicated 
in J. Gordon Melton's Encyclopedia of American Religions (1993). 


Control Variables 


SocialtEconomic Context was measured by an additive index of 1990 census tract data 
surrounding the congregations. The three constitutive measures of this index include 
congregations located in census tracts with a 30 percent or greater poverty level, a 5 percent or 
greater unemployment rate among residents age 16 and older, and where 50 percent or less of 
residents age 25 and up have some college education. While this additive measure is not the most 
precise estimate of the socio-economic context of American congregations as evinced by its 
moderate reliability estimate of .531, it is the closest that can be ascertained of this concept by 
using this data set. The cut points of the dichotomously measured constitutive variables of this 
index were generated by the data set and cannot be manipulated. In total, this scale ranges from 
0 to 3, with zero indicating congregations located in the lowest social-economic context. 

Theology/Political Ideology was measured by a two-item additive measure of congregational 
theology and political ideology. Respondents were asked if they believed the theological beliefs 
of their congregation to be more on the conservative side, right in the middle or more on the liberal 
side. They were asked the same question of their congregations’ political ideology. This scale 
ranged from 2 to 6, with 2 being very conservative and 6 being very liberal. The index has reliability 
estimate of .731. Urbanicity is a census tract variable based upon the 1990 census data. Suburban 
and rural areas are the categories included in the analyses and urban areas are the comparison 
category. The Region variable was broken up in the following categories; Northeast, Northcentral, 
South and West. For the purpose of these analyses, West, the largest region, serves as the 
comparison category. 


Appendix: Political Activism Distribution by Race of Congregation: Chi-Square Analyses (# <1 
*p<.05 **p<.01) 
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Abstract 

Survey data from a southwestern metropolitan area are used to analyze whether the ability 
of personal Christian religiosity to predict social conformity is spuriously due to self- 
control. Results indicate that both personal religiosity and self-control display statistically 
significant, independent negative net relationships with many forms of projected 
misbehavior. And interaction between self-control and religiosity in predicting deviance 
appears to be limited. Thus, self-control does not seem to account for the effects of 
religiosity, leaving the issue of how and why religiosity leads to conformity unresolved. 


In explaining social order and social change sociologists have focused heavily on socialization 
(Akers 1985; Bandura 1977; Parsons 1951) and social control (Gibbs 1981, 1989: Ross 1901: 
Tittle 1977). They have paid particular attention to the structure and functioning of institutions 
oriented around those two processes, especially those institutions that might affect 
conformity to societal norms. Religion ranks high among such institutions. Its importance 
rests partly on the assumption made by many sociologists that religiosity helps generate 
internalized commitments to norms and effectively threatens sanctions for violation (Tittle and 
Welch 1983). Though the actual effect of religion on various kinds of misbehavior has been 
disputed (e.g., Harris 2003; Sherkat and Ellison 1999), most research confirms the ability of 
individual religiosity to predict adherence to the law, absence of alcohol and drug abuse, and 
obedience to other social norms such as school-based rules and mandates for appropriate 
juvenile conduct (Baier and Wright 2001; Benda 2002; Evans et al. 1995; but see Hirschi and 
Stark 1969 for contrary evidence). However, religiosity’s predictive efficiency appears to be 
contingent (Stark et al. 1982; Tittle and Welch 1983) on the type of offense or on the religious 
composition of the individual's community. (Evans et al. 1995 present an opposing view.) 

A major unanswered question is exactly how religion brings about conformity to social 
norms. Some contend that religious individuals are constrained toward obedience by fear of 
supernatural consequences that might be experienced while alive (Harris 2003) or anticipated 
for the afterlife (Burkett and White 1974; Jensen and Erickson 1979); and there is evidence 
consistent with this argument (Burkett 1993; Burkett and White 1974; Harris 2003). Others 
theorize that religious training helps people internalize moral commitments that motivate 
regulation of their own conduct to avoid feelings of guilt (Bandura 1977). Some evidence 
favorable to this idea is also available (Perrin 2000). Still others maintain that religious 
experience brings people together within networks of believers who mutually exercise 
informal social control and whose collective activities reduce the opportunities for 
misbehavior (Stark and Bainbridge 1996; Johnson et al. 2000). This ‘community effect’ 
contention has also enjoyed some support (Richard et al. 2000; Stark and Bainbridge 1996), 
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with research not only confirming the importance of the structure of religious communities, 
but also the structure of the larger social context in which the religious community is 
embedded (Ellison et al. 1997; Evans et al. 1995; Jang and Johnson 2001; Stark 1996; Welch 
et al. 1991). 

Because all of these potential causal processes have some credibility, many scholars have 
attributed religion's influence on individual adherence to social norms to the simultaneous 
operation of several variables having to do with religious experience/faith or the situation in 
which religious activities take place. Hence, the main task of research has been to assess the 
relative importance of the three processes, showing how and under what conditions they 
converge. However, an alternative contention calls into question the import of religion 
altogether by suggesting that its effects on misconduct are spurious. This contrary argument 
theorizes that the apparent effects of religious orientations and activities, as well as many 
other traditional sociological variables, such as peer influences and socio-economic status, 
are merely products of prior personality characteristics that presumably affect religion, these 
other traditional variables and misconduct. One version of the spuriousness argument 
emphasizes self-control, a socially learned characteristic (Gottfredson and Hirschi 1990; 
Baumeister and Exline 1999, 2000), while a second version focuses on a presumed 
genetic/biological predisposition toward activism that is manifest in restlessness and pursuit 
of stimulation among those who disobey the norms (Ellis 1987). Predispositions toward 
activism presumably make religious practice unattractive while at the same time making 
misbehavior appealing. 

The contention that religious influence on obedience to norms results from early acquired 
ability to exercise self-control is especially interesting because its main theoretical 
embodiment has attracted much attention from students of crime and social deviance and 
has stimulated a large body of work about its potential effects. A meta-analysis of 10 years 
of research (Pratt and Cullen, 2000) as well as numerous subsequent studies (e.g., Baron 
2003; Finkel and Campbell 2001; Hay 2001; Vazsonyi and Crosswhite 2004) show that low 
self-control is a persistent, though modest, predictor of many kinds of rules violations and that 
it helps account for many other well established associations between various variables and 
criminal or deviant behavior. In addition, considerable research suggests that religious people 
do, in fact, have greater self-control (Aziz and Rahman 1996; Bergin et al. 1987; Furrow and 
Wagener 2000; McClain 1978), although this research has used only small non-random 
samples and weak measurement. 

Self-control theorists claim that low self-control is, for all intents and purposes, the 
individual-level cause of crime (with crime defined as acts of force or fraud for self- 
gratification) (Gottfredson and Hirschi, 1990: 232). They also contend that most connections 
between social variables and force, fraud or analogous imprudent acts (smoking and drinking, 
being late for appointments and such) that suggest weakness in the face of temptation are 
due to the antecedent effect of self-control. Yet, this hypothesis has hardly been tested, 
especially as it bears on religiosity. A dearth of research is striking because of the seeming 
importance of the issue, given the central role ascribed to religion by many sociological 
theories (Tittle and Welch 1983). Not only is there a large body of research demonstrating the 
predictive power of religiosity for various kinds of obedience to rules and laws and a body of 
literature showing a connection between religiosity and self-control, but there is a strong 
logical basis for expecting self-control to affect religiosity as well as help people avoid using 
force or fraud and analogous acts. 

By allowing suppression of impulses for immediate gratification, self-control may help 
individuals meet the expectations of fellow religious believers, thereby not only generating 
greater conformity to the rules embraced by religious co-adherents but also ensuring stronger 
social bonds that help sustain conformity. In addition, high self-control may facilitate 
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internalization and operation of religious beliefs. For example, those with relatively high self- 
control may find it easier to absorb and follow religious dictates emphasizing denial and self- 
sacrifice and to undergo religious instruction and worship, which require large investments of 
time and much personal discipline. At the same time, self-control theoretically allows people 
to anticipate and take into account the potential long-range and costly consequences of 
misconduct, which Gottfredson and Hirschi (1990: 95, 190-191) contend constitutes the main 
interpretive link between self-control and misbehavior, thereby reducing the chances they will 
actually violate the norms. Thus, self-control may enhance both religiosity and obedience to 
rules of law or social norms generally endorsed by religious followers, leading to a spurious 
association between religiosity and conformity. 

However, others contend that values independently influence conformity (Akers 1985; 
Akers 1998; Etzioni 1988, 1996; Parsons 1937). Values may bear on the attractiveness of 
offending to the individual, a characteristic that Gottfredson and Hirschi regard as unimportant. 
According to the self-control theorists, everybody is motivated to use force or fraud because 
the results of such acts are inherently gratifying: the object of the offense is clearly pleasurable, 
and universally so....It follows that, though there will be little difference among people in their 
abilities to see the pleasures of crime, there will be considerable difference in their abilities to 
calculate potential pains (Gottfredson and Hirschi 1990: 95). Values may also affect individuals’ 
desires to exercise whatever degree of self-control they might have (Baumeister et al. 1994). 
Some presumably possess high self control but, rather than exercising it to avoid criminal or 
deviant behavior, they use it to more effectively commit crimes (Benson and Moore 1992; Tittle 
1991). On the other hand, some with weak self-control may nevertheless want to control 
themselves so badly that they conform to a greater extent than do others with higher degrees 
of self-control. (See Tittle et al. 2004.) Hence, religiosity may affect conformity to social and 
legal rules independently of self-control. 

Self-control and religiosity may also interact in their influence because the deviance- 
restraining effects of religiosity may be especially intense among those with strong self- 
control. Not only are religious people with high self-control potentially inhibited by their fear of 
supernatural sanctions and by their moral codes, but given their high self-control they are also 
more likely to take into account potential penalties exacted by their religious communities or 
secular authorities, and to consider the inherently destructive nature of some forms of 
misconduct. At the same time, those with little religiosity should be even more likely to 
deviate if they have weak self control because they will be less able to restrain their impulses 
even if they have some fear of a punitive afterlife or some non-religious moral commitments, 
and they will be less likely to anticipate informal or formal sanctions. 

Therefore, to assess common notions about the import of religion on conformity and 
society, it is important to investigate the self-control/religiosity/deviance nexus, particularly in 
light of the challenging claims of self-control theorists. The exploration of that nexus is 
motivated by three specific issues crucial to several areas of sociology. They can be succinctly 
summarized in the form of questions: (1) are religious people really more capable of self- 
control than non-religious people?; (2) are the frequently observed associations between 
religiosity and force, fraud or analogous acts due to a common association of religiosity and 
misbehavior with self-control?; and (3) do self-control and religiosity interact in producing 
crime/deviance? If religious people are no more capable of self-control than others, and if the 
associations between religiosity and various forms of deviance are in fact attributable to self- 
control, then the usual channels through which the effects of religion are thought to be 
realized will be challenged. However, if both religiosity and self-control have independent 
effects or if they interact in preventing criminal/deviant behavior, scholars will need to 


integrate the two. 
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Methods 


Sampling and Data Collection 


Data are from the 16" annual Oklahoma City Survey, conducted during the spring of 1994 by 
faculty and graduate students from the Department of Sociology at the University of 
Oklahoma. The annual survey series ended in 1996 after 18 years. Because there was often 
little continuity from year to year in its content, the 1994 data are the only ones sufficient for 
addressing issues about religiosity and self-control. Even though the 1994 data have been 
analyzed previously to address some issues about self-control (Tittle et al. 2003a, 2003b, 
2004), they have not been used to investigate religiosity and obedience to social norms or the 
interplay between self-control and religiosity in producing conformity. 

A simple random sample of 350 adults aged 18 and over was drawn from the A.L. Polk 
Directory of names and addresses of Oklahoma City residents. Notices were mailed to 
prospective interviewees to inform them that they would soon be contacted by a member of the 
research team to schedule a convenient time for an appointment. Individuals who could not be 
interviewed were replaced randomly. Forty percent of the 350 respondents who completed the 
face-to-face interviews were original selections with the remainder being random substitutes. To 
help increase accuracy of reporting, interviewers did not see the separate sheet on which 
individuals reported past criminal behavior and projected possible future misconduct. 

Because more than half of the original households produced no interview, the sample's 
demographic representativeness might be in doubt, and because even demographically 
representative samples may contain an over abundance of people with high self-control, 
those who are religiously oriented, and individuals less inclined toward crime and deviance, 
it is important to try to assess how well the sample reflects the religiosity, criminality and 
degree of self-control of the population of Oklahoma City in 1994. Some demographic 
comparisons of the 1994 sample with the 1990 census figures are possible, although 
differences might be expected because of actual changes during the four-year interval 
between the census and the survey. First, although it is not possible to compare mean or 
median ages, given that the sample is based on those 18 or above while the census covers 
the whole age range, we can compare ratios of people in various age categories. Thus, the 
ratio of people aged 65+ to those 21-44 in the census is .30; that same ratio in the sample is 
.40. Similarly the ratio of those 45-64 relative to those 21-44 in the census is .46 while it is .68 
in the sample. Thus the sample population is older. The sample is also slightly more white 
than the census indicated (80 percent vs. 78 percent for the census population), and it is a 
little more female (56 percent vs. 53 percent for the census). In addition the sample seems 
to include a larger percent of married people (54 percent in the census compared with 61 
percent of the sample) and sample respondents are more highly educated (22 percent in the 
census with a college degree compared with 34 percent of the sample). Thus, because our 
respondents appear to be of higher socio-economic status and to encompass the more 
stable elements of the population the usual biases of household surveys seem evident. 

The main concern for our study, however, is the possibility that these demographic aberrations 
may have produced some skewness toward the more self-controlled, the more religious and the 
less criminally inclined, thereby reducing variation and perhaps biasing our findings concerning 
these variables. It is difficult to assess this potential distortion because there are few independent 
indicators of the theoretical variables. Nevertheless, some comparisons are possible. First, - 
according to church sources (Bradley et al., 1992), 73 percent of the population of Oklahoma 
County (which includes approximately 100,000 more people than the city itself) were religious 
adherents, with only .2 percent of the population being Jewish. By comparison, 66 percent of the 
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sample agreed or strongly agreed that they were very religious, and there were no Jewish 
respondents. Based on these limited data, a religiosity bias does not seem to be present. 
Second, since smoking, drinking and having accidents are regarded by Gottfredson and Hirschi 
as indicative of low self-control, we can make some inferences by comparing the percent of our 
respondents who reported smoking (28 percent) and drinking heavily (approximately 20 percent) 
with mortality figures (we have no indicator of accidents). If we assume that smokers, drinkers 
and the accident prone overlap considerably, we might infer that about 25 percent of our sample 
exhibit low self-control. Mortality figures suggest that about 11 percent of deaths in 1990 in 
Oklahoma City were due to lung cancer, liver disease and accidents. Even allowing for the fact 
that tobacco and alcohol enhance the chances of other causes of death, such as heart and 
circulatory diseases or homicide, it appears that our measures identify more people with low self- 
control than would be expected from mortality figures. This is exactly what would be expected if 
we are measuring self-control accurately because low self-control manifests itself in many more 
ways than is evident from these three causes of death. Thus, these comparisons again fail to 
show a bias toward the theoretical variable. Finally, the number of violent crimes in Oklahoma City 
in 1991 relative to the population was .011. This is in comparison to the 10.6 percent of our 
respondents who reported having hurt someone on purpose during the past five years. If the 
police figure is extrapolated for five years, it gives a violence rate of about 5.5 percent, which is an 
underestimate since the population base includes children. Nevertheless, the respondents seem 
somewhat more inclined toward violence than police data for the population as a whole would 
suggest. Typically, however, police do not learn of all crimes, the omitted numbers sometimes 
being called ‘hidden crime’ or the ‘dark figure’ of crime. So, it is reasonable to conclude that the 
sample appropriately reflects the rate of violence. Overall, then, the few comparisons that are 
possible suggest that the sample is not biased to overrepresent the theoretical variables of 
religiosity or to under-represent the degree of low self-control or criminality. 

Nevertheless, because we are testing hypotheses that presumably apply to any collection of 
people, sufficient variation in the variables is actually more important than representativeness of 
the sample. The standard deviations for the principal independent and dependent variable used 
in our analyses (Table 3) suggest that the data fulfill this requirement. 


Measures 


Independent Variables 


Personal Religiosity. Most scholars think the best measures of religiosity include a variety of 
items. Some tap devotional behaviors such as personal prayer and church attendance; others 
focus on salience or meaningfulness of beliefs (Guth and Green 1993), and still others cite the 
degree of literalism attached to religious interpretation (Kellstedt and Smidt 1993; Leege and 
Kellstedt 1993). Following that guideline, we factor analyzed, with the principal components 
method, 10 items representing different aspects of personal religiosity, including literalism, 
salience, frequency of prayer and attendance at religious services.. Although the initial analysis 
yielded two factors with eigenvalues greater than 1.0 (5.359 and 1.541, respectively), the 
magnitude (3.818) of the discontinuity between the first factor and the second suggests an 
appropriate and interpretable single solution (Nunnally and Berstein 1994). The results from a 
forced one-factor solution are listed in Table 1. Four of the items appear to be relevant only for 
Christians or for those with no religious preference. However, the sample contains no Jewish 
respondents and religious preference was unascertainable only for seven people. Since those 
seven are treated as missing cases in the analysis, the effective potential pool of respondents 
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for measuring religiosity is 343, all of whom are oriented either toward Christianity or have no 
religious preference. 


Table 1: Factor Loadings for Items in Personal Religiosity Scale (N = 343) 





Item Loading Mean Std. Dev. 
Religion is a very important part of my life. 807 3.41 811 
| believe it is important to accept Jesus Christ. 803 3.3/, .902 
| believe Jesus actually rose from the dead. .760 3.64 167 
Religion influences how | live my life. .158 3.20 918 
In believe miracles in the Bible actually happened. 157 3.35 879 
| would describe myself as very religious. 155 2.79 .935 
| believe the Bible is the actual word of God. 4135 2.95 1.049 
| believe that those who do not accept God are going to hell. .667 2.96 1.104 
How frequently do you pray? 651 3.62 1.448 
How frequently do you attend religious services? 601 Zeta tite 


To form a factor-based composite scale, we multiplied the z scores for each respondent on a 
given item by the regression factor coefficient for that item and summed across the 10 items. 
The resulting religiosity scale has a mean of zero, a standard deviation of one, and a range 
from -3.13 to 1.22. The alpha value is .89. 


Self-Control 

Self-control has been extensively studied (Baumeister et al. 1994; Pratt and Cullen 2000; White et al. 
1994), with researchers using many different kinds of measures. Remarkably, the self-control 
concept appears to be robust in that it persistently produces at least a modest prediction of 
misconduct, no matter how it is measured (Pratt and Cullen 2000; White et al. 1994).1 Nevertheless, 
the most commonly employed measure of self-control has been a 23-item cognitive-based scale 
incorporating six different elements specified by Gottfredson and Hirschi (1990) as components of 
self-control (Grasmick et al. 1993). However, there has been much debate about the 
unidimensionality of that and similar measures (e.g., Arneklev et al. 1999; Longshore et al. 1996; 
Marcus 2003; Piquero and Rosay 1998; Vazsonyi and Crosswhite 2004) and some challenge to the 
idea that cognitive-based scales, especially ones incorporating the diversity of elements specified by 
Gottfredson and Hirschi, can actually measure self-control (Marcus 2003, 2004). 

While other research using the same survey data employed here shows that behaviorally- 
based measures generate no more favorable evidence than do cognitively-based scales (Tittle et 
al. 2003b), there seems to be ample reason to employ behavioral measures where possible. The 
authors of self-control theory contend that behavioral measures are superior (Hirschi and 
Gottfredson 1993: 48), and measurement experts contend that behavioral measures are more - 
consistent with the theoretical argument and are more likely to have construct validity (Marcus 
2003, 2004). Therefore, we use a 10-item, factor-generated, behavioral scale as one measure of 
self-control. (See Table 2.) This scale was derived by Tittle and his associates (Tittle et al. 2003b), 
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and because we are using the same survey for our analysis, we adopted their instrument. The 
10 items were factor analyzed and combined in the same way as for our religiosity measure, 
producing a scale with a mean of zero, a standard deviation of one, and a range of scores from - 
3.52 to 1.18.* However, Hirschi and Gottfredson (1995: 134) seem to think that variety indexes are 
better than factor-based ones, so we also conducted the analysis using an alternative, ‘variety’ 
type measure of self-control produced by summing dichotomous categorizations of the 10 
behavioral items. Since the substantive results are similar for each of these behavioral measures 
of self-control, we report only those produced by the factor-based scale. 


Table 2: Items Used in Factor-Based Scale and Variety Index of Behavioral Self-Control 





Dichotomized 
Index Items, % with 


Item Loading Mean SD__Low Self-Control 
(1) How often during the past year did 
you drink beer, wine, or hard liquor? 189 4.52 1.51 65 (any drinking) 


(2) On a typical day when you drank 

in the past year, how many drinks did 

you have? That is, how many beers, 

glasses of wine, mixed drinks and 

shots of liquor did you have? .168 4.53 1.54 65 (any drinking) 


(3) Do you usually drink more than 
two or three alcoholic beverages over 
a span of a week? .130 83 38 17 (yes) 


(4) Have you ever had a blackout 

while drinking, that is, where you 

drank enough so that you couldn't 

remember the next day what you had 

said or done? .639 .80 40 20 (yes) 


(5) Have you ever had difficulty 
stopping drinking before you became 


intoxicated? 438 19 41 21 (yes) 
(6) Current marital status 361 2d 15 15 (separated/divorced) 
(7) Do you smoke tobacco products? 300 he 45 28 (yes) 


(8) When you have a cold or some 
other minor ailment, do you usually 
take some kind of medication? 241 14 44 26 (no) 


(9) Do you sometimes get so far in 
debt that it’s hard to see how you will 


get out of it? .208 18 42 22 (yes) 

(10) When you are in an automobile, 

do you always use a seatbelt? 191 61 49 38 (no) 

Alpha = .62 (scale) Alpha = .58 (Index) 
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Dependent Variables 


Survey researchers studying misbehavior have relied mainly on either self-reports of past 
crime or deviance and self-projections of probabilities of misbehaving in the future. 
Reports of past misconduct may reflect memory loss, telescoping and concealing of acts 
that might lead to prosecution or subpoena. And occasionally there is exaggeration of 
some kinds of misbehavior. Similarly, future projections of offending may be 
underestimated or exaggerated to serve respondent self-images, and such projections 
cannot include acts that might be committed in spontaneous reaction to unusual 
situations. 

Despite potential inaccuracies, well-conducted surveys, even household surveys in 
which the identity of the respondent is known to the researchers, can produce reasonably 
accurate self-reports of misbehavior. This has been demonstrated through a variety of 
validation techniques (Hindelang et al. 1981and Thornberry and Krohn 2000), and as a 
result most scholars now employ self-reports or at least assimilate findings based on 
them. 

Although acceptance of the validity of self-projections of future misconduct is not quite 
as widespread as reports of past delicts, their use has been increasing (e.g., Grasmick and 
Bursik 1990; Hay 2001; Tittle et al., 2003), and many now believe they provide a desirable 
alternative to past reports. The growing recognition of future projections of misconduct as 
valid is partly due to their similarity to ‘behavioral intentions’ that were pioneered by 
Fishbein and Ajzen (1975; Ajzen, 1991) and which have been extensively confirmed as 
actual predictors of future misconduct (e.g., Albarracin et al. 2001; Sheppard et al. 1988). 
Furthermore, some research shows that future projections of misbehavior themselves 
predict deviant conduct or subsequent self-reports of misconduct (Green 1989; Murray 
and Erickson 1987). 

The Oklahoma City survey from which our data were taken contains both self-reported 
past misconduct and future projections. Because the two are modestly correlated 
(approximately .50) but with the projections showing higher degrees of misconduct, we 
assume that the two are tapping into a common reservoir of misbehavior, but that the 
reduced level of past reports stems from greater withholding of information. For that and 
other reasons, we report results here based on estimations of future misconduct rather 
than past admissions. Particularly important in our decision was potential causal 
sequencing of self-control and religiosity in producing crime or deviance. Because future 
projections occur at the time of the interview and presumably involve estimates of what 
will happen, the misconduct predicted by them would appear to follow the independent 
variables at issue here, or at least to occur simultaneously with them. However, although 
some of the coefficients vary in magnitude and the significance tests sometimes change, 
the overall pattern of results remains the same for both of the measures. Hence, we use 
the projections because they make more causal sense and seem to be more accurate? 

Variety of Crime/Deviance. Because Hirschi and Gottfredson (1995: 134) emphasize 
that low levels of self-control are best reflected in the number of problem behaviors a 
person displays, it also seems reasonable to imagine that variety measures of crime would 
be most relevant for testing their ideas about how self-control restrains misbehavior. 
Therefore, we constructed an index of the variety of crimes individuals estimate they are 
likely to commit in the foreseeable future (Future Variety). The Future Variety index (mean 
= 1.01, s.d. = 1.23, alpha = .61) was created by summing five dichotomous indicators of 
projected misbehavior representing illegal gambling, petty theft, DUI, assault and tax 
evasion. The questions asked individuals whether they will ‘ever commit the specific act in 
the future.” Responses were reverse-coded (0 = no, 1 = yes) so that a high score reflects 
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a likelihood of future misbehavior. Total scores for the variety of misbehavior measure 
range from 0 (denoting low chances of misbehavior) to 5 (denoting high likelihood of 
committing many different deviant acts). 


Specific Offenses 

Projections of probabilities of the same five offenses (i.e., assault, illegal gambling, DUI, 
petty theft and tax evasion) incorporated within the Future Variety index are also used as 
offense specific measures. 


Control Variables 


Six control variables, all assumed to be antecedent to self-control, religiosity and 
crime/deviance, are included in all multivariate analyses to help eliminate the possibility of 
spuriousness. They are: gender (1 = male, 0 = female), race (1 = white, 0 = nonwhite), age 
(coded in actual years; mean = 46.0, s.d. = 17.1), education, family intactness during 
childhood and place of residence while growing up. We are assuming that the respondent's 
level of education roughly reflects the educational level attained by his or her parents. It is 
coded as follows: 0 = less than high school, 1 = high school graduate, 2 = post high school 
training, 3 = some college, 4 = college graduate, 5 = some graduate work, 6 = master’s 
degree, 7 = post-master’s work, 8 = advanced graduate degree (J.D., PhD., M.D.). The 
structure of the respondent's family during childhood was inferred from an item asking the 
respondents to generally describe their family situation when growing up: whether they lived 
with only their mother or mother-figure, with only their father or father-figure, with both a 
mother (or mother-figure) and father (or father-figure), or in some other type of situation. We 
coded the responses as: 1 = lived only with father/father-figure or other situation; 2 = lived 
only with mother/mother-figure; 3 = lived with both parents (biological or surrogate). Place 
of childhood residence was coded so that: 1 = farm; 2 = in a rural area, but not farm; 3 = 
small town of 10,000 or less not located near a bigger city; 4 = small town of 10,000 or less 
located near a bigger city; and 5 = city of 10,000 or more. 

Ideally one would control for all antecedent variables that could possibly affect the 
relationship between the independent variables or the independent variables and the self- 
projected measures of crime, but the data set does not contain information relevant to all 
potentially important variables (e.g., respondent's |Q, additional aspects of family 
socioeconomic status when the respondent was a child, parental religiosity, etc.). However, 
given that our principal goal is to determine whether the net effects of religiosity and self- 
control are empirically independent, and whether they interact in shaping misbehavior, control 
of all possible extraneous variables becomes less important. 


Analyses 


In initial analyses, we use OLS multiple regression procedures to estimate basic models that 
include in various combinations the measures of personal religiosity, self-control, the six 
control variables described earlier, and a multiplicative interaction term representing the joint 
effect of personal religiosity and self-control. The Future Variety index is treated as the 
dependent measure and regressed on the set of independent and control variables. The five 
dichotomous measures of the specific offenses were also regressed on the same set of 
independent and control variables, using binary logistic regression techniques. Descriptive 
statistics for all variables used in the analyses are reported in Table 3. 
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Table 3: Descriptive Statistics for Variables Used in the Analyses 








Variables Mean sD Range 
Age 46.0 17.10 18-90 
Gender : 44 .50 0-1 

Race .80 40 0-1 
Educational Level 2.78 1.89 0-8 
Family Intactness During Childhood 2.74 aL 1-4 

Type of Place of Residence During Childhood 3.61 1.58 1-5 
Behavioral Self-Control Scale 01 .99 -3,52-1.18 
Personal Religiosity .00 1.00 -3.13-1.22 
Variety of Future Deviance 81 1.10 0-5 
Projected Gambling 29 45 0-1 
Projected Petty Theft .06 23 0-1 
Project DUI “ee 42 0-1 
Projected Assault .08 Ah 0-1 
Projected Tax Evasion eur 38 0-1 

Results 


Are Religious People More Self-Controlled? 


Table 4 reports the basic associations between our measure of self-control and religiosity. 
Consistent with previous research, the coefficients show that religious people in Oklahoma 
City in 1994 did exhibit more self-control than less religious people, even with other variables 
held constant (Model 2). The coefficient is statistically significant and at least modest in size 
(beta = .38), indicating that the higher the religiosity, the higher the self-control of the person. 


Is the Effect of Religiosity Spurious? 


Results from the OLS regression examining the net relationships between the general future 
deviance measure and the measures of religiosity and self-control are presented in Table 5, which 
contains six columns representing different models. Model 1 shows the association between self- 
control and the general index of misconduct while Model 2 shows the same thing for religiosity. 
Model 3 shows the coefficient for self-control with the control variables included in the equation. 
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Table 4: Zero Order and Net Relationships (Standardized Coefficients) Between Behavioral Self- 
Control, Personal Religiosity and Demographic Controls 


ee 
Behavioral Self-Control Scale 


1 2 
Personal Religiosity he .38* 
Gender -.21* 
Race Ade 
Age 01* 
Educational Level 10% 
Family Intactness During Childhood Ada 
Types of Place of Childhood Residence -.10* 
Constant: 1.41 -.42 
Adjusted R*: .15* 28% 





* Significant at least at p < .05. 


Model 4 shows the ‘controlled’ coefficient for religiosity. Model 5 shows coefficients for self- 
control and religiosity when both are in the equation, along with the control variables. Finally Model 
6 includes all of the predictors as well as a multiplicative term representing the interaction between 
self-control and religiosity in predicting misconduct. Note that all of the variables in Model 6 are 
centered (Aiken and West 1991) while the variables in models 1-5 are in their original metrics. 

First, consistent with prior research, the coefficients for Model 4 suggest that religious 
individuals are less likely to commit a variety of offenses in the future (Row 2, beta = -.32), 
even with other variables controlled. Also, as demonstrated previously with these data (Tittle 
et al., 2003b), self-control is negatively related to the chances of crime, even with other 
variables controlled (Model 3, Row 1, Column 3, beta = -.38). However, as the coefficients 
for Model 5 show, when both religiosity and self-control are included in the same predictive 
equation, both show modest and statistically significant associations with future misconduct 
(-.23 and -.30, respectively). In other words, the effects of religiosity on conformity do not 
appear to be spuriously attributable to prior levels of self-control. Instead, religiosity and self- 
control seem to operate independently of one another.’ 


Do Self-Control and Religiosity Interact in Predicting Deviance? 
Although it would seem logical to expect the effects of religiosity to be enhanced by self- 


control, there appears to be no interaction in these data between the two in predicting 
conformity (Model 6). The multiplicative interaction term is small and insignificant (beta = -.04), 
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Table 5: Standardized Net Coefficients From OLS Analysis Predicting the Variety of Future 
Deviant Acts Respondents Report They Are Likely To Commit 


TEESE 


Independent Variable Models: 1 2 3 4 5 6 
Behavioral Self-Control -.45* -.38* -.30* -.31* 
Religiosity -.38* B32) = 2513.20 
Gender ie tame e © chm Le 
Race 01 04 -01  -.01 
Age 12° tl) =A0 2.08 
Educational Level 02  -06 -.03  -.03 
Family Intactness During Childhood --04 -06 -03 -.03 
Type of Place of Residence During Childhood 02 01 -.03 -.03 
Behavioral Self-Control X Religiosity -.04 
Constant: B2e tS Vee ok ee mena 
Adjusted R**: AQ Sot teeth 2g: 2525 





* Predictor variables are centered in Model 6. 
* Significance level at least at p < .05. 
** All models significant at least at p < .001. 


again suggesting that self-control and religiosity operate on independent tracks in inhibiting 
crime. Both are negatively related to future projections of committing a variety of criminal acts 
and the two are related to each other, but the effects of neither is accounted for by the other 
nor are the levels of prediction provided by either affected by the degree of the other. 


Do the Results Vary by Type of Offense? 


Separate analyses using binary logistic regression (Table 6) of the specific dichotomously 
coded offenses show that the results vary considerably from offense to offense. 

These differences may be due to restricted variation in dichotomous measures. Indeed, with 
the logistic regression, many of the coefficients remain large but no longer reach statistical 
significance. Nevertheless, potential variations by specific offense should be noted. First, the 
basic results described above for the variety index of projected misconduct are repeated exactly 
for gambling and DUI, two offenses that are traditionally referred to as ‘ascetic,’ meaning that 
they are acts more likely to be prohibited by religious communities than by the larger society. In 
previous research, ascetic offenses have usually been more strongly affected by religiosity than 
are other offenses (Burkett and White 1974; Harris 2003). Neither self-control nor religiosity 
show significant relationships with petty theft (which, of course, renders the issue of 
spuriousness irrelevant); however, when combined, both interact to significantly reduce the 
likelihood of a future occurrence (beta = -.61). Furthermore, though religiosity significantly 
predicts assault (beta = -.70), self-control does not and there is no interaction between the two. 
By contrast, self-control appears to predict tax evasion while religiosity fails to do so, although 
there is a significant negative interaction between the two in predicting tax evasion, suggesting 
that the effects of self-control are greatest when religiosity is low. 
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Table 6: Logistic Regression Coefficients and Odds Ratios (in parentheses) for Measures of 
Behavioral Self-Control, Religiosity and Interactions from Analyses Predicting Types of Future 
Offenses Respondents Report They Are Likely to Commit 





Types of Specific Offenses 
Petty Tax 
Predictors Gambling Theft DUI Assault Evasion 
Behavioral Self-Control -.68* -.33 -.88" -.26 -.40" 
(.50) (.28) (.59) (.23) (.33) 
Personal Religiosity -.45* -.47 -.51" -.10* -.23 
(.36) (.38) (.40) (.50) (.21) 
Behavioral Self-Control X Religiosity” -.06 -.61* -.08 -.20 -.30* 


(06). (45) (.08) (18) (26) 


-2 Log Likelihood 324.90 112.92 263.72 146.73 283.30 
Pseudo- R? at 1935 a8 16* 08" 





- pippiticant at least at p<.05 level. 

equations include the following controls: gender, race, age, level of 
education, intactness of family structure during childhood, and type of 
place of childhood residence. 


Discussion 


Several patterns of findings deserve closer scrutiny. First, it is apparent that self-control and 
personal religiosity are both inversely related to various forms of crime/deviance that individuals 
report, even after the effects of important antecedent background variables have been 
controlled and with the two independent variables in the equation. Thus, any effects of self- 
control and religiosity would seem to be relatively independent of each other. The net 
associations of religiosity with various measures of crime in these data vary from low to 
moderate, replicating the general pattern reported by Baier and Wright (2001). Compared to the 
net coefficients for religiosity, those representing the associations of self-control with various 
kinds of deviance generally appear to be somewhat larger, reinforcing the importance that 
theorists attribute to self-control as a predictor of misconduct. However, Gottfredson and 
Hirschi’s provocative contention that relationships between traditional social variables, 
presumably including religiosity, are spuriously due to prior levels of self-control is not supported 
here. It was primarily that implication that inspired our analysis, so it is important to emphasize 
that self-control does not seem to operate with the potency anticipated by the theorists. Though 
it is a persistent predictor of deviance, regardless of measurement procedures, self-control does 
not appear to be the master variable in explaining conformity. Previous research has already 
shown that other variables, such as peer influences, operate independently of, and compete 
with, self-control (Pratt and Cullen 2000), and here we find that another well known social 
variable, religiosity, also seems to perform independently and in competition with self-control. 
Thus, though self-control appears to be an important factor in the explanation of conformity, it 
does not seem to occupy the premier position that its advocates envision. 

Although none of the models we examined account for more than 35 percent of the 
variance in the measures of crime and deviance (as measured by Adjusted R? or Pseudo-F’) the 
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models do seem to predict at least to a modest extent the variety of crimes an individual is likely 
to commit. The models appear to do somewhat worse in predicting specific offenses than in 
predicting offending in general, except for gambling and DUI. Both self-control and general 
religious constraints tend to promote various kinds of abstinence, so it is not surprising that they 
would bear a stronger relation to behaviors that reflect the momentary or complete failure of 
self-denial. It is also not surprising that religious values might strongly constrain violence. 
However, it may be that neither self-control nor religiosity affect all forms of misconduct. 

By itself, the measure of neither self-control nor religiosity predicts an individual's 
tendency to commit theft, and only the measure of self-control predicts tax evasion, reversing 
the pattern observed in the model predicting assault. But the interaction of self-control and 
religiosity does predict both projected theft and tax evasion. In trying to understand forces 
that shape one’s decision to evade paying taxes, the findings may reflect the fact that such an 
act typically involves substantial forethought and planning, constituent elements of high self- 
control. Conversely, such acts as petty theft (e.g., illustrated in a case of shoplifting) and 
assault may have a more impulsive or spur of the moment aspect associated with them, one 
that bears a much weaker relationship to the more general, long-term measure of self-control 
used in the analysis. This suggests that it might be possible, as well as theoretically advisable, 
to specify that the effects of self-control are contingent on the nature of the misconduct, with 
self-control expected to be a more effective predictor of spontaneous acts. Indeed, critics 
have frequently noted that some forms of crime, such as white collar or organized crime, 
appear to require fairly high levels of self-control rather than reflecting low levels (Geis 2000; 
Tittle 1991; Reed and Yeager 1996). Alternatively, self-control might be re-conceptualized to 
include long-term and short-term components. However, the relatively weak net relationships, 
particularly those where the coefficients for specific offenses turn out to be insignificant — 
assault and theft — may simply reflect the limited variance to be explained in the dependent 
measures. 

It is also worth noting that the inhibitory effects of personal religiosity sometimes seem to 
be enhanced when combined with the force of self-control. The existence of such an 
interaction, although perhaps limited to certain specific offenses, suggests that researchers 
and theorists would be well advised to concentrate on various contingencies and how they 
operate. Until this is done, self-control theory seems unlikely to achieve its full explanatory 
potential even with improved measurement (Marcus 2004). 

Although religiosity appears to operate independently of self-control and therefore 
probably has the importance that most social theorists have attributed to it (Stark and 
Bainbridge 1996), we are unable to identify precisely which mechanisms associated with 
religion are most responsible for promoting conformity. Without more extensive data about 
various aspects of religiosity, we cannot parcel the apparent inhibitory effects to one or 
another specific aspect of religion. Clarification of which properties of religion promote 
conformity most powerfully, therefore, remains a major task for scholars. 


Conclusion 


Despite the fact that religious people do seem to have more self-control than others, self- 
control does not explain associations between religiosity and many forms of conformity. Both 
self-control and religiosity have independent influences on misconduct and only sometimes 
interact, though such influences depend on the type of offense in question. Although self- 
control is a persistent predictor of most forms of deviance, it does not seem to account for 
effects of religiosity in predicting those same misbehaviors. Exactly what it is about religiosity 
that leads to greater conformity remains an open question. 
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Although some may be concerned about the vintage of the data used in the analyses, we 
believe such concern is relatively misplaced because we are not interested in estimating the 
marginal distributions of religiosity, self-control and criminal intent at some particular time. 
Rather, our goal is to examine theoretical relationships among these variables, relationships 
that are unlikely to have changed dramatically within a relatively short period and that appear to 
withstand the test of time. While our findings suggest that self-control is not as powerful as 
some think, reinforcing previous research about the import of religiosity in generating social 
conformity, such conclusions cannot be considered entirely definitive. We merely offer these 
results as one piece of evidence relevant to an important issue. A collection of similar relevant 
findings from other studies must also confirm what we have found before the challenge posed 
by self-control theory can be turned aside. Until then, however, we present these results in the 
hope that will they provoke further investigation of the mechanisms by which religiosity inhibits 
misconduct. 


Notes 


1. Arecent 67-item behavioral self-report scale is reported to predict workplace deviance much 
better than usual (.60 or above vs. approximately .20 to .30) for other measures (Marcus 
2004; Marcus et al. 2002). However, many of its items appear very similar to those in the 
misbehavior scale that was being predicted. For example, three of the self-control items are 
about drinking while five of the work misconduct items concern drinking, and three of the 
self-control items concern theft while at least six of the work place misbehaviors concern 
various kinds of theft. Since the theorists have been criticized for having proposed a possibly 
tautological argument (Akers 1991; Geis 2000; Reed and Yeager 1996; Tittle 1991), it would 
appear unwise to accept the Marcus scale without further validation. 


2. The means and standard deviations reported in Table 2 deviate slightly from those reported 
by Tittle et al. (2003b) because of rounding and the listwise deletion of different numbers of 
cases. In addition, whereas they coded the dichotomous items as 1 and 2, we used 0 and 1. 


3. All of the results can be obtained by request to the first author. 


4. Wealso performed the same OLS regression using a factor-based general deviance index 
that was developed earlier by Tittle et al. (2003a) and which consists of seven self-report 
items that indicate whether an individual has committed a variety of criminal or deviant 
acts during the past five years or since becoming an adult, and five items that indicate 
whether he or she is likely to commit such acts in the future. The acts range from illegal 
gambling, DUI and tax evasion to petty theft and assault, including threats of physical 
harm. These items formed a single standardized factor-based scale with an alpha of .77. 
Because results from this analysis were similar to those we obtained in the model 
presented in Table 5, we report only those using the measure of projected deviance to 
help overcome difficulties in interpreting causal directionality. 
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The Poverty of Trust in the Southern United States 
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Abstract 
This paper bridges two lines of research. One line shows that social relations in the southern 
United States are more “collectivist” than social relations in non-southern regions. The second 
line of work argues that collectivist social relations generate lower levels of general trust than 
individualist social relations. At the intersection of these two arguments is the prediction that 
Southerners are, on average, less trusting than non-Southerners. I test this prediction using 
trust measures taken from the General Social Survey. As expected, results from whites, but not 
blacks, show the predicted regional differences. Importantly, regional differences in trust occur 
after controlling for regional variation in other factors related to trust. I conclude by outlining 
various implications of the findings and questions for future research. 


Introduction 


Research on trust occupies an increasingly central place in the social sciences. The growing 
ubiquity of work on the topic stems in part from the wide-ranging consequences of trust 
(Glaeser et al. 2000). As Rotter (1980, 443) writes: “The entire fabric of our day-to-day living, of 
our social order, rests on trust... almost all of our decisions involve trusting someone else.” 
Given the extensive benefits of trust, several new lines of research have emerged to explain its 
origins. One of the most active research programs explains variation in trust with reference to 
the different types of social ties that link actors to each other and to the social groups to which 
they belong (Buchan et al. 2002; Yamagishi 1988; Yamagishi and Yamagishi 1994). Most of this 
work focuses on how international differences in trust (e.g., between Japanese and Americans) 
are linked to cross-national differences in the patterning of social relations (Miller and Kanazawa 
2000; Yamagishi 1988; Yamagishi and Yamagishi 1994). The central demonstration of this 
research program, discussed in detail below, is that socia/ relations in “collectivist societies” 
(e.g., Japan) generate lower levels of trust than social relations in individualist societies, such 
as the United States. This relationship stands in contrast to the conventional view that collectivist 
cultures generate more trust and cooperation than individualist cultures. 

The key aim of the present research is to demonstrate that there are significant 
intranational differences in trust in the United States.' | show that these regional differences 
in trust can be predicted by extending theory developed to explain trust differences between 
nations. Specifically, | integrate research on the negative relationship between collectivist 
social relations and trust with research showing that the southern United States is more 
collectivist than non-southern regions. The implication is straightforward: we should observe 
lower levels of trust in the South. 

An extension of the theory to intranational differences in trust could have a number of 
theoretical and empirical payoffs. In addition to extending the explanatory domain of the 
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theory, the extension may also increase precision. International applications, by definition, 
gloss over variation within nations (Voronov and Singer 2002). By locating and explaining 
regional peaks and valleys of trust within nations, we may gain more insight into which (if any) 
regions are most influential in the creation of international differences. 

Beyond increased scope and precision, the extension to intranational differences can 
serve as a reality check on international “cross-cultural” studies, where differences such 
as language may confound relationships between key variables (Miller and Mitamura 2003; 
Vandello and Cohen 1999). Most important for current purposes, a large literature links low 
trust to a variety of social ills. The discovery of regional differences in trust reported in this 
paper may therefore help pave the way to an increased understanding of regional 
differences in important social and economic problems. These and other implications are 
taken up later. 

The remainder of this paper is organized as follows. | begin by reviewing research on 
international differences in social relations and trust. After discussing how collectivist social 
relations generate lower levels of trust than individualist social relations, | address why social 
relations in the southern United States are more collectivist than social relations in the non- 
South. | then describe the data and methods used to test the main argument of the paper: net 
of regional differences in other variables associated with trust, Southerners are less trusting than 
non-Southerners. As discussed below, previous work suggests that regional differences in trust 
will be limited to (non-Hispanic) whites. But, for comparison purposes, | also assess the 
argument using data from black respondents. After presenting the results, | discuss limitations 
of the current work and conclude with some suggestions for addressing these limitations. 


Collectivism and Trust 


The traditional framework for understanding differences between individualist and collectivist 
societies (e.g., Triandis 1995) suggests that, given a tension between individual and collective 
interests, actors in collectivist societies tend to give greater priority to group goals. But 
research by Yamagishi and colleagues calls into question the traditional argument that a 
tendency for actors in collectivist societies to prioritize collective — over individual — goals 
results from worldviews or broad cultural values that emphasize cooperation and trust 
(Hayashi et al. 1999; Yamagishi and Yamagishi 1994). Their research explains such group- 
oriented behavior in collectivist societies with reference to co//ectivist social relations and the 
concomitant high level of internal monitoring and sanctioning within such groups. This is the 
approach taken in the current explanation. That is, | focus on collectivist vs. individualist social 
relations, rather than collectivist vs. individualist worldviews. Thus, when | henceforth refer 
to collectivism and individualism, | am referring to characteristics of social relations rather than 
more diffuse (cultural or personal) worldviews. 

Compared to individualist social relations, collectivist social relations tend to be strong and 
densely connected. In addition, collectivist social relations tend to be multiplex (organized 
around a number of issues) while individualist relations tend to be simplex (organized around 
a single issue). For example, if Anna and Bob are connected by a multiplex social relation, they 
will engage in a number of activities together (e.g., watching movies, talking politics and 
golfing). Meanwhile, if Anna and Bob are connected by a simplex relation, they will tend to 
participate in only one type of activity together. The greater tendency for actors connected 
by collectivist social relations to spend much time engaged in many different activities 
together leads to continual monitoring and, when necessary, sanctioning of each other's 
behavior (Miller and Kanazawa 2000). At issue is how differences in the patterning of social 
relations — and the concomitant monitoring and sanctioning — lead to differences in trust. 
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Central to the theory of collectivism and trust is the distinction between general trust and 
assurance. General trust is an “expectation of goodwill or benign /ntent,” whereas assurance 
is “expectations of benign behavior for reasons other than the goodwill of alter” (Yamagishi 
and Yamagishi 1994, 132).? Thus, in assurance relations, Person A has positive expectations 
about Person B's behavior because it is in B's self-interest to act benignly. 

According to the theory, the high level of informal monitoring and sanctioning that 
characterize collectivist social relations limits expectations of benign behavior to relations 
within group boundaries. That is, monitoring and sanctioning systems increase an actor's 
expectation that others within his or her ingroup will act benevolently (i.e., they generate 
assurance). However, when actors go outside their groups to interact with strangers, the 
safety net of the sanctioning system disappears, leaving them with relatively low expectations 
of trustworthy behavior from others. That is, they have low levels of general trust, a prediction 
supported in a number of recent studies (Hayashi et al. 1999; Yamagishi 1988; Yamagishi and 
Yamagishi 1994).° 

In sum, a number of studies suggest that compared to individualist relations, collectivist 
social relations (through assurance) tend to reduce the need for general trust and thus, by 
extension, inhibit the development of general trust outside group boundaries. The resulting 
distrust, in turn, may serve to reproduce these collectivist relations because distrusters will 
tend to stick with assurance relations rather than risk interaction outside group boundaries. 
The comparatively lax individualist social relations, on the other hand, tend to increase general 
trust.4 Thus, by linking variation in trust to international differences in social structures, this 
research program underscores the central role of trust in the creation and persistence of 
cross-national differences in a variety of social and economic phenomena. 

In the sections to follow, | extend the theory to predict cross-regional differences in trust in 
the United States. But first | turn to the issue of why southern whites’ social relations are more 
collectivist than non-southern whites’ social relations. | then argue, based on the above 
reasoning, that we should expect lower levels of trust among whites in the South. Thereafter 
| discuss why we should not expect similar regional differences for blacks. 


Collectivism in the Southern United States 


Sociologists (Reed 1974; 1983), social psychologists (Conway et al. 2001; Vandello and 
Cohen 1999), and historians (Grantham 1994) have variously argued that, compared to non- 
southern regions, social relations in the South are characterized by collectivism. Vandello and 
Cohen (1999) outline four primary causes of this difference: defeat in the Civil War, history of 
slavery, poverty and prominence of religion. | discuss each briefly in turn. 

Following Reed (1974), Vandello and Cohen argue that loss of the Civil War led Southerners 
to define themselves as a people and region in opposition to a “powerful external threat.” 
Vandello and Cohen liken this perception to that of a minority group that coalesces in reaction to 
potential hostility from a more powerful majority. The end result, Vandello and Cohen assert is 
localist tendencies and strong ingroup identification. (See also Grantham 1994.) 

Vandello and Cohen link collectivist social structures in the contemporary South to the 
region's history of slavery in several ways. First, the agricultural system of the slave South was 
collectively oriented insofar as work was carried out by groups with highly interdependent 
members rather than by isolated individuals. Furthermore, the strongly overlapping 
categorical distinctions that characterized the institution of slavery (slave/master, white/black) 
helped solidify status hierarchies and, following social identity theory (Turner 1985), ingroup 
boundaries. The coupling of congruent status divisions with ingroup/outgroup distinctions, 
Vandello and Cohen (1999) argue, led to more collectivist social relations. 
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Finally, Vandello and Cohen suggest that poverty and the prominence of religion contribute 
to collectivist structures in the South. Poverty increases collectivism due to the increased 
reliance on social ties and fellow ingroup members. Religious groups can produce similar 
effects to the extent that they encourage multiplex relations (e.g., social, emotional and financial 
support between members) and emphasize ingroup-outgroup distinctions (lannaccone 1994). 

Empirical work has uncovered a number of indicators of the higher levels of collectivism in 
the South. For example, Reed (1974) shows that.Southerners have stronger attachments to 
their friends, families and others in their community than non-Southerners. Similarly, these 
surveys show much stronger norms of obedience to groups such as the family in the South 
than the North. Importantly, more recent work by Reed (1993) and others (Grantham 1994; 
Vandello and Cohen 1999) shows that these regional differences have largely persisted over 
the past several decades. 


Collectivism and Trust Among African Americans 


Following the logic of the argument outlined earlier, the greater propensity of southern social 
structures towards collectivism should produce lower levels of general trust in the South. 
However, there are a number of reasons that we may not expect to observe such regional 
differences among African Americans. First, the “Great Migration” of southern blacks into the 
Northeast and Midwest is a relatively recent phenomenon. That is, many blacks currently 
living in the non-South were born in the South, and many others are first generation non- 
Southerners. (See Tolnay 2003.) A tendency for southern black out-migrants to carry with 
them various codes for relating to others, and to pass these codes on to their children 
(explicitly or not), can be expected to attenuate differences in collectivism — and by extension, 
trust - between southern and non-southern blacks. Similarly, as noted by Tolnay (2003) (see 
also Lemann 1991), participants in the Great Migration tended to join friends and family 
members who migrated before them. This resulted in African American enclaves, many of 
which likely had features of collectivist communities.® 

Finally, some factors discussed earlier that lead to higher levels of collectivism among 
white Southerners either do not apply to black Southerners or apply equally to blacks in the 
South and non-South. For example, the typical reaction of black Southerners to loss of the 
Civil War was obviously quite different than the typical reaction of white Southerners.® 

The main analyses outlined below focus on regional differences in trust among whites. This 
focus is consistent with most of the recent work on intranational cross-cultural comparisons 
(Nisbett and Cohen 1996). After analyzing the trust responses of white respondents, | then turn 
to whether there are regional differences in trust among African Americans.’ 


Predictions 


The collectivist social relations a distrust argument suggests we should expect white 
Southerners to report lower levels of trust — net of other factors normally associated with trust 
and region. Although no specific predictions are made about differences between whites in 
non-southern regions at this stage of the research, any such differences are expected to be 
small compared to the differences between the South and non-South. Thus, the analyses 
reported below simply compare Southerners to non-Southerners. 

Previous work points to several factors that, if not controlled, may create a spurious 
relationship between region and trust. First, the South has a higher proportion of religious 
fundamentalists than the non-South. Because fundamentalist churches tend to be characterized 
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by collectivist social relations, (lannaccone 1994), Miller and Kanazawa (2000, 112-13) predicted 
and found that fundamentalists are less trusting than non-fundamentalists. Similarly, some 
previous work points to possible differences in trust between adherents to different religions (c.f., 
Glaeser et al. 2000; Alesina and La Ferrara 2002). Thus, the analyses reported below include 
various controls for religion. 

Both income and education (Alesina and La Ferrara 2002; Glaeser et al. 2000) positively 
affect trust. Because Southerners have somewhat lower incomes and less education than 
persons in non-southern regions (based on my own analyses of data from the General Social 
Survey), | control for both these variables. Finally, previous work has shown that trust is 
negatively related to community size (Putnam 2000). Because average community size is 
smaller in the South, | control for this variable. 

In sum, there are a number of factors that previous work suggests may generate lower 
levels of trust in the South (e.g., income and education), as well as factors that may lead to 
more trust (smaller communities). Importantly, as discussed above, a number of these factors 
are associated with the genesis of collectivist social relations in the South. However, the 
arguments outlined earlier suggest that, once established, we should expect collectivist social 
relations to persist net of these antecedents. 


Data and Measures 


This section describes the data and measures used to test the arguments outlined above. The 
data are taken from the General Social Survey (hereafter GSS), a full probability survey of English 
speaking adults living in non-institutional settings in the United States. The National Opinion 
Research Center has conducted the survey annually from 1972 to 1994 and, since then, biennially. 
The analyses reported below use data from interviews conducted from 1972 to 2000. 

As shown in Table 1, assignment of states to the South vs. non-South is based on the 
Census Bureau's classification.® It is important to note that the analyses of this paper regress 
trust on region (and various control variables), rather than measures of collectivist social 
relations. The logic of this test is based on the demonstration in previous work reviewed above 
that the South is more collectivist than the non-South. This approach is analogous to that taken 
in international comparisons (Hayashi et al. 1999; Yamagishi 1988). These studies assume, 
based on prior work, international variation in collectivist social relations and predict international 
differences in trust from that assumption. The current analytic approach is also consistent with 
comparisons of different social categories within the United States, such as Miller and 
Kanazawa’s (2000) study of trust differences between members of fundamentalist vs. mainline 
churches. None of these studies directly measures collectivist social relations. 


Dependent Measures 


The GSS contains three questions that tap into respondents’ level of general trust (Table 1). 
The first measure, trust, is the one normally used in research on trust, but some researchers 
have pointed to potential problems with this measure. Miller and Mitamura (2003), for 
example, suggest that the standard trust measure may conflate trust with the related concept 
caution. This potential conflation is more serious, they suggest, when the measure is used in 
cross-national comparisons, where languages “necessarily involve slight changes in the 
meaning, and are read and answered in different social contexts.” (2003, 62) Thus, if analyses 
with the trust measure uncover the predicted trust differences, and these differences are 
analogous to those reported in cross-national applications, the current research would offer 
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Table 1: Descriptions of Variables 


Variable Description 

Sumtrust Simple combination of three binary measures: trust, helpful, and fair (see 
below): Range = 0-3, Mean = 1.57 

Trust Generally speaking, would you say that most people can be trusted (coded 


1) or that you can’t be too careful in dealing with people (coded 0)? 


Helpful Would you ey that most of the time people try to be helpful (1), or that 
they are mostly just looking out for themselves (0)? 


Fair Do you think most people would try to take advantage of you if they got a 
chance (0), or would they try to be fair (1)? 


Region Region respondent lived at time of interview, based on Census Bureau 
classification: South = AL, AK, DE, DC, FL, GA, KY, LA, MD, MS, NC, OK, 
SC, TN, TX, VA, WV. Non-South = all remaining states. 


Community Type community R lives: Large City (Central city of 112 largest SMSAs), 
Suburb (Suburb of 112 largest SMSAs), Smaller City (Counties having 
towns of 10,000 or more), Rural (Counties having no towns of 10,000 or 
more). 


Fundamentalist Whether R considers herself fundamentalist (1) or moderate or liberal (0) 


Income R's family income (in constant 1986 dollars) in 1,000s: Range = .33- 
143.78, Mean = 30.43 


Education Number of years of schooling: Range = 0-20, Mean = 12.47 
Religion Respondent's Religion: Catholic, Jewish, Protestant, Non-religious 


Race-live Whether African Americans live in respondent's community at time of 
interview (whites only): Yes = 1 (50.2%), No = 0 


ee 


additional confidence in the measure, and the conclusions drawn from cross-national 
applications. 

In either case, Yamagishi et al. (1999) show that the he/pful measure solves the potential 
confound suggested by Miller and Mitamura. They write (1999, 148), “in contrast to [trust], 
which is double-barreled, i.e., measuring trust on the one hand and need for prudence on the 
other, response categories to [he/pfu/] represent two extremes ona single dimension, belief 
in human benevolence.” The two response categories for fair also fall ona single dimension. 
Thus, if the trust measure is confounded by the problems suggested by Miller and Mitamura, 
the use of helpful and fair provide appropriate alternative measures. As will be shown 
immediately below, however, there is a strong level of agreement between the three 
measures, and regional differences in responses to the three measures are highly consistent. 

Preliminary analyses (available upon request) revealed highly consistent regional 
differences in responses to the trust items, and the three measures are highly correlated (a 
= .68), suggesting that they tap into the same underlying concept, general trust. Thus, 
responses to the three measures were summed to create a new variable, sumtrust, which 
ranges from 0 to 3 (Table 1). The analyses reported below are based on this measure. 
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(Analyses with each of the three binary measures yield the same general conclusions as 
those reported below.) 


Control Variables 


Based on the above discussion, the analyses implement several control variables, all of which 
are given in Table 1. In addition to income, education and community type, the analyses 
control for whether the respondent is fundamentalist (vs. moderate or liberal). | also control 
for respondent's religion using a series of dummy variables. Finally, | control for respondent's 
gender (following Alesina and La Ferrara 2003) and age (Glaeser et al. 2000). 


Results 


The arguments presented earlier suggest that, net of the control variables, we should observe 
regional differences in trust among whites, but not blacks. Thus, | begin the analyses with 
whites and then repeat the analyses using data on black respondents. 

As an initial step, | regressed the main dependent measure (sumtrust) on the region dummy 
variable, the control variables and all two-way interactions between the region dummies and 
control variables. The only interactions that reached significance involved the community type 
dummies. But given that these interaction terms did not affect key results, they were removed. 
(These more detailed analyses are available upon request.) The discussion to follow is based 
on the Model 1a analysis (Table 2), which includes only main effects. 

Model 1a shows that, net of the control variables, there is a strong effect of region on 
trust. Southerners respond far less trustingly than non-Southerners (p < .001). To assess the 
robustness of this finding, | conducted several variations on the Model 1a analysis: | included 
alternative (and additional) control variables; | conducted separate binary logistic regressions 
for each of the three binary measures of trust (Table 1); and | compared the South to each 
non-southern region, rather than collapsing non-southern regions. | could not make the basic 
effect go away. Thus, southern whites’ lower trust appears to be highly robust. 


Results for African Americans 


As noted earlier, there are a number of reasons that we should not expect the predicted 
regional differences in trust to apply to African Americans. To address this point, | repeated 
the Model 1a analyses reported above with data from black respondents. The results of this 
analysis are given in Model 2 (Table 2). As expected, net of the control variables, black 
Southerners are not less trusting than black non-Southerners (p = .72). 


Alternative Explanations 


The results reported thus far provide support for the argument linking region and trust. But 
recent work by Alesina and La Ferrara (2002) suggests a possible alternative explanation for 
those findings. These authors show that, ceteris paribus, living in a community with blacks has 
a negative effect on whites’ general trust. Thus, whites who live near blacks can be expected 
to report being less trusting of others in general than whites who do not live near blacks. (They 
report that blacks’ general trust is unaffected by a community's racial heterogeneity.) The 
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Table 2: Beta Coefficients from Multiple Regression Analyses for Whites (Models 1la&b) and 
Blacks (Model 2) 








Model 1a Model 1b Model 2 
Variable (N= 17,725) (N=16,967) (N= 2,669) 
South (vs. non-South) -.080**** -.082**** .008 
Community (ref. = smaller city) 

Large City =03l -.033**** -.024 
Suburb -.041**** -.035**** -.054* 
Rural .029**** De -.029 
Male -.049**** -.048**** -.048** 
Age 194**** .194**** MOV 
Education eae at ae SO 
Income OCS .099**** “ot 
Fundamentalist -.076**** 073" -.009 
Religion (ref. = not relig.) 
Protestant OSR .084**** 013 
Catholic 033% .031* .056* 
Jew -.012 -.014 .016 
Race-live -.026**** 





*p ==" .05" **p 01 “p = 005 “**p = .001 (two-tailed) 


question thus becomes whether regional differences in trust can be explained by a greater 
tendency for southern whites (compared to non-southern whites) to live near blacks. 

To address this question, | repeated the Model 1a analysis for whites, but replaced the 
region variables with a new variable race-live (Table 1). As an initial step, | included all Model 
1a control variables, but not the South/non-South dummy variable. Consistent with the 
Alesina-La Ferrara finding, this initial analysis revealed a strong negative relationship between 
race-live and sumtrust (p < .001). Net of the control variables, whites who live near blacks are 
less trusting than those who do not. 

| then repeated the analysis with the South dummy included. As shown in Model 1b, the 
effect of race-live decreased slightly but remained highly significant (0 = .001). Most 
importantly for current purposes, however, the South variable was highly robust, with 
Southerners reporting lower levels of trust (0 < .001).'° Thus, regional differences in trust 
cannot be explained by a greater tendency for southern white respondents to live near blacks. 

A second alternative explanation for the general findings reported above shifts the focus 
from the respondents, or trusters, to the targets of trust. According to this explanation, when 
non-Southerners report higher levels of trust, they may simply be reporting greater perceived 
trustworthiness of others in the non-South (compared to others in the South). If so, when 
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Southerners and non-Southerners are considering the same target population, we should not 
observe regional trust differences." 

| addressed this explanation by comparing trust differences between natives and migrants 
to various regions. Specifically, | crossed each respondent's region of residence at the time of 
the interview with his or her region of residence at age 16 (a proxy for native region), which 
yielded five categories: (1) Southerners who moved to a non-southern region, (2) non- 
southern natives (respondents who remained in a specific non-southern region), (3) 
respondents who moved from one non-southern region to a different non-southern region, (4) 
non-Southerners who moved to the South, and (5) southern natives. 

The analyses reveal that, net of the Table 2 control variables, southern natives are significantly 
less trusting than all remaining categories (0 < .05). But the most important comparison for 
Current purposes is between southern natives and non-Southerners who move to the South. 
That migrants to the South are significantly more trusting than southern natives (0 = .001) 
suggests that differences in trust between Southerners and non-Southerners do not result from 
differences in the perceived trustworthiness of others in various regions. 

As further evidence against this alternative explanation, | repeated the analysis but 
changed the reference category to non-southern natives. Once again, the results are highly 
reassuring. Southerners who move to the non-South are significantly less trusting than non- 
southern natives (0 = .002). Thus, we can rule out the possibility that regional differences in 
trust are driven by regional differences in the perceived trustworthiness of others. 


Issues for Future Research 


The results of this paper support the theory linking region, collectivist social relations and trust. 
But due to data limitations, the analyses did not directly measure social relations. Instead, the 
logic of the analyses was based on prior demonstrations that southern social relations are more 
collectivist than non-southern social relations. (As discussed earlier, this logic is like that used in 
international comparisons of collectivism and trust.) While this is a useful starting point, future 
work should directly measure how collectivist social relations co-vary with trust across regions. 
Similarly, future research should be directed at assessing regional variation in actors’ 
reliance on alternatives to general trust. As noted earlier, assurance relations and general trust 
serve as alternative solutions to some of the same cooperation problems (Yamagishi and 
Yamagishi 1994). From this perspective, collectivist and individualist social relations generate 
assurance-based and general trust-based solutions, respectively. Previous work suggests that 
Japanese depend more on assurance relations than Americans, while Americans rely more 
on general trust. Meanwhile, the present research finds that (white) non-Southerners have 
higher levels of general trust than (white) Southerners. Thus, a question for future work is 
whether Southerners depend more on assurance relations than non-Southerners. Empirical 
investigations of these questions could employ both survey methods (Yamagishi and 
Yamagishi 1994) and experimental studies that manipulate the social proximity and 
monitoring/sanctioning capabilities of interdependent actors (Buchan et al. 2002). 


Implications for Regional Differences in Social Institutions and 
Interpersonal Behavior 


Previous work has shown that trust is associated with a wide range of macro-level 
phenomena, including economic development and civic engagement (Fukuyama 1995; 
Putnam 2000). Meanwhile, other research finds that the South has higher rates of poverty and 
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lower levels of political participation than non-southern regions (Grantham 1994; Vandello and 
Cohen 1999). A question for future research rests at the intersection of the regional 
differences detailed in previous work and the findings reported in this paper: Can regional 
differences in trust account for regional variation in these various social and economic ills? 

The positive relationship between trust and trustworthiness noted earlier in the paper also 
implies a number of additional questions. First, if high trusters are also more trustworthy, as 
shown in previous research (Orbell and Dawes 1993), and non-Southerners are more trusting 
than Southerners (as shown above), should we expect non-Southerners to be, on average, 
more trustworthy than Southerners? For example, will non-Southerners be more likely than 
Southerners to honor an interdependent stranger's trust? Especially suggestive in this regard 
is Yamagishi et al's (1998) finding that Japanese participants engaged in less trustworthy 
behavior than (northwestern) U.S. participants in a study of trust and.cooperation between 
strangers. The question for future research is whether analogous differences hold within the 
United States. A cross-regional experiment would allow controlled comparisons of trusting 
and trustworthy behavior. 

In addition to the effects of ego’s region, future work should consider the effects of an 
interdependent a/ter’s region on whether ego engages in trusting behavior. Especially 
interesting in this regard is the possible effect of stereotypes about Southerners on whether 
actors place trust in Southerners vs. non-Southerners. For example, important aspects of the 
southern stereotype are politeness and honor (Nisbett and Cohen 1996). These stereotypes 
may interact in interesting ways with the possibility, suggested in the previous paragraph, that 
Southerners are less trustworthy partners in exchanges between strangers. We might expect 
perceptions of Southerners as more honorable to lead actors to expect more trustworthy 
behavior from Southerners than non-Southerners. If so, and if it is non-Southerners who are 
more trustworthy, a tendency to trust Southerners over non-Southerners would be a (relative) 
misplacement of trust. (For more on the relationship between trust, trustworthiness and 
social categories in general, see Orbell et al. 1994). These arguments could also be tested 
using cross-regional experiments. 


Conclusion 


The research reported here integrates two streams of research. The first stream shows that 
collectivist social relations generate lower levels of trust than do individualist social relations. 
Previously, researchers in this tradition have focused on the differences in collectivist and 
individualist nations, with the majority of studies comparing Japan and the United States. The 
second line of research shows that social structures in the U.S. South are more collectivist 
than those in non-southern regions. At the confluence of these two research lines is the 
prediction that Southerners are less trusting than non-Southerners. As expected, this 
argument was supported for whites but not blacks. 

Beyond its empirical implications, the current application extends the scope and increases 
the precision of an existing theory of trust. The paper extends the scope by moving beyond 
the typical comparison between nations to the investigation of intranational differences. At 
the same time, the theoretical and empirical demonstration that trust varies across regions of 
the United States leads to a more precise account of how differences between nations 
compare to differences within. In so doing, this may help overcome recent criticisms that the 
large literature on international differences in collectivism obscures variation within nations ~ 
(e.g., Conway et al. 2001; Voronov and Singer 2002). 

In sum, researchers have recently called for more intranational studies of “cross-cultural” 
differences (Conway et al. 2001; Vandello and Cohen 1999, 289-290), a primary benefit of 
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which is that many of the factors that vary between nations (such as language and 
government) are held constant within. For this reason, as well as those outlined in the 
preceding paragraphs, the current findings increase our confidence in arguments previously 
used to explain international variation in trust. 

Perhaps more importantly, intranational variation in trust is worth considering for precisely 
the same reason as trust differences between nations: the array of (personal and social) 
benefits of trust. Of course, the current work was just an initial step and many questions 
remain about the causes and consequences of regional differences in trust. Future research 
should focus on more directly assessing intranational variation in social relations and trust, 
and the consequences of this variation. Given the far-reaching consequences of trust, such 
efforts should be well worth our while. 


Notes 


1. While a few scholars (Putnam 2000; Alesina and La Ferrara 2002) have suggested that 
trust or other aspects of “social capital” may vary by region, the investigation of regional 
differences in trust was not the goal of that work. Thus, previous research does not offer 
an explanation of regional differences in trust, nor does it investigate whether regional 
differences in trust persist after controlling for regional differences in other variables 
related to trust (e.g., income, education and community size). Finally, | show that regional 
differences in trust occur net of other community-level variables previous work uses to 
explain geographical variation in trust. 


2. Rousseau et al.’s (1998) interdisciplinary review of the trust literature shows that this 
conception of trust is common across a wide range of disciplines. 


3. Similarly, Fukuyama (1995) explains how strong family ties can inhibit the development of 
trust in non-family members, and Malhotra and Murnighan (2002) demonstrated the 
negative effects of contracts (a type of sanctioning system) on interpersonal trust. 
Malhotra and Murnighan argue that the presence of contracts negatively affect trust 
because actors attribute cooperation to the contract (the sanctioning system), rather than 
to each other's trustworthiness. These arguments and findings are consistent with the 
theory of trust just reviewed. 


4. Arguably, individualist relations increase general trust only if the placement of trust is 
honored more than it is exploited. Previous work on the positive relationship between 
trust and trustworthiness suggests this is a reasonable assumption. For example, Orbell 
and Dawes (1993) show that in social dilemmas with an option to “play” (go into the 
market) or “exit” (go it alone), high trusters are more likely than low trusters to play. Then, 
given the positive relationship between trust and trustworthiness, the market will contain 
a disproportionate number of trustworthy actors. If so, the placement of trust outside 
group boundaries will tend to be rewarded. 


5. A reviewer correctly noted that a large number of southern whites also migrated to 
Northern and Midwestern cities around the same time period, and that many of them 
established ethnic enclaves (Fligstein 1981). However, the migration of blacks was of 
much greater magnitude (Fligstein 1981). Tolnay (2003: 221) notes that “between 1910 
and 1980 ... the percentage of [Chicago's] black population jumped from 2.0% to 32.7%.” 
He goes on to point out that “similar transformations occurred in virtually all major 
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northern and western metropolitan areas.” Thus, compared to white non-Southerners, a 
much greater proportion of blacks in the data considered below are first- or second- 
generation non-Southerners. As a result, we should expect the responses of the average 
Southerners and non-Southerners to be more similar for black respondents than for white 
respondents. : 


6. Michael Macy (personal correspondence) proposes another reason not to expect lower 
levels of trust among blacks in the South compared to those in the non-South. He notes 
that Yamagishi's theory of low trust in collectivist Japan refers to a system of lateral social 
control that relies on peer monitoring, not centralized social control by formal institutions. 
He agrees that assurance is promoted by the knowledge that others are being closely 
monitored, but he argues that this is not what lowers trust. Rather, it is the responsibility 
tor monitoring others that attenuates trust by promoting suspicion. In addition to Japan, 
close mutual monitoring characterizes many societies with “honor codes,” including 
Corsica (Gould 2000) and the southern United States (Nisbett and Cohen 1996). However, 
southern blacks were not part of the southern honor code. Southern blacks have 
historically been closely monitored and sanctioned, but by an outside oppressive regime 
(i.e., vertically, rather than laterally by ingroup members). If low trust in Japan and the 
South is due to the peer monitoring characteristic of honor codes, he concludes, this may 
explain why we observe lower levels of trust among southern whites (compared to non- 
southern whites), but not southern blacks (compared to non-southern blacks). Macy's 
hypothesis differs from the theory | propose in that the focus is on the effects of peer 
monitoring rather than collectivist social relations. We are planning a lab experiment with 
which to adjudicate this theoretical difference. 


7. Research on regional differences in collectivism typically compares southern vs. non- 
southern whites. | am unaware of any research that investigates regional differences in 
collectivism for blacks. The theory of collectivism and trust suggests that if blacks are, on 
average, more collectivist than whites, blacks should also be less trusting than whites. 
Many scholars have documented blacks’ lower level of trust, and researchers have 
offered a number of plausible accounts of this pattern. To my knowledge no one has 
investigated the possibility that blacks’ lower level of general trust may be traceable in part 
to a tendency for blacks’ social relations to be more collectivist than whites’ social 
relations. These issues should be investigated in future work. 


8. There are a few “special case” states in terms of how they compare to other states with 
which they are categorized on various measures of collectivism. For example, Vandello 
and Cohen (1999, Table 1) show that the social structures of Hawaii are more collectivist 
than other Western states, while Oklahoma is more individualist than other southern 
states. However, for simplicity and ease of replication, | follow the Census Bureau 
classification. If anything, this classification results in a more conservative test of the 
argument linking region, collectivism and trust. 


9. Unfortunately, the GSS does not contain measures of collectivist social relations. It is 
important to note, however, that when coupled with the previous research reviewed 
above, the analyses reported below strongly support the argument linking regional 
differences in trust to regional differences in collectivist social relations. Of course, future 
work should attempt to measure region, collectivist social relations and trust 
simultaneously. A number of suggestions for future research relevant to this issue are 
made below. 
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10. Once again, supplementary analyses with alternatives measures yielded the same 
conclusion. 


11. | thank an anonymous reviewer for suggesting this possibility. 
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Southerners in the West: 
The Relative Well-being of Direct and Onward Migrants 
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Abstract 

The Great Migration of southerners away from their region of birth stands as one of the 
most significant demographic events in U.S. history. The first waves of migrants headed 
primarily to the Northeast and Midwest. During and after World War II, a larger 
proportion moved to the West. We use information from the 1970 through 2000 public use 
samples to compare the economic status of “onward migrants” from the Northeast and 
Midwest with that of “direct migrants” from the South. Our findings show that onward 
migrants had greater incomes and higher occupational statuses, but were not more likely to 
be employed than direct migrants with the same socio-demographic profiles. The economic 
advantages enjoyed by onward migrants were shared by blacks and whites and prevailed 
across three of the four decades considered in our analysis. 


Introduction 


Throughout much of the 20" century, southern-born Americans fled their region of birth in 
search of better social and economic opportunities outside of Dixie. Millions of southerners 
participated in this “Great Migration” that substantially altered the regional and residential 
distribution of the U.S. population, especially for African Americans (Alexander 1998; Eldridge 
and Thomas 1964; Johnson and Campbell 1981; Long 1988; Tolnay 2003a). These peripatetic 
southerners also shaped the cultural dynamics and political balances in their adopted cities 
and states, creating legacies that have persisted into the 21* century (e.g., Berry 2000; 
Gregory 1989, 2005; Grossman 1989; Lemann 1991). Thus, the Great Migration can 
legitimately be considered one of the most important and influential demographic events in 
the nation’s history. 

Originally, the vast majority of migratory southerners headed to the Midwest and Northeast, 
as they took advantage of convenient transportation routes and the more proximate economic 
opportunities in those urbanized and industrialized regions (Alexander 1998; Berry 2000; Eldridge 
and Thomas 1964: Johnson and Campbell 1981; Long 1988; Taylor 1998; Tolnay 2003a, Tolnay et 
al. 2005). Large cities such as Boston, Chicago, Detroit, New York and Philadelphia absorbed 
huge numbers of southern-born arrivals between 1910 and 1940. And, even smaller cities like 
Cincinnati, Ohio; Flint, Michigan; and Bloomington, Indiana saw large influxes of southerners. 
Eventually, the growth of defense-related industries on the West Coast during World War II 
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attracted a larger share of southern migrants to the more distant destinations of California, 
Oregon and Washington. Although included within the broader phenomenon of the Great 
Migration, this later movement to the West has not attracted the same level of scholarly attention 
as the relocation of southerners into the Northeast and Midwest. 

The western migration of southerners differed in many important respects from their more 
widely studied movement to the Northeast and Midwest. As noted, the timing was different, 
with the vast majority of migration to the West occurring after 1940. But, the regional 
destinations also varied in a number of important ways. The racial and ethnic composition of 
the western population was different from that in the Northeast and Midwest. Japanese, 
Chinese, Native Americans and Mexicans were significant minorities in the West (Gregory 
1989; James 1945; McWilliams 1945; Taylor 1998), while African Americans and European 
immigrants were the primary ethnic groups of the Northeast and Midwest (e.g., Bodnar, 
Simon and Weber 1982; Duncan and Duncan 1957; Grossman 1989; Lieberson 1980; 
Perlmann 1988). The large cities of the West exhibited a different urban ecology from their 
counterparts in the East. Newer, and with settlement patterns that were less dense, they had 
not yet had a chance to become as rigidly segregated as the older cities in the Northeast and 
Midwest (Farley and Frey 1994; Logan, Stults and Farley 2004; Massey and Denton 1993). 
Finally, the migrants themselves were not the same. First, because the western migration 
occurred later, those migrants who came directly from the South to the West were drawn 
from a different population at their place of origin, and were also exposed to different 
selection processes (e.g., Tolnay 1998). Second, many of the southerners who headed to the 
West had already spent time in the Northeast or Midwest before moving on. 

In this paper we examine the adaptation experiences of southern migrants in the West, 
while devoting special attention to a comparison between the economic well-being of those 
who moved directly from the South and those “onward” migrants who had resided previously 
in the Northeast or Midwest. We draw from migration theory to propose hypotheses 
regarding the differences between these two groups in employment patterns, income and 
occupational status. On the one hand, migrants who had already spent time in the Northeast 
or Midwest might be expected to fare better than those who moved directly from the South 
because they had had an opportunity to acquire additional human capital that provided them 
with an economic advantage in the West. On the other hand, it is possible that some of the 
onward migrants were moving on because they had failed in the industrial economy of the 
North and would experience the same disappointment in the West. Furthermore, it is possible 
that these processes were different for black and white southern migrants, and that they 
changed over time as a result of transformations in the economies of the rural South and the 
urban North. We explore these questions with data from the public use microdata samples 
(PUMS) of the 1970 through 2000 U.S. population censuses. 


Background and Theory 


The Great Migration and the West 


The Great Migration of southerners to other regions of the country escalated sharply during 
the second decade of the 20" century. Many of the same forces were instrumental in causing 
this exodus of both white and black southerners. Economic stagnation, especially in - 
agriculture, motivated many southerners to flee the region (Daniel 1985; Fligstein 1981; Kirby 
1987; Tolnay 1999; Woofter 1920). But, this was not a movement made up solely of people 
from farms. Many of the migrants originated in small towns and cities throughout the South 
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(Alexander 1998; Marks 1989). Pulling the migrants northward were the new economic 
opportunities created by the expansion of industrial production required by the nation’s entry 
into World War | (Collins 1997). Because the war, and later legislative restrictions, sharply 
reduced the number of immigrants entering the country, those industrial opportunities were 
no longer dominated by newcomers from southern, central and eastern Europe, as they had 
been in the past. As a result, a labor vacuum developed in the North that created 
unprecedented opportunities for white and black southerners who were willing to move. 

While the poor economic conditions in the South affected both whites and blacks, there 
were also forces that uniquely induced African Americans to leave the region. Decades of 
racial segregation, economic discrimination, social humiliation and violence made many 
African Americans long for the perceived relative equality and security of the North (e.g., 
Faulkner et al. 1982; Henri 1975; Tolnay and Beck 1992). However, it was not until restricted 
immigration combined with industrial expansion that the African American exodus from the 
South began in earnest. At the outset of the Great Migration, 90 percent of African Americans 
resided in the South. By the time it had run its course, the black population of the United 
States was divided roughly evenly between the South and the non-South (U.S. Bureau of the 
Census 1985). 

From the very beginning of the Great Migration, southern whites were more likely than 
southern blacks to choose a western destination (see Figure 1). In 1920 only about 5 percent 
of black southern migrants resided in the West, compared to roughly one-quarter of white 
migrants. After 1940, the West became a much more popular destination for southern 
migrants of both races. By 2000, more than 40 percent of white and more than 25 percent of 
black southern migrants resided in the West. Although the West continued to attract 
proportionately more white than black southerners during this Post-World War II surge in 
migration, the sharp increase for blacks attracted a good deal of attention. For example, the 
Journal of Educational Sociology published a special issue on black migration in 1945 that 
focused on the experiences of four West Coast cities that were receiving increasing numbers 
of black migrants from the South — Los Angeles, Portland, San Francisco and Seattle. Despite 
the recognition of many social problems that were associated with this influx of population, 
including strains on local transportation systems, housing and race relations, the articles in 
this special issue painted a rather optimistic picture of the long-term prospects for African 
Americans in the West. One author captured this sense of opportunity and optimism by 
writing (McWilliams 1945: 187), “Not only has the Pacific Coast become our new racial frontier, 
but in many respects, it has become our new economic and industrial frontier.” 


“Direct” vs. “Onward” Migrants 


The literature on southern migration to the West tends to lump together all people living in the 
West who were born in the South, usually distinguishing them only by race (as in Figure 1). 
However, the West was not the initial destination for many migrants who had abandoned the 
South. Many had moved originally to the Northeast or Midwest, and then eventually continued 
their trek to the more distant West. This distinction between “direct” and “onward” migrants to 
the West has potential importance that transcends the geographic route that was taken by 
southerners to reach the West. Specifically, the two groups may have varied in the economic 
success that they enjoyed after they reached their common destination. But, why should we 
expect different adjustment experiences for direct and onward migrants from the South? 
Previous researchers have inferred differential outcomes for “repeat” and “original” 
migrants and, among repeat migrants, by whether they returned to their place of origin (i.e., 
return migrants) or moved on to a new destination. Furthermore, how well repeat migrants 


1642 * Social Forces Volume 84, Number 3 * March 2006 


Figure 1. Percentage of Southern-Born Migrants Living in the West, by Race, United States 1900-2000 








50 


45 








40 - 
White 


Comes 





30 





95 Black 











Percentage 


20 


WS 





10 








5 











0 + T T T T T T T T 


79 797 795, 799 794, 79g, 796, y 79%, 799 799 0 <A, v 
0 0 O 


Source: Ruggles et. al, 2004. Integrated Public Use Microdata Series. Minnesota 
Population Center Version 3.0 http://www.ipums.org. 


fare may depend on their personal experiences and their reasons for moving. For example, 
DaVanzo and Morrison (1981:87) have noted that: 


“Repeat migrants, having moved before, should be more efficient in 
acquiring information and forming accurate expectations about the 
characteristics of destination areas, and their broader spatial awareness 
(based on having lived in several places) should have crystallized into 
clearly defined locational preferences. However, repeat migrants also 
include some persons with poor judgment, whose inability to form 
accurate expectations necessitates additional moves.” 


Empirical support for these differences identified by DaVanzo and Morrison has been 
offered by investigators comparing repeat migrants who return to their original places of 
residence to those who move on to new locations. In general, return migrants, especially those 
who return to their place of origin relatively soon after leaving, have lower levels of education, 
fewer job skills and are older than repeat migrants who move on (e.g., DaVanzo 1983; DaVanzo 
and Morrison 1981; Grant and Vanderkamp 1986). We are aware of no studies that directly 
compare onward migrants and direct migrants who have moved to the same general 
destination, while following different routes, as we do in this article. However, it is possible to 
draw from migration theory, as well as from a variety of contextual and temporal influences on 
interregional migration in the United States, to anticipate differences in the relative economic - 
well-being of onward and direct southern migrants to the West from 1970 through 2000. 

There are compelling reasons to expect onward migrants to have fared better in the West 
than those migrants who moved directly from the South. First, as noted by DaVanzo and Morrison 
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(1981), onward (or repeat) migrants are more experienced than primary migrants in the migration 
process and, perhaps, more efficient in making decisions about when to move and where in 
order to maximize their opportunities. Among all repeat migrants who began their journeys in the 
Northeast or Midwest, those who headed west should have been more positively selected than 
those who returned to the South, especially earlier in the time period considered. According to 
migration theory, onward migrants had been subjected to a two-stage, positive selection process 
— first when they left the South for the North, and second when they moved on to the West, 
rather than returning to the South (e.g., Greenwood 1985; Lee 1966; Ravenstein 1885, 1889). In 
contrast, the direct migrants were relatively untested in their ability to adjust to the non-southern 
economy and society. Some would thrive in their chosen destinations, but others would return 
rather quickly to the South because of unfulfilled hopes and expectations. 

Second, by virtue of their prior residence in the Northeast and Midwest, onward migrants had 
an opportunity to acquire occupational training and skills that were appropriate for an industrial 
workplace. On average, then, onward migrants should have been more likely than direct migrants 
from the South to bring with them work experience and job skills that were transferable to the 
expanding industrial economy in the West. While the South had also experienced substantial 
economic development and industrialization by the close of the Great Migration, it had not, yet, 
achieved parity with the North or the West (Cobb 1982; Daniel 1985; Wright 1986). Any human 
capital advantage enjoyed by onward migrants also may have varied by race and time period. 

The previous mobility experience of onward southern-born migrants, coupled with their 
opportunity to acquire important human capital during their temporary residence in the 
Northeast or Midwest, leads us to predict a more favorable economic profile for onward 
migrants in the West than for those southerners who had moved directly to the West: 


H1: Onward migrants enjoyed greater economic success (/.é., 
employment, earnings, occupational status) in the West than did 
direct migrants. 


However, it is also important to acknowledge that there are counterbalancing factors that 
might lead one to expect that direct migrants enjoyed certain advantages over onward migrants. 
First, it is possible that some prior southern migrants were compelled to leave the Northeast and 
Midwest because they had failed socially or economically in those regions. As suggested by 
DaVanzo and Morrison (1981), it is likely that more “failed” migrants chose to return to the South 
rather than to move onward. However, some certainly would have decided to try their luck in the 
booming western region. DaVanzo (1981) also suggests that unsuccessful migrants are 
especially likely to move onward rather than return if their place of origin is economically 
depressed, as was the case for the South during the early decades of the Great Migration. 

Cultural and experiential differences also might have contributed to higher levels of 
unemployment for onward migrants than for direct migrants. First, it is possible that their 
temporary stays in the Northeast and Midwest had created higher expectations among onward 
migrants and a reluctance to settle for less attractive jobs with low wages, at least in the short 
run. In contrast, direct migrants had been subjected to even stronger forms of economic 
discrimination in the South and had been paid even lower wages than their counterparts in the 
Northeast and Midwest. Therefore, the same job opportunity that was deemed unacceptable to 
the onward migrant may have been viewed as an attractive opportunity by the direct migrant. 
Rose (1975:313) noted the potential for this phenomenon of differential expectations when he 
argued that socialization in the rural South “...is expected to lead southern black migrants of non- 
metropolitan origin to settle for less in the job market. A presumption of lower esteem works 
to influence one's perception of what constitutes satisfactory employment... The perceptions of 
the existence of a more open society on the part of black youth socialized in large northern 
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centers is thought to have influenced the attitude of those youths regarding what constitutes 
acceptable employment.” Lemke-Santangelo (1996:6) refers to a similar phenomenon in her 
study of the adaptation of African American female southern migrants to the East Bay Area of 
California. Regarding the employment opportunities available to the migrant women she writes, 
“However bad these jobs were, women relished the fact that ‘we were at least getting paid to 
put up with it. In the South it had been nothing but hard work and bad treatment. Here | was 
making more in a day than | made back home in a month.” 

The possibility that cultural or experiential factors acquired during their residence in the 
Northeast or Midwest may have produced higher expectations for employment among 
onward migrants, and therefore a reluctance to settle for less attractive or lucrative jobs, leads 
us to predict that any economic advantage for onward migrants was weaker for employment 
status than for earnings or occupational prestige: 


H1a: The economic advantage for onward migrants was weaker for 
employment status than for earnings or occupational prestige. 


In its extreme, it is possible that this auxiliary hypothesis could predict that direct migrants 
were actually more likely than onward migrants to be employed. 


The Importance of Race and Time 


The processes that we hypothesize resulted in an economic advantage for onward migrants 
from the South were not necessarily invariant between races or across time periods. Indeed, 
there are good reasons to believe that black and white migrants shared unevenly in the 
economic advantages of prior residence in the Northeast or Midwest and that the significance 
of those advantages changed over time along with the social, economic and cultural milieus 
within the South and North. 

The racial conditions that prevailed in the North during the last half of the 20" century likely 
assured that white southern migrants were more successful than black migrants in acquiring 
additional human capital during their periods of residence in the Northeast and Midwest. This, 
in turn, would have offered them an advantage in their subsequent economic transitions to the 
West. African Americans experienced economic discrimination in the North, including a 
disadvantaged position within an occupational queue that was stratified by race and ethnicity 
(Adelman and Tolnay 2003; Boyd 2001; Lieberson 1980; Tolnay 2003b). Compared with whites, 
blacks in the Northeast and Midwest were more heavily concentrated in unskilled jobs that 
offered less remuneration or opportunity for upward mobility. Therefore, we might expect that 
prior “northern experience” carried more significant economic benefits for white southern 
migrants than it did for black migrants. 

Neither the North nor the South was static as the Great Migration proceeded. Social and 
economic changes occurred within both regions, and these undoubtedly influenced the 
composition of migrants headed to the West. After 1960, many of the onward migrants were 
responding to deteriorating social and economic conditions in the large urban centers of the 
Northeast and Midwest. Economic restructuring saw many of the blue-collar job opportunities 
that originally had attracted southern migrants to the North heading to the South and West 
(Kasarda 1989, 1995; Sale 1975; Tabb 1987). Many onward and return migrants were following 
those employment opportunities. For those who remained in the North, suburbanization - 
produced a shifting tax base that further reduced job opportunities for the residents of central . 
cities, especially for black males (Squires 1987; Wilson 1987, 1996), leading to widespread un- 
and under-employment. In turn, the resulting economic distress contributed to a variety of 
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increasing social problems, including crime, welfare dependency and single-parent families 
(Wilson 1987, 1996). For African American residents, these disturbing trends were magnified by 
persistently high levels of residential segregation (Massey and Denton 1998). 

These were powerful social and economic forces pushing the residents of central cities 
outward — to nearby suburbs or to more distant destinations such as the West or South. While 
traditional migration theory would predict that these “pushed” migrants were negatively 
selected from the population at origin, it is not obvious whether that broad generalization 
applies in this specific case. For both blacks and whites, there are reasons to believe that the 
unskilled and poorly educated population may not have headed West. First, the “truly 
disadvantaged” of both races may have faced difficulty in escaping the declining central cities 
and neighborhoods in the Northeast and Midwest because they lacked the financial 
resources required to move across the country. Second, at least for whites, even the less well- 
off had access to nearby suburbs as they attempted to flee the deteriorating conditions in the 
central cities (Frey 1979). Third, residential segregation in the Northeast and Midwest 
imposed an even stronger barrier to the long-distance migration of poor African Americans 
who might have benefited from moving to the West (Massey and Denton 1993). 

Significant social and economic changes were also occurring in the post-war South. By 
1960, fundamental agricultural re-organization and widespread farm mechanization had 
reduced the South's dependence on tenant farmers and farm laborers (Daniel 1985; Fligstein 
1981; Kirby 1987). As a result, urbanization intensified, and the number of workers engaged 
in agricultural pursuits fell sharply. In contrast, the region's drive to increase industrial activity 
and to modernize its economy had begun to pay dividends. Manufacturing activity in the 
South increased and created substantially more employment opportunities in the non- 
agricultural sector (Wright 1986). These trends intensified after 1960 with the movement of 
manufacturing and industrial operations out of the North and into the sunbelt (Kasarda 1995; 
Sale 1975; Squires 1987). Socially, the harsh conditions of racial segregation gradually gave 
way to a more tolerant racial climate in the South. The Civil Rights Movement and federal 
legislation eventually produced racially-integrated institutions in the South and ushered in an 
improved, if still imperfect, era of southern race relations. 

What implications did these social and economic changes have for migrants moving 
directly from the South to the West? We can suggest three possibilities. First, as the southern 
economy offered more opportunities for urban residence and non-agricultural employment, 
migrants were increasingly likely to possess job skills and experience that were more 
transferable to the industrial workplaces in the West. Second, the expansion of secondary 
schooling within the South after 1940 also reduced the educational gap between the South 
and the North — a trend that would have been reflected in the composition of the migration 
streams coming from the two regions (Anderson 1988; Margo 1990; Tolnay 1998). Third, the 
moderated racial climate in the South reduced the level of segregation, discrimination and 
racial violence to which blacks were exposed. As a result, it is likely that proportionately fewer 
African-American migrants would have moved West in order to flee the oppressive social 
conditions in the South, and that proportionately more would have been attracted by 
economic opportunities.” In sum, all of these changes point to the possibility of a declining 
difference between onward and direct migrants over time. 

These considerations regarding the importance of race and time lead to our final two 


hypotheses: 


H2: The economic advantage for onward migrants was greater for 
whites than for African Americans. 


H3: The economic advantage for onward migrants declined over time. 
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Data 


We use information from the public use microdata samples (PUMS) of the decennial U.S. 
population censuses for 1970 through 2000 to compare the relative economic well-being of direct 
and onward southern migrants in the West.? The 1970 and 1980 censuses capture the experience 
of migrants near the close of the Great Migration, while the 1990 and 2000 censuses offer a more 
contemporary perspective beyond the time period.normally associated with the Great Migration. 
A variety of factors prevent an even broader historical perspective by including data from earlier 
censuses. The 1940 census occurred prior to the sharp increase in migration to the West, 
therefore it includes too few southern migrants to conduct rigorous comparisons between direct 
and onward migrants. The 1950 census used a different time reference (one year vs. five years) 
when asking questions about residence in the past, and it collected certain key information (e.g., 
income and education) only from a smaller set of “sample line persons.” As a result, analyses 
using the 1950 data would not be comparable to analyses based on other census years, and 
would also have too few direct and onward migrants for the analysis of some economic 
characteristics.’ The 1960 PUMS does not include information about residence five years before 
the census that would allow us to construct measures of recent inter-regional migration. While it 
would be advantageous to have information about onward and direct migrants to the West for an 
earlier decade, we believe that the 1970 and 1980 censuses are capable of supporting tests of 
our central research hypotheses for a time period that corresponds roughly with the end of the 
Great Migration. As described below, the recent migration behavior measured by the 1970 
census occurred sometime between 1965 and 1970, still within the temporal boundaries of the 
Great Migration. Onward migrants would have moved from the South to the Northeast or 
Midwest prior to 1965, in some cases significantly before that date. Finally, the 1980 through 2000 
censuses span a period of important social and economic transformations in the North and 
South, so are capable of supporting the investigation of possible changes in the relative well- 
being of onward and direct migrants across a heterogeneous time period. 

Our quantitative analyses are based on extracts from PUMS files that include black and white 
adult males (18 years and older) who were born in the census-defined South but resided in the 
census-defined West at the time of enumeration. Those enrolled in school at the time of the census 
or living abroad five years before the census were excluded from the sample.’ We do not include a 
parallel analysis of female southern migrants because a thorough consideration of gender 
differences in the relative economic well-being of direct and onward migrants is beyond the scope 
of this study and worthy of its own investigation. Given the importance of gender in the migration 
process, many previous researchers investigating issues related to the Great Migration have made 
a similar decision to restrict their analyses to either males (e.g., Crowder, Tolnay and Adelman 2001; 
Long and Heltman 1975; Tolnay 2001; Tolnay et al. 2005) or females (e.g., Adelman, Morett and 
Tolnay 2000; Lemke-Santangleo 1996; White 2005), rather than attempting to combine them in the 
same analysis. Furthermore, recent evidence suggests that the migration outcomes for females 
during the Great Migration were more dependent upon their marital status than the outcomes for 
males and, for married women, closely tied to the outcomes of their husbands (White et al. 2005). 


Variables 
Dependent Variables 


We use three different dependent variables in our analyses: (1) employment status, (2) wage 
and salary income, and (3) occupational status. Employment status is measured as a 
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dichotomy that distinguishes those males who were currently employed in.the week prior to 
the census from those who were either in the labor force and unemployed or not in the labor 
force.* Wage and salary income is measured as an interval scale and refers to money that was 
received as an employee during the previous calendar year, including wages, salaries, 
commissions, cash bonuses, tips and other money income received from an employer. To 
facilitate the comparison of findings across decades, income for all years is expressed in 
“constant” 1970 dollars. Our analyses of income were restricted to those males who were 
currently employed. Occupational status is measured using the Duncan Socioeconomic Index 
(SEI), a constructed variable that is based on the 1950 occupational classification system. 
More specifically, it is based on the income level and educational attainment linked with each 
occupation in 1950 (see Duncan 1961; Ruggles et al. 2004). Larger values on SEI imply a 
higher occupational standing, with scores ranging from a possible low of 3 to a possible high 
of 96. SEI scores are available for all males who reported an occupation — those who had 
worked sometime during the previous 10 years for 1970 or during the previous five -years for 
1980, 1990 and 2000. 


Independent Variables 


In the analyses of all three dependent variables primary attention is focused on the recent 
migration history of southern migrants living in the West. Three groups of migrants are 
identified, based on their history of inter-regional migration during the five years preceding the 
census: direct migrants, onward migrants and “pioneers.” Direct migrants are those males 
who reported that they were born in the South and resided in the South five years before the 
census. Onward migrants were those who reported a southern birthplace but a northeastern 
or midwestern residence five years prior to the census. Pioneers were born in the South, but 
reported a western residence at the time of the census as well as five years before. Pioneers 
are distinguished from direct and onward migrants in order to allow additional contrasts 
among southern migrants by their migration history. However, differences involving pioneers 
are not our primary interest and, therefore, are not included in our research hypotheses. Direct 
migrants serve as the reference group throughout our analyses.’ 

A limited set of control variables is included in order to avoid drawing conclusions about the 
differences between onward migrants and direct migrants that are really the result of 
compositional differences between the two groups. Specifically, these predictors are 
characteristics that might affect economic well-being (employment, income or occupational 
status) while also varying by recent migration history. A potential curvilinear relationship between 
age and economic well-being is allowed by including linear and quadratic terms for the subject's 
age in years — age and age’. The possible influence of family obligations on economic well-being 
is considered by distinguishing between men who are married and co-residing with a spouse vs. 
all others (the reference group). Years of schooling completed is included in the analyses of 
employment status and income to control for group differences in educational attainment and 
its possible effect on corresponding differences in economic well-being. Because aggregate 
education levels are used in the assignment of Duncan SEI scores to specific occupations, we 
do not include individual-level education in the analysis of occupational status.° A control variable 
is included that identifies southern-born residents of the West who were current members of 
the military. Interregional migration that is the result of military assignments or transfers may 
have economic implications that differ from those for non-military migrants. Finally, we use two 
strategies to account for group variation in residential distribution within the West. First, place of 
residence is represented by distinguishing between those who lived in a metropolitan area and 
those who did not (the reference group).? Second, a set of variables is used to identify the 
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individual's western state of residence, and thereby control for any differences across states 
that might affect group variation in the dependent variables."° 

The variables “race” and “time” are accounted for by conducting separate analyses for 
blacks and whites, and for the four different time periods. By estimating separate models, 
rather than pooling the data and including relevant multiplicative interaction terms, we allow 
all predictor variables to have varying effects by race and time period." 


Method 


We use binary logistic regression for the analysis of current employment status, a dichotomy. For 
the analysis of income and occupational status we use ordinary least squares regression. The 
analyses for 1990 and 2000 are weighted to account for person-level variation in the likelihood of 
inclusion in the PUMS. In reporting the statistical significance of regression coefficients obtained 
from our analyses, we use a minimum p-value of .10 (two-tailed test) for those relationships or 
differences in relationships for which we have explicitly predicted direction (e.g., the contrast 
between onward and direct migrants or racial differences in the contrast between onward and 
direct migrants). For all other coefficients, or for relationships that are in the direction opposite of 
that predicted, we use a minimum level of statistical significance of p < .05 based on a two-tailed 
test.” Coefficients that differ significantly between blacks and whites are noted in tables with the 
symbol T. 


Findings 


Employment Status 


The results from our analysis of current employment status are presented in Table 1. Looking 
first at the results for black males, we find that onward migrants were actually /ess /ikely than 
direct migrants (the reference group) to be employed in all four decades, with the difference 
attaining statistical significance in all decades except 1980. Net of all other independent 
variables, the odds of a black onward migrant being employed in 1970 were roughly two- 
thirds of the odds for a direct migrant (Odds Ratio = e“* = .63)."* In 1990 the relative odds of 
employment for black onward migrants stood at 58 percent of those for black direct migrants 
(OR = e = .58). By 2000 they had risen to 72 percent of the odds for direct migrants (OR = 
e™ = .72). For 1980 the likelihood of employment for black onward and direct migrants 
should be considered statistically equivalent. 

For whites we also find no evidence of an employment advantage for onward migrants. 
Indeed, like their black counterparts, white onward migrants in 1970 actually reported 
significantly lower levels of employment than direct migrants (OR = e = .75), though the 
difference between the two migrant groups was smaller for whites than for blacks. In all other 
decades, the employment differential between white onward and direct migrants was 
statistically non-significant, though consistently negative. 

How do these results regarding variation in employment status by migration history 
compare with our research hypotheses? To the extent that significant employment 
differences existed between onward and direct migrants, they reveal higher levels of 
employment for those who moved to the West directly from the South than for those who 
had resided temporarily in the Northeast or Midwest. The inference of an employment 
advantage for black direct migrants in 1970, 1990 and 2000, and for white direct migrants in 
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1970 contradicts our hypothesis (H1). However, it is consistent with the speculation that 
migrants who moved directly from the South to the West may have been more willing to take 
jobs that were deemed unattractive or unsatisfactory by those migrants who had some 
experience with the non-southern labor market (e.g., Lemke-Santangelo 1996; Rose 1975). 

Regarding racial and temporal differences in the contrast between onward and direct 
migrants, the results offer only minimal support for our hypotheses. Despite the more 
consistent pattern of an employment advantage for direct migrants among blacks, the sole 
statistically significant difference in the effect of onward migrant status between blacks and 
whites is observed for 1990. That difference is consistent with our hypothesis (H2) of a greater 
economic advantage (or in this case, a weaker disadvantage) for white onward migrants than 
for blacks. However, since we observe no significant racial variation in the relative employment 
probabilities of onward and direct migrants for the other three decades, it is inappropriate to 
infer a more general pattern from this single statistically significant difference. Nor is there a 
consistent pattern of change over time in the relative employment fortunes of onward and 
direct migrants, reflecting the transformations in the southern and northern contexts. 


Wage and Salary Income 


The results from the analysis of wage and salary income (in constant 1970 dollars) are 
reported in Table 2. In general, they offer strong evidence of an advantage for onward 
migrants over direct migrants. Beginning with the findings for black men, we find that 
employed onward migrants had significantly higher incomes than employed direct migrants 
during three time periods, with 1980 being the only exception. In 1970, net of all other 
variables in the model, black onward migrants earned an average of $526 more from wages 
during the previous year than did direct migrants. The income advantage for onward migrants 
grew to $872 in 1990 and to $1,334 in 2000. To put the magnitude of these differences in 
better perspective, the overall mean income for all black southern migrants in our sample was 
$6,148 in 1970, $7,104 in 1990 and $8,082 in 2000. Therefore, the income advantage for black 
onward migrants in 1970, 1990 and 2000 was not trivial. 

We observe a generally similar pattern for whites with onward southern migrants enjoying 
a significant income edge over direct migrants in 1970, 1990 and 2000. Although the 
differential for white migrants was marginally greater than for blacks in 1970 ($606 vs. $526), 
by 1990 and 2000 the benefits of “onward status” for black southern migrants outstripped the 
corresponding advantage among white migrants. In contrast to the trend observed for blacks, 
the income difference between onward and direct migrants for whites declined somewhat 
between 1970 and 1990, from $606 to $455. By 2000, the gap had increased to $996, but 
remained more modest than the comparable differential for blacks. Again, these income 
differentials by migration history for whites can be put into better perspective by considering 
the overall mean incomes for all white migrants in our sample which were $7,923 in 1970, 
$8,863 in 1990, and $10,553 in 2000. 

The findings for wage and salary income for blacks and whites are consistent with 
Hypothesis 1 of an economic advantage for onward southern migrants, as well as with H1a 
that the advantage enjoyed by onward migrants would be greater for income than for 
employment status. But, what do they imply for our expectations regarding variation by race 
and time period in the influence of migration history on income? Although we do observe 
variation by race within each time period in the relative income advantage for onward 
migrants, in no decade was the difference between blacks and whites statistically significant. 
Therefore, the results from the analysis of income are not consistent with Hypothesis 2 that 
white southern migrants enjoyed greater economic benefits than comparable black migrants 
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who had resided temporarily in the Northeast or Midwest. Of course, the comparison of the 
coefficients for onward migrant status for blacks and whites in Table 2 does not take into 
consideration the higher overall incomes for all white southern migrants in the West, 
compared to all black southern migrants. Aside from the absence of an income advantage for 
onward migrants of either race in 1980, the results also do not support Hypothesis 3 that 
socioeconomic and/or cultural trends within the North or South reduced the economic 
benefits of temporary residence in the Northeast or Midwest for southerners who ultimately 
migrated to the West. If anything, the income advantage for onward migrants appears to have 
intensified during the later decades, especially for blacks. 

The absence of an income advantage for onward migrants in 1980, in the face of 
significant advantages for blacks and whites in all other decades, begs for explanation. In 
supplementary analyses we further explored the employment characteristics of onward and 
direct migrants in 1980 to determine whether negative pressures in the labor market (e.g., 
lower levels of labor force participation or full-time employment) may have reduced the ability 
of onward migrants to translate human capital acquired in the Northeast or Midwest into 
higher incomes in the West. We detected no meaningful pattern to suggest such a process. 
Likewise, other socio-demographic characteristics of onward and direct migrants did not 
exhibit clear differences in 1980 that would explain the lack of an income advantage for 
onward migrants in that year. That the 1980 findings are similar for blacks and whites, 
suggests that some general characteristic of the economy in the West during this period of 
economic recession and “stagflation” was responsible. However, additional research is 
required to explain this somewhat anomalous finding. 


Occupational Status 


The results for occupational status, reported in Table 3, offer further support for our 
hypothesis of an economic advantage for onward southern migrants to the West. The 
findings for black males reveal higher occupational status for onward migrants in all four 
decades, with the differences in 1970 (3.81 points) and in 2000 (2.99 points) attaining 
statistical significance. The results for white males offer even stronger support for our 
hypothesis. In all four time periods, white onward migrants reported significantly higher 
occupational status than their counterparts who moved directly from the South, with their 
advantage varying from a minimum of 1.82 in 1980 to a maximum of 4.50 in 2000. Although 
these are rather modest differences, it should be noted that this contrast is net of all variables 
on the right-hand-side of the equation.'* Again, this evidence is also consistent with our 
Hypothesis 1a that the economic advantage for onward migrants would be greater for 
income and occupational status than for employment status. 

Our hypothesis of racial variation in the economic benefits of onward migration receives 
support only from the results for 1990 when the advantage for white onward migrants (4.39 
points) was significantly greater than the advantage for black onward migrants (1.72 points). 
In all other decades the racial difference in the effect of onward migrant status is statistically 
non-significant. Similarly, the results in Table 3 reveal no consistent evidence of a declining 
advantage for onward migrants, black or white, over time. 


The Possible Impact of Return Migration on Racial and Temporal Variation 


When combined, the results from the analyses of wage and salary income and of occupational 
Status suggest that the economic advantage for onward southern migrants was pervasive and 
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enduring. Apparently, black and white men alike were able to take advantage of their temporary 
residence in the Northeast and Midwest to bolster their incomes and occupational status when 
they moved west. And, they accomplished this within drastically different social, economic and 
cultural milieus, as represented by interregional migration between 1965 and 1970 and 
between 1995 and 2000. 

The possibility also must be recognized that shifting patterns of return migration to the 
South contributed to the persistence of an economic advantage for onward migrants across 
decades, and to the lack of racial variation in that advantage within time periods. Although a 
thorough examination of this possibility is beyond the scope of this paper, the following 
supplementary evidence is consistent with our speculation that return migration may have 
acted to maintain the advantage for onward migrants over time, and to attenuate the racial 
difference in the benefits enjoyed by onward migrants. Figure 2 shows trends in the relative 
propensity for southern migrants living in the Northeast or Midwest five years before a census 
to (1) return to the South or (2) move on to the West. Two patterns are noteworthy. First, among 
the southern-born residents of the Northeast and Midwest who left those regions, substantially 
more returned to the South than moved onward to the West. Second, among blacks there was 
a marked increase in the likelihood of returning to the South, while for whites the level of return 
migration (compared to onward migration) was relatively stable across decades. 


Figure 2. Percentage of Southern-born Migrants Leaving the Northeast or Midwest who 
Returned to the South, by Race, United States 1970-2000 
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Source: Ruggles et al, 2004. Integrated Public Use Microdata Series. Minnesota 
Population Center Version 3.0 http://www.ipums.org. 


How did the characteristics of return migrants compare to those for onward migrants? 
Figure 3 contrasts the economic characteristics of return and onward migrants during the 
earliest and most recent time periods included in our investigation, 1970 and 2000. Three 
findings deserve special attention. First, return migrants compared unfavorably with onward 
migrants for all three economic characteristics in both time periods. This difference is 
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consistent with the conclusions drawn by previous researchers (e.g., DaVanzo 1983; DaVanzo 
and Morrison 1981; Grant and Vanderkamp 1986). Second, the economic deficit for return 
migrants vis-a-vis onward migrants increased over time, indicating a relatively stronger 
“positive” selection for onward migration in 2000 than in 1970. Third, the change over time 
in the relative economic standing of return and onward migrants was somewhat greater for 
blacks than for whites. These trends in the relative likelihood of return and onward migration, 
and in the relative economic characteristics of the two groups, are consistent with the 
possibility that changing patterns of return migration influenced both the longevity of, and 
racial variation in, the economic advantage for onward migrants. In short, perhaps onward 
migrants continued to enjoy an economic advantage over direct migrants during later 
decades partially because an increasing percentage of less successful southern migrants 
decided to return to the South rather than move on to the West. And, because that trend was 
stronger for blacks than for whites, it also acted to attenuate the expected racial difference 
in the economic benefits associated with onward migrant status. This possibility further 
highlights the importance of return migration as a factor influencing the socioeconomic profile 
of migrant populations as well as the potential value of future research on that topic. 


Conclusion 


There is a tendency by social scientists, as well as by longtime residents of destination areas, 
to overlook sources of heterogeneity within migrant populations. Within the vast literature on 
the Great Migration from the South, researchers regularly have distinguished the differential 
experiences of blacks and whites, and occasionally the differential experiences of males and 
females. It is less common to dig more deeply into the recent histories of southern migrants 
to assess their implications for the adaptation experiences of the migrants in their new places 
of residence. In this paper, we have attempted to “unpack” the larger group of southern 
migrants to the West during the Post-World War II period to determine whether those who 
had spent time in the Northeast or Midwest enjoyed economic benefits over those who had 
moved directly from the South. 

The results of our statistical analyses reveal interesting differences between onward and 
direct migrants that are only partially consistent with our research hypotheses based on 
migration theory, previous research on the Great Migration, and well-known social and 
economic trends within the North and South. In support of our primary research hypothesis, our 
findings show that, in general, onward migrants enjoyed larger incomes and higher occupational 
status than their counterparts who had moved directly from the South. Also consistent with our 
hypothesis, the advantage for onward migrants was greater for income and occupational status 
than for employment. Indeed, our results are suggestive of an employment disadvantage for 
onward migrants, though in most cases the differences in employment between onward and 
direct migrants were not statistically significant. The employment results are somewhat 
supportive of speculation that migrants leaving the South had lower expectations regarding 
acceptable employment opportunities in their new locations (Lemke-Santangelo 1996; Rose 
1975). Because that speculation has been based largely on the experience of black southern 
migrants, perhaps it should not be surprising to find a more consistent pattern of employment 
advantage among black direct migrants than among whites. 

In contrast, our findings did not reveal consistent patterns of variation by race or time 
period in the economic advantage for onward migrants. Although they contradict our research 
hypotheses, these findings of racial and temporal uniformity in the economic benefits of 
being an onward migrant are important and interesting in their own right. They suggest that 
black southern migrants were as successful as white southern migrants in (1) exploiting the 
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Figure 3. Economic Characteristics of Onward and Return Migrants from the Northeast and 
Midwest in 1970 and 2000, by Race 
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Figure 3. Economic Characteristics of Onward and Return Migrants from the Northeast and 
Midwest in 1970 and 2000, by Race (continued) 
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knowledge they had gained through their previous migration experience, and/or (2) acquiring 
additional human capital through their temporary residence in the Northeast or Midwest. This 
is a notable achievement, given the significant challenges that black males faced in the urban 
economies of the rust belt regions of the United States during this time period (e.g., Wilson 
1987, 1996). They also suggest that the dramatic social, economic and cultural changes in the 
South during the last half of the 20th century were not able to erase the economic advantage 
of southern migrants who first headed north rather than west. While we are not able to offer 
a definitive explanation for the persistence of the advantage for onward migrants over time, 
given the substantial transformations in the North and South, the circumstantial evidence 
hints strongly at the influence of changing levels and patterns of return migration from the 
Northeast and Midwest to the South. Finally, they point to the possibility of an economic 
advantage for onward migrants that might be generalized to other settings and time periods. 
These issues offer a useful agenda for future migration research. 

The flow of southerners into the West contributed significantly to the dramatic social, 
demographic and economic changes that occurred within the region. Gradually, California 
surpassed New York as the nation's most populous state. African Americans became a large 
and prominent minority group within certain western locales, especially in large urban areas 
such as Los Angeles, Oakland, Richmond and Seattle. Southern culture made its mark on the 
diet, music and lifestyle of the West. Indeed, one important aspect of the geographic dispersion 
of the southern-born population has been the accompanying diffusion of southern culture (e.g., 
Gregory 1989, 2005). To a considerable extent, the “story” of the West is the story of the various 
populations that have moved there — including southerners. The evidence that we have 
presented in this paper furthers our understanding of the southern migrants. And, by 
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considering the diversity of their experiences, our findings emphasize the value of treating 
migrant populations as the heterogeneous groups that they are. By doing so it is also possible 
to tell more nuanced stories about migration streams and the destinations that they affected. 


Notes 


1. 


Another possible difference between onward and direct migrants is the proportion who 
had made arrangements for employment in their new locations before they moved, rather 
than migrating in a general search for opportunity. However, we are reluctant to 
emphasize this issue because, without additional information, it is difficult to make a 
strong case that such a difference favored either of our migrant groups. 


Of course, the specific motivations that produced a stream of African American migrants 
from the South to the West would have included both economic and non-economic factors. 
And, the intensity of each motivation would have varied from individual to individual. 


We use the public use samples for 1970 through 2000 as prepared and made available 
by the Integrated Public Use Microdata Series (IPUMS) project at the Minnesota 
Population Center. The reader is referred to Ruggles et al. (2004) or the IPUMS website at 
http:/Awww.ipums.umn.edu/ for more information. For 1970 we used the 1% Form 1 and 
Form 2 State and Metro samples; for 1980 and 1990 we used the 5% State samples, for 
2000 we used the 5% Census sample. 


For example, we estimate that there were only 16 black onward migrants and 84 black 
direct migrants in 1940 PUMS file and only one black onward migrant and nine black 
direct migrants in the 1950 PUMS file. 


These selections yield a total of 44,781 white southern migrants in the West in 1970, 
32,747 in 1980, 65,556 in 1990, and 62,028 in 2000. For blacks the comparable totals are 
10,416 in 1970, 8,852 in 1980, 15,738 in 1990, and 14,468 in 2000. The effective sample 
sizes used for the multivariate analyses vary according to the dependent variable because 
of the different selection criteria used for the inclusion of cases. 


In supplementary analyses we used an alternative measurement strategy for employment 
status that contrasted employed and unemployed persons in the labor force, omitting 
those not in the labor force. The results of those analyses were virtually identical to the 
results obtained with the measurement strategy used here. 


There are important limitations to measuring migration histories based only on state of 
birth, state of current residence, and state of residence five years in the past (e.g., 
Cromartie and Stack 1989; DaVanzo and Morrison 1981). Especially problematic are 
multiple moves that are not captured by considering only residence at these three distinct 
points in time. Return migration that “erases” a previous move will also not be detected 
by this measurement strategy. Despite those potential problems, this is the standard 
approach that has been used to measure recent migration using PUMS data. 


Note that a case could be made for also excluding education as a control in the analyses 
for employment status and income. That is, educational attainment and quality were a 
form of human capital that onward migrants may have acquired during their temporary 


10. 


A 


he. 


13: 


14. 


15: 


Southern Migrants in the West * 1659 


residence in the Northeast or Midwest, especially those who moved from the South at a 
young age and completed all of their schooling in the North, and before migrating to the 
West. Therefore, our estimates of the differences between onward and direct migrants 
in those analyses might be considered as somewhat conservative. 


For 1970, 1980 and 2000, we conducted supplementary analyses that further distinguished 
residents within metropolitan areas by whether they lived in the central city or a suburb. The 
information required for that distinction is not available in the 1990 PUMS data. The results 
obtained from those supplementary analyses were very similar to those reported here. For 
consistency, we use the simpler metropolitan vs. nonmetropolitan dichotomy for all 
decades. Information indicating metropolitan or non-metropolitan residence is missing for 
some cases in all years. In order to retain those cases in our analysis, we include a control 
variable that indicates that place of residence is “missing.” Although it is included in all 
models, the coefficients and standard errors for this variable are not reported in the tables. 


Twelve dummy variables are included on the right-hand-side of the prediction equations to 
account for residence in 13 western states, with California used as the reference category. 
To avoid clutter in the tables, the coefficients and standard errors corresponding to the 
dummy variables representing state of residence are not reported. Those results are 
available from the authors upon request. 


In supplementary analyses for 1970 and 1980 we also included a control for the subject's 
employment status, working vs. not working five years prior to the census, in an effort to 
tap the economic success of migrants prior to moving to the West. Because controlling 
for prior employment did not affect the substantive conclusions regarding economic 
differences between direct and onward migrants, and because comparable information 
for prior employment is not available for 1990 and 2000, we do not include this control in 
the models reported here. For 1990 and 2000, employment status one year before the 
census is reported. In many cases, this will refer to the respondent's post-migration 
employment status. 


With direction predicted, a p-value of .10, using a two-tailed test, is the same as a p-value 
of .05 using a one-tailed test. 


Odds ratios (OR) are obtained by exponentiating the coefficients from the logistic 
regression analysis. An odds ratio of 1.0 suggests equivalence between the reference and 
index groups on the dependent variable, in this case employment. Odds ratios greater 
than 1.0 indicate a higher likelihood of employment for the index group than for the 
reference group, while a ratio of less than 1.0 indicates a lower likelihood of employment 
for the index group. We do not report odds ratios in Table 1 in order to avoid clutter. 


The mean SEI scores for blacks were 24.9 in 1970, 29 in 1980, 32.3 in 1990, and 37.4 in 
2000. The comparable means for whites were 35.3, 39.9, 42.7 and 47.7. 


Caution must be exercised in using these post-migration economic characteristics to draw 
inferences about the degree of positive or negative selection operating for return or 
onward migrants. For example, the different contexts to which the two groups of migrants 
moved may have had a significant influence on their employment, income, and 
occupational status in their new locations. For these reasons, the evidence in Figure 3 
must be considered illustrative, rather than definitive. 
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Building Community: 
The Neighborhood Context of Social Organization 


Sapna Swaroop, University of Chicago 
Jeffrey D. Morenoff, University of Michigan 





Abstract 

This study explores how neighborhood context influences participation in local social 
organization through a multilevel-spatial analysis of residents in Chicago neighborhoods. We 
construct a typology of community participation based on two dimensions: instrumental vs. 
expressive motivations for participation and formal vs. informal modes of participation. Both 
instrumental and expressive participation are generally higher in more disadvantaged 
neighborhoods. However, the association is nonlinear for instrumental organization, such that 
beyond a certain threshold, additional increases in disadvantage are associated with diminishing 
rates of participation. Rates of instrumental participation are also higher in neighborhoods 
where residents perceive more disorder. Rates of expressive participation are higher in more stable 
neighborhoods. These findings suggest that theories of urban poverty and social need are more 
applicable to instrumental forms of social organization, whereas the systemic perspective is more 
applicable to expressive forms. Finally, most forms of participation are related to the 
characteristics of both the immediate neighborhood and surrounding geographic areas. 


A long tradition of work in urban sociology suggests that resident participation in 
neighborhood life is a cornerstone of successful communities (e.g., Jacobs 1961; Alinsky 
1971; Putnam 2000). Most recently, research on so-called “neighborhood effects” has 
focused on how living in neighborhoods characterized by high levels of social cohesion, 
interaction among residents and collective efficacy is related to better physical and mental 
health among adults and a lower incidence of problem behaviors among youth. (See reviews 
by Gephart 1997; Sampson, Morenoff and Gannon-Rowley 2002; Morenoff and Lynch 2004.) 
Additionally, policymakers have acknowledged that resident participation is fundamental to 
community health and sustainable development (e.g., World Bank 2002; MacArthur 
Foundation 2004; U.S. Dept. of Housing and Urban Development 2004). 

Scholars of urban poverty have suggested that the hardships associated with 
concentrated poverty lead to distrust among neighbors and withdrawal from various forms of 
community life, including social interaction among neighbors and collective participation in 
local voluntary associations, or local social organization (Massey and Denton 1993; Wilson 
1987; 1996). However, this argument overlooks the possibility that the demand for some 
forms of community organization — such as those that address neighborhood problems 
related to crime and disorder — may be greater in poor neighborhoods, which might boost 
participation rates in some activities. To date, empirical research has not systematically 
evaluated the relationship between concentrated poverty and community participation across 
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a wide spectrum of outcomes related to local social organization, nor has anyone offered a 
conceptual framework for understanding how different dimensions of community social 
organization might be related to neighborhood poverty. 

In this paper, we investigate the conventional sociological wisdom that neighborhood 
poverty discourages individuals from participating in their communities and thus undermines 
local social organization. Our analysis extends previous research by (1) clarifying how 
neighborhood context affects different forms of-participation in local social organization, (2) 
examining the neighborhood mechanisms through which concentrated poverty is related to 
community organization, and (3) taking into account contextual influences on social 
organization that emanate from beyond the geographic boundaries of the local neighborhood. 


Contextual Effects on Community Social Organization 


Urban Poverty Perspective 


Much of the recent interest in community participation stems from Wilson's theory of social 
isolation (1987; 1996), which emphasizes the pernicious consequences of concentrated 
socioeconomic disadvantage for employment opportunities, family structure and local social 
organization. Community organizations may suffer in neighborhoods that lack a critical mass 
of middle-class residents (Wilson 1996) and local business leaders (Logan and Molotch 1987). 
Moreover, in neighborhoods beset by high levels of crime and disorder, fear of victimization 
can compel individuals to withdraw from community life (Rainwater 1970; Stack 1974; Wilson 
and Kelling 1982; Skogan 1990; Furstenburg 1993). 

Despite the theoretical power of this argument, empirical analysis yields mixed results. 
Sosin (1991) and Stoll (2001) both fail to find a significant relationship between neighborhood 
poverty and individuals’ participation in neighborhood organizations, but Fernandez and Harris 
(1992) show that neighborhood poverty is associated with lower rates of participation in 
certain types of organizations. Rankin and Quane (2000) also find that neighborhood poverty 
is associated with lower levels of participation in community activities, but that the 
association is nonlinear, such that levels of participation actually increase in neighborhoods 
with the highest poverty rates.' However, most previous studies (except Rankin and Quane 
2000 and Stoll 2001) use ordinary least squares regression instead of appropriate multilevel 
models. Moreover, most studies (except Stoll 2001) rely on samples composed exclusively of 
African Americans living in poor neighborhoods and thus cannot compare how community 
participation varies across a wide range of socioeconomic contexts. No previous study has 
assessed the mediators through which neighborhood poverty might be related to community 
social organization. 


Systemic Perspective 


The systemic model of community life extends Shaw and McKay's (1942) influential theory of 
social disorganization to highlight the role of residential stability in building and maintaining local 
social organization (Kasarda and Janowitz 1974). This perspective emphasizes that limited 
mobility in and out of a neighborhood enhances opportunities to develop friendships and 
participate in local organizations (Kasarda and Janowitz 1974; Sampson 1988; Bursik 1988). By 
and large, empirical studies support the claim that social ties and neighborly interaction thrive 
in residentially stable neighborhoods (Guest and Lee 1983; Sampson 1988: Sampson and 
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Groves 1989; Taylor 1996; Sampson, Morenoff and Earls 1999; Veysey and Messner 1999). 
However, Sampson and colleagues’ (1988; 1989) research also shows that residential stability is 
unrelated to some forms of social organization, particularly participation in formal neighborhood 
organizations. These findings indicate that residential stability may be important for encouraging 
informal exchanges among neighbors, which often rely on long-standing social ties, but that it 
may not be as big a factor in promoting participation in local formal organizations. These 
differences in the effects of residential stability on different aspects of social organization 
suggest that some qualification of the systemic model may be in order. 


Social Needs Perspective 


The “social needs” hypothesis diverges from the main premises of the urban poverty and 
systemic perspectives. According to this view, elevated rates of crime, victimization and 
physical dilapidation in poor neighborhoods actually motivate residents to participate in formal 
and informal community organizations in order to address these problems. Residents may 
become involved in the community in an attempt to protect their property and personal safety 
(Janowitz 1967; Suttles 1972). ; 

Previous research on community participation and indicators of social need offers mixed 
results. Some ecological-level studies demonstrate that neighborhoods experiencing physical 
deterioration have higher rates of community participation (Perkins et. al. 1990; Taylor 1996; 
Perkins et. al. 1996), but other analyses suggest that these neighborhoods have lower rates of 
participation (Skogan 1990; Perkins et. al. 1996). These studies do not address the factors that 
shape individuals’ participation in local social organization. Woldoff’s (2002) individual-level 
analysis, however, shows that residents’ perceptions of physical disorder are unrelated to 
their participation in informal problem-solving activities. Previous studies have been 
conducted either at the ecological or individual level, but there are few multilevel studies that 
investigate how the presence of neighborhood social problems influences individuals’ 
community participation. 


Extending Previous Research 


The research on neighborhood context and community participation is difficult to synthesize 
because previous studies give little attention to distinctions among different types of social 
organization. Previous research also focuses exclusively on the conditions present in the 
immediate neighborhood, neglecting the possibility that conditions in surrounding areas may 
influence community participation. Below, we present two ideas for unifying and extending 
work on the contextual sources of local social organization. 

First, building on research by Guest and colleagues (1983; 2002) and others (Gordon and 
Babchuk 1959; Marcus 1960), we propose that closer theoretical attention should be given to 
differences in the activities that comprise social organization, particularly the distinctions 
between expressive and instrumental forms of community participation. Expressive acts are 
motivated by one’s sense of identity as a “neighbor” and include behaviors such as social 
exchange with neighbors and participation in groups designed to promote feelings of 
“community” among residents. Instrumental acts are motivated by the functional and political 
concerns of neighborhood residents and include membership in organizations designed to 
protect community interests, as well as other types of informal problem-solving. The 
distinction between these two forms is fluid because participation may take on a more 
expressive or instrumental bent — or both — depending on the situation (Logan and Molotch 
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1987; Sampson et. al. forthcoming). Still, distinguishing between the principal functions of 
these two modes of participation is valuable, particularly for assessing the effectiveness of 
social organization. In this study, we develop a typology of community social organization to 
address the differences between expressive and instrumental forms of participation. 

Second, we assess whether participation in neighborhood social organization is a spatially 
dependent process. In this case, spatial dependence means that a person's decision to 
participate may be influenced not only by what happens within his or her neighborhood, but 
also by what is happening in neighborhoods that are geographically proximate. Spatial 
dependence may arise from a diffusion process through which participation spreads across 
neighborhoods through social networks, which may be one of the primary pathways through 
which people become involved in local organization (Booth and Babchuk 1969; Snow et. al. 
1980: McPherson et. al. 1992). If such networks span the geographic boundaries of statistical 
neighborhoods (e.g., census tracts), then levels of participation in one neighborhood can 
affect levels in adjacent neighborhoods. Another source of spatial dependence is that 
contextual influences can spread from one neighborhood to another through spatial 
externalities (Anselin 2003: Morenoff 2003), if, for example, some residents participate in 
order to keep crime and disorder in geographically proximate neighborhoods from spreading 
to theirs. Finally, spatial dependence can also arise from measurement error if survey 
respondents’ definitions of their neighborhoods do not correspond to the geographic 
boundaries imposed by the census, thus generating discrepancies in self-reports of 
participation in neighborhood organizations. 


Data 


This study utilizes multilevel data drawn from the 1995 Project on Human Development in 
Chicago Neighborhoods (PHDCN) and the 1990 United States Census. The PHDCN 
Community Survey was a representative probability sample of 8,782 Chicago residents aged 
18 and older in 1995 drawn from 343 “neighborhood clusters,” which were constructed by 
aggregating Chicago's 865 census tracts. Decisions about which tracts to combine were 
informed by local geographic knowledge (e.g., ecological boundaries such as parks, railroad 
tracks and freeways) and a cluster analysis of census data (Sampson, Raudenbush and Earls 
1997). The average size of an NC is 8,000 people. A multistage sampling procedure ensured 
adequate sample sizes within each neighborhood to construct reliable between- 
neighborhood measures (Sampson, Raudenbush and Earls 1997). Face-to-face interviews 
were conducted with a 75 percent response rate. Our final analytic sample size varies for each 
dependent variable depending on the number of cases with missing data.* We also construct 
measures of neighborhood structural characteristics from the 1990 United States Census. 


Social Organization Measures 


In Figure 1, we present a typology that we used to generate measures of community social 
organization, based on (1) whether the motivation for community participation is expressive 
or instrumental, and (2) whether the type of participation is informal or formal. Table 1 displays 
descriptive statistics for each of the four outcome measures described below. 

Our measure of formal instrumental participation is a count of the number of neighborhood 
instrumental organizations to which the respondent (or any member of the household) belongs, 
including neighborhood watch programs, block groups (or tenant associations or community 
councils), and neighborhood ward groups (or local political organizations). We consider these 
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organizations to be instrumental because they are explicitly dedicated to maintaining or 
improving conditions within a neighborhood. Participation in block/community groups is the 
most common form of formal instrumental participation, with about 11 percent of respondents 
reporting that they or a member of their household belongs to such an organization. We 
measure informal instrumental participation as a count of the number of neighborhood 
problem-solving actions the respondent (or any member of the household) exhibited in the past 
12 months, which include speaking with a politician, speaking with a minister, attending a 
meeting of a neighborhood group, or getting together with neighbors to take action about a 
problem.’ We classify this scale as instrumental because all of the above actions aim to 
address local problems and informal because they do not involve participation in a formal 
organization established to deal with such problems. Descriptive statistics from Table 1 
indicate that residents tend to engage in informal problem-solving actions more than they 
participate in formal instrumental organizations. 


_ Figure 1. Typology of Community Social Organization 
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Our measure of formal expressive participation is a count of how many of the following 
neighborhood expressive organizations the respondent, or any member of the respondent's 
household, belongs to: religious organizations, ethnic/nationality groups, and civic groups 
such as the Elks and Masons. We classify these groups as expressive because their primary 
function is to build social networks and promote a sense of community among participants. 
Table 1 shows that about 32 percent of respondents participate in religious groups, while only 
1 percent of respondents participate in ethnic/nationality groups, and 1 percent participate in 
civic groups. Informal expressive participation is measured by a scale of informal neighboring © 
which takes the average of the following five items: how often people in the neighborhood do 
favors for each other, watch over someone's property when that person is not at home, ask 
each other for personal advice, have get-togethers where other people in the neighborhood 
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are invited, and visit in each other's homes or on the street (a = .80). Respondents chose 
whether they never, rarely, sometimes, or often engage in these behaviors, and responses 
were coded from 1 to 4, with higher values representing more neighboring. We classify 
neighboring as primarily expressive.® 

Recent studies suggest that Chicago seems reasonably close to other U.S. urban areas in 
terms of participation in local social organization. Data from the Multi-City Study of Urban 
Inequality (1994) indicate that the neighborhood block group participation rate in Chicago (11 
percent) is comparable to that in Los Angeles (10.5 percent) but slightly lower than in Boston 
(17.9 percent). However, a comparison with unpublished statistics from the Los Angeles Family 
and Neighborhood Survey (2001) indicates that participation in some types of informal 
neighboring (e.g., doing favors for neighbors and watching their property) may be higher in 
Chicago than Los Angeles. 


Independent Variables 


Descriptive statistics for both the neighborhood- and individual-level independent variables 
are provided in Appendix A. 


Neighborhood Characteristics 

Building on previous research (Sampson, Raudenbush and Earls 1997; Sampson, Morenoff 
and Earls 1999; Morenoff, Sampson and Raudenbush 2001; Morenoff 2003), we construct 
four Census-based measures of neighborhood sociodemographic structure, each of which is 
based on the summation of equally weighted z-scores divided by the number of items.° To 
assess the urban poverty thesis, we construct an index of concentrated disadvantage based 
on the proportion of neighborhood residents living below the poverty line, the proportion of 
female-headed households, the proportion of families receiving public assistance, and the 
unemployment rate. We include a measure of residential stability to test the claims of the 
systemic model, using a scale which incorporates the percentage of residents 5 years old and 
older who lived in the same house five years earlier and the percentage of owner-occupied 
homes. Other structural variables include Latino/immigrant concentration (the average of 
percent Latino and percent foreign-born) and population density, which measures persons per 
square kilometer. 

We also consider several neighborhood factors that might mediate the effects of 
neighborhood structural characteristics on participation in local social organization. One such 
factor is the number of community institutions in a given neighborhood, which may shape 
residents’ opportunity to participate in formal organizations as well as their opportunity to 
build informal ties through social interaction with others in these organizations. We construct 
a neighborhood-level measure of community institutions based on an aggregation of PHDCN 
Community Survey respondents’ reports of how many of the following institutions are 
present in the neighborhood: a block group (or other group dealing with local issues, such 
as a tenant association), crime prevention program (e.g., neighborhood watch), or community 
newspaper (or newsletter or bulletin). We follow Raudenbush and Sampson's (1999) 
procedure for constructing neighborhood-level scales from individual-level survey data using 
the empirical Bayes (EB) residuals of a multilevel measurement model, which adjusts for 
measurement error through a three-level model with separate variance components for scale 
items, persons and neighborhoods (neighborhood-level scale reliability is .82).’ We also use 
this method to construct another set of neighborhood measures from the PHDCN 
Community Survey that tap the presence of physical and social disorder, based on the theory 
that the presence of such incivilities in a neighborhood provides visual cues of social need. 
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Neighborhood physical disorder is a three-item scale indicating how much of a problem 
(where responses are coded as 1 for “not a problem,” 2 “somewhat of a problem,” and 3 “a 
big problem”) the respondent perceives each of the following to be in her/his neighborhood: 
(1) litter, broken glass or trash on the sidewalk or streets, (2) graffiti on the buildings and walls, 
and (3) vacant and deserted houses or storefronts (neighborhood-level scale reliability is .88). 
Neighborhood socia/ disorder is also a three-item scale consisting of respondents’ 
assessments (using the same response categories) of how much of a neighborhood problem 
is (1) drinking in public, (2) people selling or using drugs, and (3) groups of teenagers or adults 
hanging out and causing trouble (neighborhood-level scale reliability is .87). All the 
neighborhood-level variables (Census measures and EB residuals) have been standardized 
around a mean of zero and a standard deviation of one to place them on a common metric. 


Individual-Level Controls 

Although our focus is on the neighborhood sources of local social organization, we also 
control for potentially confounding individual-level characteristics. We consider a number of 
demographic variables, including gender (fema/e = 1) and age (in years — both linear and 
quadratic terms). We code race/ethnicity with four dummy variables: Non-Latino white 
(omitted), non-Latino, African American, Latino, and non-Latino other races. We also use 
dummy variables to indicate marital status: sing/e (omitted), married, separated/divorced or 
widowed. Our measures of socioeconomic status include education (years of education 
completed), family income and the respondent's occupationa/ status (higher scores 
correspond to jobs with higher average incomes). We control for the individual's level of 
investment in the community with a dummy variable indicating whether the respondent is a 
homeowner, a count variable indicating the number of residential moves the respondent has 
made within the past five years, and a measure of the respondent's number of years in the 
neighborhood (living at the same address). We also include respondents’ perceptions of their 
neighborhood size (the logged number of blocks a respondent considers the neighborhood) 
to control for the possibility that participation in a neighborhood activity is more likely among 
people who have more expansive geographic definitions of their neighborhood. 


Methods 


Multilevel analyses of the four measures of local social organization are conducted using 
hierarchical modeling techniques (Raudenbush and Bryk 2002). Hierarchical models for 
multilevel data consist of two equations estimated simultaneously: a level-1 (individual-level) 
model and level-2 (neighborhood-level) model. The level-1 model is either a linear model (for 
informal neighboring) or a generalized linear model (a Poisson model for count variables, 
including participation in expressive and instrumental organizations and problem-solving 
actions). The linear model is written as Yi) = Boj + 2g Bg Xqiji + ej where Y;7 is the score on 
the informal neighboring scale for respondent / in neighborhood /; Bgj is the intercept; Xyj7 is 
the value of covariate q; and Bq is the partial effect of that covariate on informal neighboring. 
The person-specific error term, €j7, is assumed to be independently, normally distributed with 
constant variance o . The generalized linear model views the count variable, Yj; for person / in 
neighborhood /, as sampled from an over-dispersed Poisson distribution. It replaces Yj; with the 
natural log link, nj = log (Ajj), where Aj is the event rate, assuming constant exposure across 
persons, and nj is the log of the event rate (Raudenbush and Bryk 2002). 

The lével-2 model is the same in both the linear and generalized linear cases. The intercept 
from level-1, Bj, is allowed to vary randomly across NCs: Boj = Y00 + Bo YOs W sf + Hop 
where yqq is the average value of the outcome across all neighborhoods, yg. are the 
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neighborhood-level regression coefficients, Wej are the neighborhood-level predictors, and 
gj is the unique increment to the intercept associated with neighborhood j assumed to be 
normally distributed with variance r. 


Spatial Models 

Spatial regression models are estimated through an autoregressive process in the dependent 
variable known as a “spatial lag” model,° Y = p WY + XB + e, where 9 is the spatial 
autoregressive parameter, WV is a weights matrix that expresses a form of spatial association 
among each pair of neighborhoods (in the analysis below it is a binary contiguity matrix), X is 
a matrix of exogenous explanatory variables with an associated vector of regression 
coefficients B, and € is a vector of normally distributed, random error terms. The spatial 
autoregressive parameter p can be interpreted as the effect of a one-unit change in WY on Y. 
However, rewriting the spatial lag model in its reduced form where all endogenous variables 
are expressed as functions of the exogenous variables, shown in equation (1) below, indicates 
that the p coefficient not only captures the effects of spatial proximity to Y in other locations, 
but also spatial proximity to the observed and unobserved predictors of Y (Anselin 2003, 
Morenoff 2003): 


Y=XB+pWXB + p2 W2 XB+... +p W XB +e + pWe + p2W2 e+...+ (MWe, (1) 


Where m—+ 00, The key point revealed by this equation is that the spatial effect, represented 
by p, could result from either spatial externalities from the observed X variables or a spatial 
diffusion effect of community participation from surrounding areas that operates through 
unobserved factors (e.g., social networks). We use a two-step procedure to estimate 
hierarchical spatial models (Morenoff 2003), in which each outcome variable is first adjusted 
for the potentially confounding effects of individual-level covariates using coefficients from 
multilevel models, and then the adjusted outcomes are regressed on neighborhood-level 
covariates and a spatial lag term in spatial regression models." 


Results 


The results of the multilevel analyses for the four social organization outcome variables are 
presented in Tables 2 through 5." The first model in each table shows the association 
between the outcome and neighborhood-level structural variables before adjusting for 
individual-level controls, which we then add in the second model. In subsequent models 
we introduce neighborhood-level covariates that may mediate the effects of the structural 
variables.” 

Results from models of participation in instrumental organizations are reported in Table 2. 
The relationship between neighborhood disadvantage and participation in instrumental 
organizations is non-significant in model 1 but becomes significant after adding individual- 
level controls in model 2. Supplemental analysis revealed that this association only emerges 
after controlling for individual-level indicators of socioeconomic status (SES) — education, 
income and occupational status — implying that individuals of lower SES are more likely to 
live in disadvantaged neighborhoods and less likely to participate in instrumental 
organizations. After we separate out the person-level effects of SES from the effects of living 
in a poor place, we find that neighborhood socioeconomic disadvantage is actually 
associated with greater rates of participation in instrumental organizations, but that the 
association is nonlinear, as we demonstrate in Figure 2."° The dotted line shows that the 
association between neighborhood disadvantage and participation in instrumental 
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organizations Is positive throughout most of the distribution of disadvantage, but that among 
neighborhoods with the highest levels of disadvantage — starting at about one standard 
deviation above the mean — additional increases in disadvantage are associated with 
declining rates of participation. One standard deviation above the mean on the disadvantage 
scale corresponds to a neighborhood poverty rate of about 33 percent, which is just under 
the 40 percent poverty rate that Jargowsky (1997) and Wilson (1996) classify as “ghetto 
poverty.” Other results from models 1 and 2 in Table 2 indicate that the rate of participation 
in instrumental organizations is lower in neighborhoods with more Latinos and immigrants“ 
and in more residentially stable neighborhoods. However, the latter effect is reduced to non- 
significance in subsequent models. 

Models 3 through 5 in Table 2 control for potential mediators of the relationships 
between neighborhood structural characteristics and participation in instrumental 
organizations. The results of model 3 show that the presence of community institutions 
is positively associated with participation in instrumental associations. The positive effect 
of disadvantage becomes even stronger after adjusting for the presence of community 
institutions, because community institutions are perceived to be less prevalent in poor 
neighborhoods. Model 4 shows that physical disorder is not related to participation in 
instrumental organizations, but model 5 indicates that social disorder is positively 
associated with participation and partially mediates the positive relationship between 
concentrated disadvantage and participation, supporting the social needs hypothesis. The 
continued negative effect of the quadratic term for concentrated disadvantage suggests 
that extremely high levels of disadvantage deter participation in instrumental organizations 
(consistent with the urban poverty perspective), even though the presence of social 
disorder encourages participation. 

Table 3 displays results for informal instrumental forms of community participation (i.e., 
problem-solving behaviors). As was the case with participation in instrumental organizations, 
the effect of concentrated disadvantage on participation in problem-solving behaviors only 
becomes significant after the introduction of individual-level controls (particularly those for 
person-level SES) and is nonlinear. The solid line in Figure 2 graphs this relationship, using 
results from model 2, and it shows that participation in informal instrumental activities peaks 
at about two standard deviations above the mean on the concentrated disadvantage index, 
which corresponds to a poverty rate of about 60 percent. The effects of concentrated 
immigration become non-significant after adding individual-level covariates in model 2; the 
effects of residential stability become non-significant after adding community institutions in 
model 3. The lack of a significant relationship between residential stability and problem- 
solving actions suggests that the systemic model may not apply to instrumental forms of 
social organization. 

The results of the “mediator” models (models 3-5) in Table 3 are also similar to those from 
Table 2. In both tables, the presence of community institutions has a positive effect on 
instrumental participation as does the presence of social disorder. However, higher levels of 
physical disorder are also related to higher rates of problem-solving behavior in model 4, 
unlike the results for participation in instrumental organizations. Controlling for physical 
disorder also reduces the effect of disadvantage, but the continued negative effect of the 
quadratic term for concentrated disadvantage again suggests that extremely high levels of 
disadvantage deter participation in problem-solving activities. Controlling for social disorder in 
model 5 reduces the effect of disadvantage to non-significance. Overall, both physical and 
social disorder are associated with higher rates of involvement in problem-solving behaviors, » 
and both appear to mediate the effects of neighborhood disadvantage, providing support for 
the social needs hypothesis. 
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We next turn to the results for participation in expressive organizations in Table 4. The 
negative effect of concentrated disadvantage on participation in model 1 becomes non- 
significant after controlling for individual-level covariates in model 2. Higher levels of 
residential stability are associated with higher rates of participation in expressive 
organizations, as predicted by the systemic model; this association is significant across all 
model specifications. People living in neighborhoods with higher concentrations of 
Latinos/immigrants tend to participate more in expressive organizations, which include 
ethnic/nationality groups. Model 3 indicates that the presence of community institutions is not 
associated with participation in expressive organizations, which is not surprising because the 
types of organizations included in our community institutions scale (e.g., block groups, crime 
prevention programs and community newspapers) do not closely correspond to the types of 
organizations that comprise our expressive organizations measure (e.g., religious 
organizations, ethnic groups and civic groups). Models 4 and 5 show that neither physical nor 
social disorder has a significant coefficient, suggesting that participation in expressive 
organizations may be less responsive to perceptions of social needs than was the case with 
participation in instrumental organizations. 

Table 5 presents results from models of informal neighboring. Models 1 and 2 show 
negative or no effects of concentrated disadvantage, but the association between 
disadvantage and informal neighboring becomes positive and significant after adjusting for 
any of the neighborhood mediators in models 3-5. This indicates that after the weakened 
institutional resource base and high levels of disorder in poor neighborhoods are accounted 
for, residents of these neighborhoods are more likely to engage in informal interactions with 
their neighbors than are residents of wealthier neighborhoods. Residential stability is 
associated with higher levels of informal neighboring, and this result is robust across all 
model specifications, which is similar to what we found in the models of participation in 
expressive organizations. Latino/immigrant concentration is not related to informal 
neighboring (except in model 1), and population density is associated with less neighboring. 
Thus, the results from Tables 4 and 5 support the systemic perspective as it pertains to 
expressive participation but not the urban poverty or social needs perspectives. 

In Table 6 we consider the spatial dynamics of local social organization. Each of the 
outcomes has been adjusted for within-neighborhood differences on individual-level 
covariates."° In the interest of parsimony, we present only one model per outcome, in which 
we include the structural neighborhood covariates, the community institutions variable, and a 
combined neighborhood disorder scale.” In all cases but one, the spatial dependence term, 
p, iS significant and positive. Only in the case of participation in informal instrumental activities 
do we not find significant spatial dependence. Where p is significant, it represents the degree 
of association between characteristics of surrounding neighborhoods and the mean level of 
participation in social organization (adjusted for individual-level composition effects) in the 
“focal” neighborhood. For example, the significant spatial dependence coefficient in model 
1 indicates that the influence of covariates measured in the first-order neighbors is about one- 
third as strong as it is in the focal neighborhood (because p =.302). This means that a one- 
Standard deviation increase in the average level of concentrated disadvantage in adjacent 
neighborhoods is associated with a 5.8 percent (.193*.302* 100) increase in participation in 
instrumental organizations. 


Conclusion 


Our findings offer several new insights into prevailing theories of social organization. First, we 
find little support for the hypothesis that residents of disadvantaged neighborhoods are less 


The Neighborhood Context of Social Organization + 1685 


SoS. | iy ads 





(110°) 100° (€10°) 200" (v10') 910" » (920) GG0" yd90J9}u| 

(€Z0’) 800° (220) or) = zue (peo) vel se PO.) OLL JEpsosiq |e00g pue jealskyd 

sx 1 AOLO) G20’ EL eo) 260 S Peck (tO) 801° xx (920°) ZS/ SUOINySU| AyUNWWOD 
2 4518) ve0"- (210) Z10° (pL) L10°- (pZ0’) 800° Aysueq 
(S10) 600° (210) 700° (p10) 900° (p20') 800° UolejUBOUOD Jue WWU|/OUN}e7 

. B 4619) oa 2h & i910) 6y0' ae (GLO) S0- «x ($20) 80> AyigeyS 
» — (600) 810° x (910) 9¢0°- pasenbs abeyuenpesiq 

+ = 4610) Zr0" ; = K€20) roo BT @ (Z0) €10° wx (8H0) 61 abejuenpesiq payejusou0g 
«i 120) Cece & = 1620) gee" (220) 990" we #(190:) Z0¢" souspusdep jeyeds 



















‘UZ “PIS 'J90D ‘dF “PIS $909 “UF “PIS $809 ‘dF “PIS ‘$909 
ULOQUBION [euuoju; suOoleziueb1GQ aAissaidxy suoNoYy Bulajos-wajqoig suoeziuebic jejuawinsjsu| 


(p) (¢) (2) 0) 


ajqeue, juapuadapuy 








Adaang AyTUMUIWIOD) NOCGHd S661 PUP SNsUdD 066T ‘SPUI0D}NO UOHLZTUeZIC [eID0S pajsn{py-WWTH Jo Sfepo| UorssaiSay 3eT [eneds Woy syus1yF207 :9 a]qRI 


1686 * Social Forces Volume 84, Number 3 * March 2006 


likely to engage in expressive forms of social organization motivated by social or “neighborly” 
concerns. To the contrary, we find that concentrated disadvantage is associated with 
increased participation in neighboring activities (informal expressive organization) and that 
there is no significant association between disadvantage and participation in formal 
expressive organizations. We find more support for the urban poverty thesis in the case of 
instrumental forms of community participation, which are motivated by functional and 
political concerns. In this case, the association with disadvantage is nonlinear: although 
increases in disadvantage are associated with higher rates of participation in formal and 
informal organization, beyond a certain threshold of disadvantage, further increases in 
disadvantage are associated with diminishing rates of community participation. In short, we 
find limited support for the urban poverty hypothesis in the case of instrumental participation 
and only among the most highly disadvantaged neighborhoods. 

It is also noteworthy that higher levels of individual-level SES are associated with increased 
levels of instrumental participation. Thus, it may be individuals of higher SES who are 
mobilizing community organizations to address community social needs, even in 
disadvantaged neighborhoods, an idea that is suggested by some theories of collective action 
(e.g., Oliver, Marwell and Teixeira 1985). On the other hand, individual-level SES measures are 
generally not significantly related to expressive participation, which may reflect that the skills, 
knowledge and resources associated with higher individual SES are more relevant for taking 
instrumental actions to address neighborhood problems than they are for expressive actions. 

Second, our findings suggest that one explanation for why rates of instrumental participation 
are higher in disadvantaged neighborhoods is that residents are motivated by the signs of 
disorder they perceive in these areas. A limitation of our analysis is that our measures of disorder 
were constructed by aggregating survey responses of neighborhood residents. Previous 
research has shown that neighborhood racial composition can affect residents’ perceptions of 
disorder, such that people tend to perceive more disorder in neighborhoods where more African 
Americans live (Sampson and Raudenbush 2004). We addressed this concern in supplemental 
analysis of instrumental participation by adding a control for the neighborhood's proportion of 
African-American residents and found that our results did not change.'® 

Third, our analysis considers the impact of residential stability on community participation 
and refines the systemic model. Consistent with previous research, we found that residential 
Stability appears to encourage informal interactions among neighbors and participation in 
expressive organizations. However, we also found that residential stability does not predict 
instrumental participation. Intuitively, this finding makes sense because we would expect 
residents of stable neighborhoods to interact more with their neighbors, but we would not 
necessarily expect that neighborhood-level residential stability should make people more 
motivated to act on instrumental concerns, such as personal safety and property values. We 
also found that measures of individual-level neighborhood investment are more sturdy 
predictors of instrumental participation than neighborhood-level residential stability: both 
individuallevel length of residence in the neighborhood and home ownership are associated 
with higher levels of instrumental participation. 

Thus, our results indicate that neighborhood characteristics are related to instrumental and 
expressive forms of participation in different ways. In all but the most disadvantaged 
neighborhoods, instrumental participation appears to be motivated by “demand side” factors 
stemming from the more widespread perception of social and physical disorder in 
disadvantaged neighborhoods. Expressive participation, however, appears to be driven more 
by residentially stable environments (and perhaps the strong social networks they foster). 

A fourth finding is that participation in community organization is related not only to 
characteristics of the immediate neighborhood, but also to the wider spatial context of 
surrounding areas. Spatial dependence may reflect that the social processes that generate 
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participation in social organization “spill over” into adjacent neighborhoods. It may also arise 
from discrepancies between census definitions of neighborhood boundaries and those of 
survey respondents. In particular, the finding of significant spatial dependence for all forms of 
participation except engagement in problem-solving behaviors suggests that the underlying 
mechanism could be related to social networks. Neighborly interactions and participation in 
local formal organizations may rely more on social networks than problem-solving behaviors, 
which are very individualistic actions (e.g., contacting a politician or minister) that do not 
depend on strong social networks to make them happen. The spatial dependence of social 
organization suggests that even small, localized efforts to improve community participation 
may have effects that reach beyond the immediate neighborhood. 

Our analysis is limited in several ways. First, the findings are from a single city, making 
generalizability across diverse urban areas difficult. Second, it is possible that there are 
unmeasured confounding factors that predict whether a person will live in a given 
neighborhood and whether that person will participate in local social organization. Third, our 
measures of neighborhood-level institutions, physical disorder and social disorder were 
constructed using the same source of data (e.g., the PHDCN Community Survey) that 
provided information for the social organization outcomes. Using survey-based measures for 
both the independent and dependent variables can bias results, but we attempted to 
minimize this by aggregating individuals’ assessments of the presence of community 
institutions and disorder to the neighborhood level. 

Our results demonstrate the importance of distinguishing between various forms of 
community participation, with the distinction between instrumental and expressive forms 
being most valuable. Instrumental and expressive forms of social organization may have 
different capacities for solving community problems and stimulating community change. 
Some research suggests that communities characterized by instrumental participation are 
more successful in protecting their political and social interests than communities 
characterized by expressive participation (Logan and Molotch 1987; Guest and Lee 1983; 
Guest and Oropesa 1984). For example, Pattillo-McCoy’s (1999) research suggests that strong 
personal networks among residents in “Groveland” may actually detract from efforts to 
reduce crime because residents hesitate to report offenses committed by their neighbors’ 
family members. More instrumental forms of social organization, such as membership in a 
neighborhood watch group, might be more effective agents of social control in these 
contexts. Thus, participation in instrumental forms of social organization may have more 
potential to actuate neighborhood change than participation in expressive social organization, 
especially in poor neighborhoods. 


Notes 


1. There appears to be more consensus in the literature that individual-level socioeconomic 
status — particularly education — is associated with greater participation in various aspects 
of civic life, including volunteering, politics and participation in community organizations 
(Wilson 2000; Putnam 2000). 


2. For each of the count variables, respondents were assigned a missing value when they 
were missing at least one item in the scale. We also reran our models by recoding those 
missing values as a “0” (i.e., assuming that they did not participate) and found no changes 
to the sign or significance of the coefficients. For the informal neighboring scale, we 
assigned a missing value only when respondents were missing on all five items in the 
scale. 
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3. Whenever a respondent is asked a question which refers to his or her “neighborhood, 
survey protocol states. “By ‘neighborhood,’ we mean the area around where you live and 
around your house. It may include places you shop, religious or public institutions, or a 
local business district. It is the general area around your house where you might perform 
routine tasks, such as shopping, going to the park, or visiting neighbors.” We recognize 
that residents’ perceptions of what constitutes a “neighborhood organization” may vary; 
thus, we also ran models that predict participation in any instrumental or expressive 
organizations, including those that the respondent reported were outside her/his 
neighborhood. We found no differences in the sign or significance of any of the 
neighborhood-level predictors of interest. 


4. An alternative strategy would be to construct binary (rather than count) variables that 
measure whether the respondent belonged to any instrumental or expressive 
neighborhood organizations or engaged in any problem-solving actions. We prefer using 
count variables because (1) we utilize information on organizational membership rather 
than collapsing existing data into two categories, (2) most extant research on 
organizational participation uses count variables, and (3) this strategy substantively 
addresses the question of how much individuals participate, rather than simply whether 
they participate, enhancing face validity. 


5. We recognize that “watching someone's property,” an item in the informal neighboring 
scale, could be motivated mainly by instrumental concerns, so we replicated the analysis 
of informal neighboring without this item in the scale and found no change to the sign or 
significance of any of the coefficients. Still, we report results with this item in the scale in 
the interest of comparability with previous research (Sampson, Morenoff and Earls 1999). 


6. Previous research has shown that factor-weighted scales yield the same results 
(Sampson, Raudenbush and Earls 1997). 


7. EB residuals are defined as the least-squares residuals regressed toward zero by a factor 
proportional to their unreliability (Raudenbush and Bryk 2002). This approach corrects for 
bias in regression coefficients resulting from measurement error (Whittemore 1989) that 
could stem from the fact that these measures are based on only 25 to 40 respondents per 
neighborhood cluster and have differential missing data by items in the scales. We 
replicated all analyses using a more conventional approach that averages across 
responses of all individuals within a neighborhood and found no significant changes in the 
results of any models. 


8. One potential problem with our assumption of constant exposure is that the questions on 
organizational membership ask whether the respondent or anyone in the respondent's 
family is a member of the organization in question, and thus, exposure to organizational 
membership should vary by family size. However, the PHDCN Community Survey did not 
measure family size, so we could not run a variable exposure model. 


9. There is also a spatial error model, in which the autocorrelation process is modeled in the 
error term, as follows: Y = XB + Ne+é, where X is a matrix of exogenous explanatory 
variables with an associated vector of regression coefficients B, Nis the autoregressive - 
coefficient, € is a vector of error terms, and € is a random error term (Anselin 1988; 1995). 
Based on regression diagnostic tests (Anselin 1995), we concluded that the spatial lag 
models fit the data better. 
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. We adjusted the mean value of Y in neighborhood / according to the biasing effects of 


individual-level covariates, weighted by the neighborhood's relative composition on those 
covariates (Morenoff 2003). We used a model that contains only individual-level covariates, 
which were all centered around their neighborhood means in order to derive their within 
cles estimates (Raudenbush and Bryk 2002: 33), using the following equation: 

Y= Ni Y— EBwp xX pj ~ Xp) ], where Yi — Y represents the deviation of Y in 
neighborhood ; from the overall sample mean; % Xp represents the deviation of X in 
neighborhood ; fram the sample mean; and the Bp are the HLM within-neighborhood 
slope parameters from a regression where the Xpj erie rts are centered around their 
group ; means. 


. lables 2-4 report a proportion of variance-explained statistic for each model at the 


neighborhood level, but not the individual level, because the individual-level variance is 
heteroscedastic in nonlinear models (Raudenbush and Bryk 2002: 298). For the linear 
models in Table 5, we report the proportion of variance explained at both the 
neighborhood and individual levels. 


The high bivariate correlation between neighborhood-level measures of physical and 
social disorder (r = .90) prevented us from disentangling the effects of both types of 
disorder in a single regression model. We also examined the variance inflation factors for 
all independent variables to further investigate potential problems with multicollinearity. 
These statistics indicated that the correlations between both forms of disorder and 
concentrated disadvantage are high. (See the correlation matrix in Appendix B.) To 
investigate this issue, we conducted robustness checks for models containing 
concentrated disadvantage and physical or social disorder. We found that even after 
removing concentrated disadvantage from the models, the effects for physical and social 
disorder were very similar to those we report — neither varied in sign or significance. 


There is no quadratic term for concentrated disadvantage in model 1 because 
supplemental analysis revealed it was non-significant. We tested for quadratic effects on 
all individual- and neighborhood-level continuous variables and include quadratic terms 
when they are statistically significant. At the individual level, the effects of the years in 
neighborhood measure are noteworthy: there is a positive association between living in 
a neighborhood for more years and participation in problem-solving behaviors, expressive 
organizations and informal neighboring, but this relationship dissipates at higher levels of 
residential tenure. 


_Our findings on the contextual effects of Latino/immigrant concentration must be 


interpreted with caution because the PHDCN data set does not include an individual-level 
control for immigrant status. Thus, any significant effects of neighborhood 
Hispanic/immigrant composition could actually be reflections of individual-level 
associations between immigrant status and participation in local social organization. 


. Supplemental analysis revealed that after controlling for individual-level racial group 


membership, the neighborhood-level disadvantage coefficient became non- significant 
and its sign flipped from negative to positive, which differs from the findings for 
instrumental forms of social organization, where controlling for indicators of individual- 
level socioeconomic status (e.g., income, education and occupational prestige) changed 
the relationship between neighborhood-level concentrated disadvantage and 


participation. 
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16. The two-stage process of first adjusting for the potentially confounding effects of 
individual-level covariates and then running a spatial regression model on the adjusted 
dependent variable appears to work better with linear models than with nonlinear models. 
Thus, the results of our spatial model for informal neighboring (a continuous variable) in 
Table 6 are closely comparable to those from the HLM results for informal neighboring in 
Table 3. However, spatial model results for the count variables are not as consistent with 
their corresponding hierarchical generalized linear models. 


17. We combined physical and social disorder because their high correlation prevented us 
from including both in the same model. We used EB residuals to generate a scale of 
overall disorder using all six disorder items (neighborhood-level scale reliability is .89). 


18. We replicated our analysis using measures of disorder derived from PHDCN’s systematic 
social observation (SSO) — the same measures used by Sampson and Raudenbush (2004) 
—and found that the SSO measures of disorder were not significantly related to any forms 
of community participation. However, the SSO was only conducted in 80 of the 343 
neighborhood clusters in our study, and thus we do not report the results in our tables. 
Still, we note as a caveat that our findings regarding the association between disorder and 
participation in instrumental organizations pertain to collective perceptions of disorder by 
neighborhood residents and not necessarily to levels of disorder measured by trained 
observers. 
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Appendix A. Neighborhood- and Person-Level Descriptive Statistics: 1990 Census and 1995 
PHDCN Community Survey 


Independent Variable Mean SD Minimum _ Maximum __ 

Neighborhood-Level 
Stability Dur 1 -2.07 Zoe 
Concentrated Disadvantage 0 1 -1.16 3.89 
Latino/Immigrant Concentration 0 1 -1,92 3.52 
Density? 7,028 4,101 227 30,192 
Community Institutions 0 1 -3.16 208 
Physical Disorder 0 1 -1.96 2.68 
Social Disorder 0 1 -1.83 2.10 

Individual-Level 
Male At (.49) 0 1 
Female 59 (.49) 0 1 
Age 42.60 (16.73) 17 100 
Non-Hispanic White a2. (.45) 0 1 
African American 40 (.49) 0 1 
Latino 25 (.43) 0 1 
Other .08 (.27) 0 1 
Education 12.31 (3.12) 0 17 
Income (in thousands) 30.65 = (29.52) au 150 
Occupational Status 40.58 = (13.32) 17 86 
Single fad (.47) 0 1 
Married 37 (.48) 0 1 
Divorced Ae (.37) 0 1 
Widowed 14 (.35) 0 1 
Homeowner 45 (.50) 0 1 
Years in Neighborhood — 10.38 = (11.96) 0 81.5 
No. of Moves oo (1.38) 0 11 
Neighborhood Size 31.73 (29.66) 0 95 





Note: Descriptives are based on n = 7,729 respondents and 342 NCs. 
“Values reported are unstandardized. 
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The Presence of Organizational Resources in Poor Urban 
Neighborhoods: An Analysis of Average and Contextual Effects 


Mario Luis Small, Princeton University 
Monica McDermott, Stanford University 





Abstract 

Wilson (1987) and others argue that poor neighborhoods lack important organizational 
resources the middle class takes for granted, such as childcare centers, grocery stores and 
pharmacies. However, this approach does not distinguish poor neighborhoods from segregated 
neighborhoods, ignores immigration and neglects city differences. Using Department of 
Commerce and 2000 Census data for zip codes in 331 MSA/PMSAs, we estimate HGLM 
models predicting the number of each of 10 organizational resources. We find that, (1) on 
average, as the poverty rate of a neighborhood increases, the number of establishments 
increases slightly; (2) as the proportion of blacks increases, the number of establishments 
decreases; (3) as the proportion of foreign-born increases, so does the number of 
establishments. Finally (4), metropolitan context matters: poor neighborhoods have more 
establishments in cities with low poverty rates, and in cities in the South and West, than in 
other parts of the country. Findings suggest reevaluating the de-institutionalized ghetto 
perspective as a theory of the effects of black segregation and depopulation, rather than poverty 
concentration, and approaching neighborhood poverty from a conditional perspective. 


Introduction 


In Messner and Rosenfeld’s (2001) widely referenced book on crime, the authors quote a police 
officer who describes the physical conditions of a poor Chicago neighborhood: “Do you see any 
hardware stores? Do you see any grocery stores? Do you see any restaurants? Any bowling 
alleys? There is nothing here.... Everything we take for granted — a laundromat, a cleaner’s, 
anything. It’s not here.” (2001:33) The description captures a common conception of poor urban 
neighborhoods (Wilson 1987, 1996; Goering and Feins 2003): that they are scarce in the 
organizational resources basic to day-to-day living that the middle class takes for granted, such 
as grocery stores, hardware stores, pharmacies, laundries and childcare centers. As Wilson 
(1995:9-10) argued, “poverty in ghetto neighborhoods has sapped the vitality of local business 
and other institutions, and it has led to fewer... movie theaters, bowling alleys, restaurants, 
public parks and playgrounds, and other recreational facilities.” Is this actually the case? 

The question speaks to pressing sociological and policy concerns. Researchers have 
suggested that businesses and organizations contribute to the vitality of neighborhoods, 
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increase informal social control, help keep crime at bay, and even contribute to health. 
Peterson, Krivo and Harris (2000) found that a greater number of recreation areas reduces 
violent crime rates in very poor neighborhoods. Yen and Kaplan (1999) found that the 
prevalence of supermarkets and clinics is positively associated with health and well being. If 
these resources were, in fact, scarce in poor neighborhoods, they would constitute an 
important intermediary mechanism for the relationship between neighborhood poverty and 
lower life chances — part of the “black box” of neighborhood effects (Sampson, Morenoff and 
Gannon-Rowley 2002; Small and Newman 2001; Jencks and Mayer 1990). 

In addition, answering this question would help understand the implications of a major 
ongoing policy experiment. Moving to Opportunity (MTO), a federal housing program that 
provides vouchers for residents seeking housing assistance, assesses the effect of moving to 
middle-class neighborhoods (Goering, Feins and Richardson 2003):-MTO analysts have 
speculated that these neighborhoods have greater organizational resources than poor areas 
(Ludwig, Duncan and Ladd 2003:156; Ellen and Turner 2003:325). 

Despite the significance of our question, few published studies have answered it, and most 
of these examine a few resources, such as childcare centers or supermarkets, and focus only 
on a few cities or towns. No published studies have tested the proposition on a large cross- 
section of metropolitan areas and with respect to multiple resources. We examine this question 
based on data from each of 10 organizational resources in every populated zip code in 331 
metropolitan areas in the United States. We estimate hierarchical generalized linear models 
predicting the number of establishments, and examine both average and contextual effects. 


Literature and Hypotheses 


By “organizational resource” we refer to any establishment that has a physical location and 
offers services or sells goods basic to day-to-day living.' Because our empirical tests focus on 
for-profit establishments, the literature review centers on businesses — not on non-profits, 
state-funded organizations or public areas such as parks. 


De-institutionalization 


The relationship between neighborhood poverty and resources is discussed in several bodies 
of work. It finds one of its clearest theoretical expositions in the de-institutionalization 
argument of Wilson (1987, 1996; Wacquant and Wilson 1993). For Wilson (1987, 1996), the 
de-institutionalization of neighborhoods is the product of middle-class flight and the ensuing 
concentration of poverty. Because middle-class residents economically sustain neighborhood 
establishments, their absence undermines the latter's stability and survival. 

This work is a refined expression of several elements of the early ecological theories of the 
Chicago School (Park, Burgess and McKenzie 1925). The latter viewed cities as fundamentally 
market-driven entities in which competition among businesses and groups resulted in natural 
areas — specifically, a set of increasingly affluent, concentric circles that radiated outwardly 
from a central business district. (See Sampson and Morenoff 1997 for a recent critique.) One 
Strand of the Chicago tradition was the social disorganization perspective (Shaw and McKay 
1969), which, as part of a theory of cross-sectional neighborhood differences in crime, 
posited a relationship between neighborhood conditions and the presence of organizations 
and businesses. Poor, ethnically heterogeneous, residentially unstable neighborhoods were 
unable to sustain businesses and organizations because they lacked economic stability and 
social organization. 
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Wilson’s work reflects these origins in two ways. First, it relies on a market-driven 
approach to business viability, which is thought to depend primarily on the presence of 
middle-class constituents or patrons in the neighborhood. The presence of customers 
determines viability? Second, it is based mostly on empirical research in the city of Chicago, 
with little assessment of the impact of variable conditions across cities. This perspective 
informs the first hypothesis: 


H1: The number of establishments in the neighborhood will decrease 
as the poverty level increases, other factors held constant. 


Some evidence supports the hypothesis. Ethnographic works in Chicago and Philadelphia 
depict neighborhoods deprived of businesses and organizations (Anderson 1999; Duncan 1987). 
Most quantitative tests of this hypothesis have focused on particular establishments rather than 
businesses as a whole. For example, several studies have focused on childcare centers. 
Consistent with the hypothesis, Siegel and Loman (1991 :28) find that zip codes in Illinois with the 
“highest concentrations of low-income families” are less likely than others to have childcare 
centers. Queralt and Witte (1998) find that the supply of full-day centers in suburban 
Massachusetts is significantly lower in “socioeconomically distressed neighborhoods.” (1998:40) 
Others have studied financial institutions. Pollard (1996) finds that the number of banks in South 
Central Los Angeles decreases as the proportion of poor and non-white residents in the 
community increases. However, Graves (2003), using data from metropolitan Louisiana and Cook 
County, Illinois, uncovered that “payday lenders” such as check-cashing establishments are 
especially likely to be found in low-income, non-white neighborhoods. Studying grocery stores 
and restaurants in four states, Morland et al. (2002) find that the number increases as the median 
house value and number of white residents increases. Finally, research on “negative” businesses 
such as bars and liquor stores finds that poverty increases prevalence (Morland et al. 2002). All of 
these studies, however, are based on a handful of cities or regions. 


Poor Neighborhoods vs. Black Neighborhoods 


The de-institutionalized ghetto perspective has become a theory of the effects of 
neighborhood poverty. However, given: some of the data it embraces, the theory may also be 
construed as one on the effects of poverty concentration in black neighborhoods. Citing 
research in Chicago by Drake and Clayton (1945), Wacquant and Wilson (1993) suggest that 
predominantly black neighborhoods in the past were racially segregated but class integrated, 
resulting in organizationally rich and stable communities. The reduction in structural barriers 
as a result of civil rights mobilization in the 1960s and 1970s allowed black middle-class 
residents to move out of the inner-city, resulting in a concentration of black poor left behind 
(Quillian 1999; Jargowsky 1997). As Wilson (1996:54) writes, the “absence of working- and 
middle-class blacks... deprives ghetto neighborhoods of key resources, including structural 
resources... such as residents with income to sustain neighborhood services.” This theory is 
based on conditions relating to the particular history of African Americans. Thus, one could 
hypothesize: 


H2: Other factors held constant, as the proportion of residents who 
are black increases, the number of establishments in the 
neighborhood will decrease. 


In fact, it is possible that proportion black accounts for much of the negative effect of 
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proportion poor. In addition, there could be an interaction effect: 


H3: Other factors held constant, as the proportion of residents who 
are black increases, the negative association between neighborhood 
poverty and the number of establishments will increase.* 


There are few tests of these particular hypotheses. Wacquant and Wilson cite 
ethnographic work in Chicago by Duncan (1987), who reports that residents’ depictions of 
black communities speak of “how the street used to be filled with stores, theaters and 
nightclubs.... [Today] [s]ome stores, currency exchanges, bars, and liquor stores continue to 
exist... [Yet] [i]n terms of physical structures, many stores are boarded up and abandoned. A 
few buildings have bars across the front and are closed to the public:” (Wacquant and Wilson 
1993: 31) Anderson (1999) reports similar findings in Philadelphia. We know of no statistically 
representative tests of these propositions. 


Immigrant Entrepreneurship 


A second reason to question the de-institutionalized ghetto perspective stems from the 
research on immigrant enclaves. Ethnic enclave theory has argued that businesses in urban 
neighborhoods flourish as a result of immigrant entrepreneurship (Portes and Bach 1985; 
Aldrich and Waldinger 1990). Urban immigrants, who often have few socio-economic 
resources, are theorized to avoid the secondary labor market of low wages and dead-end jobs 
by finding jobs or becoming entrepreneurs in the enclave economy (Kasarda 1989). This 
economy is partly characterized by the spatial concentration of businesses even in high 
poverty neighborhoods. The underlying assumption is that although customers matter, 
entrepreneurs are critical — their presence underlies the prevalence of establishments in 
neighborhoods.’ 


Two interrelated hypotheses follow: 


H4: The number of establishments will increase as the proportion of 
residents who are foreign-born increases, other factors held constant. 


A related argument would qualify the impact of poverty, suggesting that the high presence 
of immigrants attenuates the expected negative effects of poverty. Thus: 


H6: As the proportion of residents who are foreign-born increases, 
the negative association between poverty level and the number of 
establishments will decrease, other factors held constant. 


Researchers have found evidence consistent with this perspective. Research on the spatial 
concentration of ethnic businesses has analyzed enclaves at the metropolitan level, finding that 
clustering of ethnic businesses provides both a labor pool and a ready market (Logan, Alba and 
McNulty 1994). Klinenberg (2002) compares predominantly (native) black and predominantly 
Mexican poor neighborhoods in Chicago and describes greater prevalence of establishments in 
the latter. However, the test relevant to our question has not been undertaken. Most studies 
examine the propensity for entrepreneurship among immigrants, not the effect of immigrant 
presence on the number of establishments. Thus, though there is strong evidence that immigrants 
have higher self-employment rates than non-immigrants (Kasarda 1989: Light and Rosenstein 
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1995), it has not been shown that their presence increases the number of establishments in 
neighborhoods, poor or otherwise. For example, immigrant entrepreneurs may constitute too 
small a portion of neighborhood populations to affect the total number of establishments or their 
businesses may be located in neighborhoods different from where they live. 


The Metropolitan Context 


A final issue is that the presence of establishments in neighborhoods may depend on the city. 
Wilson (1987, 1996) and others (e.g., Jencks and Peterson 1991; Danziger, Sandefur and 
Weinberg 1994) have suggested that city-level conditions, such as economic shifts from the 
manufacturing to the service sector and their impact on low-skilled job availability affect the 
neighborhood concentration of poverty. However, the perspective has largely ignored 
whether and how city conditions directly affect the prevalence of establishments in 
neighborhoods, focusing instead on the impact on the number of jobs and/or average wages. 
Metropolitan conditions are implicitly assumed to affect the number of establishments only 
indirectly, by increasing the concentration of poverty. As a result, the perspective does not ask 
whether very poor neighborhoods will resemble each other across different cities. That this 
issue has not been addressed may be due to the theory’s reliance on empirical work 
conducted in a single city, Chicago (Sampson and Morenoff 1997). 

The Chicago School in general has been criticized at length for theorizing about “the city” 
on the basis of Chicago research (Dear 2002; Samspon and Morenoff 1997; Waldinger and 
Bozorgmehr 1996). A recent attempt to address this problem empirically was the Multi-City 
Study of Urban Inequality, which, based on major surveys conducted in Atlanta, Boston, 
Detroit and Los Angeles, examined the spatial concentration of poverty, networks, 
employment and discrimination from a comparative perspective (O'Connor, Tilly and Bobo 
2001). Nevertheless, none of these works has tested specifically whether the presence of 
establishments in poor neighborhoods varies significantly across cities (Jargowsky 1997). 

Most neighborhood poverty research assumes and tests only an overall neighborhood 
poverty effect, uncovered with nationally representative datasets such as the Panel Study of 
Income Dynamics or the National Longitudinal Survey of Youth (e.g., Brooks-Gunn, Duncan 
and Aber 1997a, 1997b). Often implicitly, the research assumes a single neighborhood effect 
“size” regardless of urban context. We hypothesize that, depending on the city, the poverty 
effect may be large, small or even positive. Indeed, finding an association, at the national 
level, equal to zero may simply mask significantly negative associations in some cities and 
significantly positive ones in others. We present two hypotheses: 


H6: The average number of establishments per neighborhood will 
vary significantly trom city to city, other factors held constant. 


H7: The association between proportion poor and the number of 
establishments in the neighborhood will vary significantly from city to city. 


Finally, we expect this variation to be associated with demographic and economic factors. 
Underlying this expectation is the notion that there is no single “neighborhood effect,” 
positive or negative, only effects conditional on metropolitan context. We hypothesize: 


H8: The association between proportion poor and the number of 
establishments will depend on the city’s demographic and economic 
conditions. 
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Specifically, we examine the effects of city-level proportion black and proportion immigrant, 
population density, the poverty and unemployment rates, and region. 


Data and Approach 


Data 


We test these hypotheses employing a dataset compiled from several sources. The County 
Business Patterns section of the U.S. Census assembles data from several governmental 
sources on all businesses and organizations in the United States that have a payroll. 
Businesses and organizations are identified by more than 1,000 North American Industry 
Classification System (NAICS) codes. The office makes some of these data, such as the 
number of businesses, available at the zip code level. No data are available below the zip code 
level. We obtained the data for the year 2000. 

Our second source was the 2000 U.S. Census, Summary File 3, which contains 
demographic data at the zip code level and the level of the Metropolitan Statistical Area (MSA) 
and Primary Metropolitan Statistical Area (PMSA).° Because zip codes are U.S. postal 
designations with no natural relationship to MSA/PMSAs, they were matched to metropolitan 
areas using GIS technology, following a “centroid” approach (using Arcview). We calculated 
the geographic center of every zip code, and matched the zip code to the metropolitan area 
that contained that center. Zip codes partly within metropolitan areas but with centers outside 
those boundaries were excluded. The result is a two-level dataset containing a tabulation of 
every establishment with a payroll, by zip code (n = 13,736), for every metropolitan area (n = 
331) in the United States. 

Two data issues are important. First, the census does not make available at the zip level 
any information on establishments that have no payroll. For example, if a couple opened a 
bakery and hired only their child to help occasionally, this store would not be included unless 
the child was on a formal payroll. (The census labels these establishments “non-employers.”) 
The absence of these data could lead to an under-estimation of the number of resources in 
poor neighborhoods. We assess this possibility in our analysis. Second, the smallest 
geographic unit with available data is the zip code. Depending on the issue, some zip codes 
may be too large. For studies of neighborhood socialization in poor areas, tracts, with 
populations of about 4,000 are probably the most appropriate (among the publicly available 
data sources for multiple cities). For example, in studies of the influence of educated adult 
role models for youth, zip codes are probably too large (Cutler and Glaeser 1997). For studies 
of economic issues, such as job availability and industry sector dominance, tracts are 
probably too small and zip codes may be more appropriate because regions of economic 
activity are wider than those of social neighborhood activity. For example, for supermarkets, 
the modal number per tract will often be zero, given tracts’ small size. Thus, the use of Zip 
data as opposed to tract data may be increasingly appropriate as we are concerned with 
economic well-being and increasingly inappropriate as we are concerned with social well- 
being. We assess this issue in our analysis. 


Analytical Strategy 


There are three issues to determine: the organizational resources to be selected: the dependent 
variables of interest, and the specific models to be estimated. We begin with the first. 
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There are more than 1,000 types of establishments in all sectors and industries in the 
NAICS system. The vast majority - establishments including mining companies, gun 
manufacturers and golf courses — bear no substantive relationship to our question. Based on 
the literature, we employed three criteria for selection: (1) the establishment should be an 
organizational resource basic and important in day-to-day living (this would exclude liquor 
stores and bowling alleys); (2) it should have no a priori class association (this would exclude 
high-end boutiques and dry cleaning services, though not coin-operated laundries that 
happen to provide dry cleaning); (3) it should not be a non-profit organization or a 
governmental entity. (Many non-profits target poor neighborhoods [Cohen 2001]; this would 
also exclude parks and public pools.) Furthermore, for reasons of space and manageability, 
we limited the number of resources to the 10 we identified as most pressingly in need of 
empirical assessment. The resulting list follows: banks, childcare centers, convenience 
stores, credit unions, pharmacies, hardware stores, laundries (coin-operated), grocery stores, 
grooming stores (barbershops, nail salons and beauty salons), and restaurants.® We selected 
the 10 establishments before running empirical tests. 

The top panel of Table 1 exhibits the summary statistics for the organizational resources. 
For hardware stores, credit unions, banks and laundries, the median number of resources is 
zero; for convenience stores and pharmacies, it is 1, while for restaurants, the highest, it is 
5. There is high variability across the organizations, as the highest number of hardware stores 
is 11, while that of restaurants is 376. Note that these are raw figures, unadjusted for the zip 
code’s area (which, as shown in the bottom panel, varies widely). There is also great variability 
in each organization across zip codes; in every case, the standard deviation is higher than the 
mean. Estimates will account for the high variances. 

Our dependent variable is the number of organizations of each type in the zip code. Some 
scholars have suggested that poor neighborhoods have not only fewer resources but also 
resources of lower quality (Wilson 1995:9-10). Though quality is important, we only address 
prevalence in this study. 

The third issue is our estimation strategy. We estimate hierarchical generalized linear 
models (HGLM) in which the outcome variable is modeled as a Poisson process with over- 
dispersion (Snijders and Bosker 1999; Raudenbush and Bryk 2002; Long 1997; McCullagh 
and Nelder 1989). The number of establishments is modeled as a function of both zip-level 
and metropolitan-level variables. See Appendix for details of model specification. Our 
independent variables are listed in the bottom panel of Table 1. The main ones are (1 ) the 
proportion of all persons in the zip code who are poor, (2) proportion black, and (3) 
proportion foreign born. The poverty slope is estimated as a random effect. We also 
control for the following: (4) population density (logged), because some neighborhoods 
are primarily residential, while others are primarily business zoned; (5) residential 
instability, measured as the proportion of residents who were living in a different house 
than they had been in 1995; (6) central city location, because neighborhoods located in 
inner city areas may have more businesses than those in suburban sections of 
metropolitan areas (see Jackson 1985)’; and (7) ethnic make-up, measured as proportions 
non-Hispanic black, non-Hispanic Asian and Hispanic. These controls are modeled as 
“fixed” effects (i.e., constant across cities) (Raudenbush and Bryk 2002). We also include 
interaction terms between percent poor and percent black (to test H3) and between 
percent poor and percent foreign (to test H5).° While including appropriate controls, we 
avoid the “kitchen-sink” approach to controls because it may produce well known (though 
often ignored) estimation problems (Lieberson 1985). In the first portion of the analysis, all 
level-1 variables are centered on the group mean. Thus, coefficients for level-1 predictors 
refer to deviations from the city mean. Since our main interest is in the presence and 
direction of statistical associations with poverty, we present most findings in terms of 
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Table 1: Summary Statistics for Variables Employed 








Dependent variables* Median Min Max Mean/%  S§.D. N 
Hardware stores 0 0 11 .69 TOQUE TSE 
Grocery stores iz 0 58 3.62 490 13,736 
Convenience stores 1 0 42 1.62 Z.30, sigue 
Pharmacies 1 0 195 2.19 U2 (eee tanuce 
Savings Banks 0 0 92 .93 1 05 tai 4 3-238 
Credit unions 0 0 15 83 PSSael ats, TSE 
Childcare centers Z 0 51 3.70 439 Pela ae 
Restaurants a 0 376 9.08 1280 Slaton 
Laundries 0 0 18 16 ls BE 
Grooming stores 2 0 153 Die Lidked uboeor 

Independent variables* Mean/% _ S.D. N 

Zip-level 
Area (as exposure measure) 44.70 105.95 13,73€ 
Percent poor 10.80 GO frag lASh 36 
Percent foreign born 88 210.67 RIS SE 
Percent Latino 8.99 TRIZIP MS TSE 
Percent non-Latino white 76.37 20.22 Val oe 
Percent non-Latino black 9.54 Lie coe 
Percent non-Latino Asian 2.60 5.40  13,73€ 
Percent moved in previous 5 years 43.72 12:34) afd, 3%E 
Population density, logged 6.18 ZO2 MARTE 
Located in central city (1/0) 20.5 13,37€ 

MSA/PMSA-level 
Percent poor 12.39 4.38 331 
Percent foreign born 7.38 743 331 
Unemployment rate 5.74 1.74 331 
Percent non-Latino black 10.33 10.55 331 
Population density, logged 5.52 97 331 
South 39.9 331 
Northeast 18.1 331 
Midwest 22.7 331 
West 19.3 331 


——— 
* The summary statistics are not adjusted for zip code area. 
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coefficient sizes and associated standard errors. The Poisson coefficient represents the 
increase in the log odds of the outcome variable associated with a one unit increase of the 
predictor. The anti-log of the coefficient is a multiplier of the expected number of 
organizations associated with a one-unit increase in the predictor (Long 1997).° 


Results 


Main Effects” 


Figure 1 displays the mean number of organizations of each type by zip poverty level. The zip codes 
are sorted by poverty and divided into quintiles; the top quintile shows the average number of 
establishments in. poorest fifth of all zip codes. At the base of the figures are zero-order correlations 
between the zip poverty rate and number of establishments. The figure shows small, positive 
relationships between poverty and presence of resources for most of the organizations. (All 
correlations are statistically significant at the .01 level.) For eight of the resources, the relationship is 
positive; for banks and grooming stores, it is negative. In poorer neighborhoods in the nation’s 
metropolitan areas, there are fewer banks and grooming stores but more hardware stores, grocery 
stores, convenience stores, pharmacies, credit unions, childcare centers, restaurants and laundries. 

Table 2 exhibits the coefficients for percent poor, percent black and percent foreign-born on 
the log of the expected number of establishments, after controls. All coefficients, except 
poverty, are entered as fixed; the intercept is entered as random. The table presents unit-specific 
models, which show the expected change in log number of establishments associated with a 
one-unit change in a given predictor, after holding constant the other predictors and the random 
intercept (Raudenbush and Bryk 2002:334). The results provide little support for the de- 
institutionalized ghetto perspective as an overall depiction of poor neighborhoods. A 1 
percentage point increase in proportion poor slightly increases the log number of resources in 
the zip code, depending on the resource, by between .019 (for restaurants) and .039 (for 
convenience stores). This translates into a 1.9 percent increase in the number of restaurants to 
a 4 percent increase in convenience stores. For childcare centers and grooming stores, the 
association is not significant. (Because these figures are based on models with interaction 
terms, it is important to be clear on the meaning of the random coefficients. For example, in the 
case of restaurants, .019 is the portion of the association with proportion poor that does not 
depend on [i.e., interact with] either proportion foreign-born or proportion black.") Because we 
do not have data on establishments that have no payroll (“non-employers”), this is a conservative 
test of the poverty effect. If these particular, informal establishments are more likely to be found 
in poor neighborhoods, then the positive poverty effect is expected to be higher. 

Table 2 shows that proportion black is consistently associated with fewer resources, 
suggesting that Wilson's account is a story of segregated black neighborhoods, not high poverty 
neighborhoods. The coefficient ranges from a .006 to a .019 reduction in the log number of 
establishments. The exception is childcare centers, where the log number of establishments 
increases by .006, which translates into .6 percent. The interaction terms suggest that, for a few 
resources, proportion poor and proportion black interact, such that high poverty neighborhoods 
have the expected fewer number of resources if they have high proportions of African Americans. 

Finally, Table 2 shows a positive and statistically significant association with proportion foreign 
born. As shown, a 1 percentage point increase in that variable increases the log number of 
resources, between .004 for pharmacies and .010 for banks. (These coefficients reflect the 
portion of the foreign-born association that does not depend on proportion poor.) The exceptions 
are childcare centers and restaurants, for which the proportion foreign-born coefficient is 
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negative, and credit unions and convenience stores, for which it is not significant. With respect to 
the poverty-foreign-born interaction, the interpretation must take into account the uncovered 
poverty association. As shown in the fifth row of the table, the interaction is negative and 
significant in all but two cases. This suggests that a high proportion of immigrants might reduce 
the number of establishments when they are very poor, lacking the minimum capital to generate 
the organizational resources hypothesized by enclave theory." 


Size Matters 


With respect to size, Table 2 may mask important relationships. In the table, both a hardware 
store with 10 employees and a large Home Depot with several hundred would be counted 
as one establishment. However, it is possible that the relationship between poverty and the 
presence of establishments is different among small hardware stores and large chains. Large, 
established stores may steer clear of high poverty areas. This possibility is considered in Table 
3.'"° We divided establishments into three categories based on number of employees: small (1 
to 19 employees), medium (20 to 99), and large (100 or more). We find that as neighborhood 
poverty increases, the number of small establishments increases, except for childcare centers 
and grooming stores. The middle panel shows that the relationship between neighborhood 
poverty and the number of mid-sized establishments is mixed. In five cases, it is positive; in 
two cases, again childcare centers and grooming stores, the effect is negative and statistically 
significant; in the rest, it is not significant. The bottom panel shows that the relationship is 
mixed among large establishments as well. Large grocery stores and large laundries are, 
indeed, scarcer in high poverty neighborhoods; however, none of the other resources are, and 
large banks, credit unions and childcare centers are especially likely to be found there. Thus, 
the findings suggest that the associations are most consistently positive for small 
establishments, and mixed or insignificant for others. 


Contextual Effects 


All analyses to this point have focused on average effects across cities, statistically controlling 
for city differences but also disregarding them. These figures may mask important contextual 
factors. For example, Table 2 may show the (average) poverty coefficient for number of 
childcare centers to be statistically insignificant even if it is positive and statistically significant 
in some cities, but negative and statistically significant in others. Below we examine between- 
city variation in two ways. 

First, we assess whether the number of establishments in the average neighborhood 
(regardless of its poverty level) varies from city to city, the issue posited in Hypothesis 6. Table 
4 presents the proportion of the total variance in the expected log number of establishments 
that is accounted for by the between-city variance based on a zip code with the mean log 
number of establishments (Snijders and Bosker 1999). (These are area-adjusted.) The 
overwhelming majority of the variance in the number of each organizational resource (for the 
average zip code) is taking place within cities, not between them." In only one organization, 
hardware stores, the between-city estimate for an average neighborhood is more than 10 
percent. In the rest, it is below that figure, and seven organizations have figures below 5 
percent. Thus, the distribution of organizational resources across zip codes (at the mean) - 
does not depend very much on metropolitan characteristics; rather, it is associated with Zip- 
level factors. It is noteworthy that our unit is the zip area. The smaller the geographic unit the 
higher the expected variance, which suggests that the between-city variances might, in fact, 
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Table 4: Proportion of Variance in Log Number of Establishments Taking Place between 
Metropolitan Areas (Based on Values at the Mean)* 





Variance Percent Between 
Component Metropolitan Areas 

Hardware Stores 
between .657 10.56 
within 5.563 

Grocery Stores 
between 803 2.76 
within 20.011 

Convenience Stores 
between 1.006 5.60 
within 16.951 

Pharmacies 
between 075 38 
within 19.857 

Banks 
between .097 24 
within 40.380 

Credit Unions 
between 212 1.10 
within 24.487 

Childcare Centers 
between 143 3.12 
within 23.105 

Restaurants 
between 581. OZ 
within 111.452 

Laundries 
between 941 9.11 
within 9.395 

Grooming 
between .980 1.99 
within 48.160 


ee 


* In linear hierarchical models, this figuré would be the intraclass correlation. It denotes the 
proportion of the total variance accounted for by the group level; it is also the correlation between 
the values of two randomly drawn zip codes in a randomly drawn MSA/PMSA (Snijders and 
Bosker 1999:16ff). In the linear case, ICC = Var(u,)/(Var Uy;) + Var(r;,)). However, in Poisson 
models the level-1 variance is a function of the expected mean. Specifica y, in our case, Var(r;.) = 
o-’/(m,);;). Thus, there is no single ICC as such ( serman Rodriguez, personal communication). 
The figures above are the within and between variances when evaluated for a zip code with the 
average log number of establishments and average exposure. 
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be lower if units such as census tracts were employed. 

The second issue regards the slopes for poverty. The figures in Table 4 say little about the 
association between poverty and the number of establishments, which could, in fact, vary 
widely by city. (This refers to Hypothesis 7.) We examine this possibility in Table 5, which 
shows the “random effects” for the poverty coefficient; that is, it shows the variance across 
cities in the unit-specific slopes for the variable. The models are identical to those in Table 2. 
As shown, for most organizational resources, the random effect is statistically significant: 
cities differ significantly in their slopes for neighborhood poverty. 

To present these relationships more intuitively, we show the two standard deviations 
across cities around the level-1 slope for poverty. Figure 2 displays the mean poverty slope 
and the interval at two standard deviations (not standard errors) above and below the mean 
slope across cities; it presents the range of coefficients across 95 percent of the cities. 
Consider hardware stores. The average effect of zip-level poverty on the log number of 
hardware stores in the zip code is .024. In the city with the highest effect (among 95 percent 
of cities), it is .034; in the city with the smallest effect, it is .015. 

There are three issues to examine in Figure 2. The first is whether the variation across 
cities in the poverty slope is large or small. If it is very small, this would suggest a remarkably 
consistent relationship, across cities, between neighborhood poverty and the number of 
establishments. The second is whether, for any given establishment, the slope is negative 
for a substantial portion of the 330 metropolitan areas, regardless of whether the average 
slope is positive. If so, this would offer support for the notion that poor neighborhoods are 
deprived of these institutions, but only among a substantial minority of cities. The third is to 
compare the standard deviations across organizations to determine whether the variability in 
the poverty slope depends on the establishment. 

For most establishments, there is considerable range across cities in the poverty slope. For 
six of the establishments, the effect of poverty is positive in all cities. For the other four, it is 
negative in at least some cities; for banks, childcare centers and grooming stores, it is 
negative in either a large minority or a small majority of cities, as expected by the de- 
institutionalized poor neighborhoods perspective. This suggests that whether high poverty 
neighborhoods are deprived of organizations or not depends greatly from city to city. 


Contextual Effects of Level-2 Variables 


Our last set of models examines metropolitan-level predictors of the poverty slope. In Table 6, we 
examine the impact of metropolitan-level economic and demographic factors on the 
neighborhood poverty slope. These models differ slightly from the previous ones. In the previous 
analyses, we were interested in comparing poor to less-poor neighborhoods within a given city; 
our current aim is to compare poor neighborhoods in one city to equally poor neighborhoods in 
others. Thus, Table 6 presents models in which all level-1 variables are centered on the grand 
mean and shows population average models with robust standard errors (Raudenbush and Bryk 
2002). We examine several variables at the metropolitan level: metropolitan proportion black and 
proportion immigrant, the poverty rate, the unemployment rate, population density (logged), and 
region (with South as the omitted category). We estimate the effects of these variables on the 
intercept (i.e., the number of establishments in the average neighborhood) and on the poverty 
slope. For simplicity, Table 6 only shows the effect on the poverty slope. 

As shown in the table, the most consistently significant variables are the metropolitan 
poverty rate and region. High poverty rates at the metropolitan level may be a proxy for poor 
economic conditions such as a city-wide recession, large budget deficits, low wages or slow 
economic growth. Under these conditions, establishments in high poverty neighborhoods are 
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likely to do worse than under robust economies. In cities in the Northeast and Midwest, the 
association between neighborhood poverty and the number of establishments is either 
smaller (if positive) or more negative than in cities in the South and (often) the West. This 
suggests that the poverty perspective is most likely to be accurate among rustbelt cities, 
consistent with prior work (Jargowsky and Bane 1991). 


Discussion 


This study is the first test of the de-institutionalization hypothesis based on nearly all U.S. 
metropolitan areas and a large sample of establishments. We have presented a number of 
findings. Because of their implications, we outline them clearly below, categorized into 
average and contextual effects. 


Average Effects 

¢ Contrary to expectations, as the poverty rate of a neighborhood increases, the number of 
establishments increases. 

e As the proportion black increases, the number of establishments decreases. 

¢ As the proportion foreign-born of a neighborhood increases, the number of establishments 
increases. 

* For small establishments, the number of establishments increases with neighborhood 
poverty; for larger ones, the relationship is mixed — notably, the number of large grocery and 
large laundries decreases with neighborhood poverty. 


Contextual Effects 
* The relationship between neighborhood poverty and the number of establishments 
depends significantly on the city, such that it is more positive in some cities than others, 
and even negative for a few resources in some cities. 
¢ Poorer neighborhoods have more establishments in cities with stronger economic 
conditions and in the South and West. 


Collectively, these findings add subtlety to sociological theories about the effect of 
neighborhood poverty and to policy considerations regarding the status of poor 
neighborhoods. Below we expand on our theoretical and policy contributions. 


Contributions to Theory 


This paper's first theoretical contribution is the documentation of a conditional perspective on 
the impact of neighborhood poverty (Small 2004). There are at least two different 
assumptions about the impact of neighborhood poverty and the conditions of poor 
neighborhoods. In one, neighborhood poverty is an institution (“the ghetto”) with a set of 
expected characteristics (graffiti, unemployed young men assembled on street corners) that 
repeat themselves from city to city; in another, neighborhood poverty is a condition that only 
leads to certain outcomes (whether the presence of graffiti or the absence of organizational 
resources) when it occurs in conjunction with other factors. The first model assumes that the 
ghetto is an institution much like an ethnic enclave, with a predictable set of attributes such 
as a paucity of organizational resources. The second model assumes that neighborhood 
poverty is conditional, such that there is no “neighborhood poverty effect,” positive or 
negative, large or small, independent of city context. Much of the research follows the first 
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theoretical assumption (e.g., Jencks and Peterson 1991; Brooks-Gunn, Duncan and Aber 
1997a, 1997b; Furstenberg et al. 1999), attempting to discover whether there is or is not an 
effect of neighborhood poverty. Our results are inconsistent with that approach. Poor 
neighborhoods in most cities are not de-institutionalized ghettoes. 

The positive effect of neighborhood poverty may be understood in light of several issues. 
One is the difference between poverty and segregation. Much of the work supporting the de- 
institutionalization thesis has been based on ethnographic research in predominantly black 
neighborhoods in Chicago. The de-institutionalization perspective argues strictly for 
economic, not racial, effects. However, its empirical support has stemmed from studies of 
blacks, such that it was often unclear whether it applied to most poor neighborhoods or only 
to predominantly black ones. Our findings suggest that predominantly black neighborhoods 
constitute special cases, as suggested by Clark (1965), Kasarda (1989), Massey and Denton 
(1993), and others, whereby the effects of segregation and institutional (and interpersonal) 
discrimination are strong enough to temper the effects that other differences between cities 
might have on the number of observed resources (Immergluck 2002). 

A related issue is population density, for which our models controlled. The story of 
institutional deterioration in Wilson (1987, 1996) was not just about poverty (in a black 
neighborhood) but also about depopulation —i.e., the departure of the (black) middle class. This 
picture differs from that in densely-populated, high-poverty neighborhoods in cities such as New 
York, where a prevalence of small establishments is more likely than not. Our findings in Table 
2 suggest that, on average, this pattern is more common than the alternative. Indeed, in models 
that did not account for population density (available upon request) the poverty coefficient was 
smaller and more likely to be negative. Thus, we would conclude that de-institutionalization is 
associated more with segregation and depopulation than with concentrated poverty. 

Our study made clear the significance of city-level factors for the impact of neighborhood 
poverty, a perspective few researchers have adopted. In the first years of the neighborhood 
poverty research resurgence, there was some focus on rust-belt cities as different from the 
rest, but not much more large-scale research into between-city differences in neighborhood 
poverty (e.g., Jencks and Peterson 1991; Jargowsky and Bane 1991; Small and Newman 
2001). Jargowsky (1996) more recently reported great diversity in poor neighborhoods across 
cities with respect to teen births, school dropouts and other factors, issues which seem 
worth probing further. We find that poor neighborhoods in cities of the South and West have 
consistently more establishments than those in other regions, and that those in economically 
depressed cities do worse than others. Generally, our findings suggest that ethnographic 
studies should be interpreted less as snapshots of conditions in average poor neighborhoods 
than as conditional accounts of high poverty in particular cities with unique characteristics. 

Finally, our findings question the assumption that only clients drive the presence of 
establishments, a market-based perspective on establishment survival. The de-institutionalized 
ghetto perspective focused primarily on the impact of the poverty of potential clients — in poor 
neighborhoods, clients had less discretionary income, making the stability of organizational 
resources precarious. We point the lens toward city-level political and economic conditions. 
Businesses do not rely exclusively on customers’ pockets; they also enjoy government incentives 
and direct aid such as tax breaks, grants for entrepreneurship in certain areas, vouchers to clients 
for services such as childcare, mandates such as the Community Reinvestment Act of 1977, and 
others (Logan and Molotch 1987). In addition to the free market, the state affects the distribution 
of establishments in poor neighborhoods. In New York City, for example, high poverty 
neighborhoods demonstrate a higher probability of having affordable childcare centers than low 
poverty neighborhoods, in part because of the influence of federal and city subsidy programs 
(Small and Stark 2005). In this sense, our findings reinforce a long-standing critique of the free- 
market assumptions of the Chicago School (Sampson and Morenoff 1997). 
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Contributions to Policy 


Our findings have implications about policy approaches to neighborhood poverty. One of 
the most important policy demonstrations in recent years has been the Moving to 
Opportunity (MTO) experiments, which test the effect of moving low-income individuals 
from poor to non-poor neighborhoods (Goering and Feins 2003). Relocation approaches 
assume, among other things, that neighborhood poverty constrains individuals’ capacity 
to gain access to important resources (Wilson 1987, 1996). Neighborhood poverty is likely 
to have such an effect among organizational resources, such as schools, tied directly to the 
tax structure of an area. But for most basic organizational resources, as our study shows, 
neighborhood poverty does not have such an effect in most cities. This suggests that an 
umbrella policy approach toward increasing resource access among the poor is unlikely to 
be effective unless policy makers carefully assess, rather than assume, neighborhood 
circumstances (McDermott forthcoming); policies would probably be more effective if they 
were targeted to city level conditions. 

Nevertheless, our research points to a number of complex issues that should be addressed 
before translating these results into policy prescriptions. One important issue is assessing the 
quality of resources. Just as the quantity of resources in poor neighborhoods depends partly on 
the city, we would expect the quality to do so, at least in part because quality varies dramatically 
from resource to resource. In addition, even within organizations, there are not always obvious 
quality metrics, which calls for careful theoretical reasoning. For example, a grocery store ina 
middle-class neighborhood might have more select products, but it may also be pricier, it may 
not offer ethnic-specific goods, and it may not accept WIC. Finally, changes over time should be 
explored further. One reason for the difference between poor neighborhoods with a high 
proportion of blacks and other poor neighborhoods may be the historical development of poor 
black neighborhoods (Wilson 1987). In general, processes such as transformation in the 
economy or in the tax code also have an impact on the constitution of poor neighborhoods. 


Notes 


1. Some researchers have referred to these as “neighborhood institutions,” and to the 
process as the institutional deterioration of the inner city (Wacquant and Wilson 1993:33). 
We exclude from discussion non-organizational resources that are sometimes considered 
“neighborhood institutions” — such as a dense social network in the community — which 
some have argued are prevalent in poor neighborhoods (Stack 1974). 


2. Notably, not all aspects of Wilson's overall theory rely on market factors; e.g., the 
argument that the departure of middle class blacks from the inner city was driven not by 
the market but by political transformations (Wilson 1987, 1996). 


3. This study does not purport to test Wilson’s comprehensive theory of the effect of 
neighborhood poverty, only the hypotheses noted. We do not test, for example, whether 
this effect is due to changes in the inner city during the 1970s and 1980s. 


4. A heated debate has ensued about whether enclave economies refer to the place 
immigrants work or live (Wilson and Portes 1980; Sanders and Nee 1987; Nee and Sanders 
1987: Portes and Jensen 1987; Portes and Jensen 1989). If the two overlap significantly, 
then the hypothesis presented above holds. If they do not, then enclave theory would not 
necessarily predict a high prevalence of resources; that is, a high proportion of poor 


1718 © Social Forces Volume 84, Number 3 * March 2006 


immigrants who live together but work in different locations would not be expected to 
sustain organizational resources any more than a high proportion of poor natives. 


5. We divided each Consolidated MSA into its component PMSAs and used these, along 
with the MSAs in the rest of the country. This yielded 331 MSA/PMSAs, because of 
missing data for Puerto Rico. 


6. The NAICS codes are the following: banks (522120), identified as “savings institutions,” which 
excludes commercial banks, mortgage brokers, monetary authorities and other financial 
institutions; childcare centers (624410), identified as “child day care services;” convenience 
stores (445120); credit unions (522130); drug stores or pharmacies (446110); hardware stores 
(444130); coin-operated laundries and dry-cleaners (812310), which excludes dry-cleaners 
with no coin-operated laundry; grocery stores (445110), which excludes convenience stores; 
grooming stores, a variable combining barbershops (812111), nail salons (812113) and beauty 
salons (812112): and limited-service restaurants (722211), which excludes cafeterias and 
snack bars. We recognize that a few of these establishments, particularly childcare centers, 
may often be non-profits. The NAICS coding system does not distinguish these 
establishments along profit orientation; such research would probably have to be conducted 
based on studies of specific establishment types. Also note that we deliberately exclude 
schools and hospitals. These have extremely wide ranges in size, administrative control, profit 
orientation and specialization; for these reasons, they should be studied separately. 


7. The Office of Management and Budget define the category of “central city” following 
official standards published in the Federal Register Notice. “In each metropolitan statistical 
area and consolidated metropolitan statistical area, the largest place and, in some cases, 
additional places are designated as ‘central cities’ under the official standards. A few 
primary metropolitan statistical areas do not have central cities. The largest central city 
and, in some cases, up to two additional central cities, are included in the title of the 
metropolitan area (MA).” Zip codes were matched to central cities employing the same 
“centroid” approach by which zip codes were GIS matched to metropolitan areas. 
(http:/Avww.census.gov/geo\www/cob/ma_metadata.html). See also (U.S. Department of 
Commerce 1994). 


8. Preliminary tests showed that addition of the two interaction terms induced 
multicollinearity. We re-scaled the predictors before creating the interaction terms, based 
on Smith and Sasaki (1979), to reduce the effect of multicollinearity to a minimum. Let 
X1*X2 = X3. The interaction term, X3, is thought of as a special case of (X1-c)*(X2-d) 
where c and d equal zero. If c and d are set to the means of X1 and X2 respectively, the 
new interaction term X3* will be less collinear with X1 and X2, provided Var(X3*) < Var(X3). 
Smith and Sasaki (1979:23) demonstiate it is possible to identify the values of c and d that 
result in the smallest possible variance of the multiplicative term. Doing so minimizes the 
multicollinearity due to the interaction term without changing the total explained variance 
or the coefficient for the interaction term, and with a minimal (or no) effect on other 
estimated parameters. Resulting correlations are available upon request. 


9. Even though our data contain the entire population of zip codes in the entire 
population of large cities, we treat our data as a sample and present standard errors. 
This approach, sometimes referred to as model-based inference, assumes that even 
censuses should be considered samples, given that they contain measurement error 
and they are subject to chance. As Deming and Stephan (1941:45) write, “A census 
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describes a population that is subject to the variations of chance, because it is only 
one of the many possible populations that might have resulted from the same 
underlying system of social and economic causes.” An alternative approach, sample- 
based inference, assumes that a census directly measures the population. From the 
latter perspective, standard errors are irrelevant when presenting data such as ours. 
This is a statistical and philosophical debate; it does not affect the actual models 
estimated, only their interpretation. For the current practical purposes, the reader 
following the sample-based approach may simply ignore the standard errors and 
assess the coefficients and their signs. For classic treatments of this issue, see 
Deming and Stephan (1941) and Hartley and Sielken (1975). 


10. We use the term “effects” in accordance with established practice in the HLM literature. 
We note, however, that we do not make causal statements based on these data. The 
questions we ask are descriptive, not causal — that is, we ask whether poor 
neighborhoods do, in fact, exhibit the characteristics attributed to them. 


11. The coefficients for the variables of interest do not differ substantially when models with 
and without interactions are compared. To conserve space we only present the full model 
with interaction terms on Table 2. 


12. Of course, we cannot state this unequivocally because we do not have individual-level 
data on the characteristics of the immigrants. 


13. The models in Table 3 were estimated with proportion poor as fixed. This was necessary 
because of convergence difficulties regarding medium and large establishments. This 
should be taken into account when comparing point estimates to those in Table 2. Our 
main conclusions were no different when proportion poor in Table 2 was estimated as 
fixed. Results are available upon request. 


14. The between-city variance, though small, is statistically significant for all 10 
establishments (not shown). 
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Appendix 


Below describe our model. Generalized linear models specify a sampling model, a structural 
model, and a link function. A Poisson distribution is often assumed in estimating a rate (Such as 
homicides per 100,000 persons) or a simple count. In either case, the outcome variable is 
assumed to reflect a “rate,” \, or expected mean, so that E(Y|\) = X. In a Poisson distribution, 
the variance is identical to the mean. We make two modifications to these assumptions. First, 
we consider that, as shown in Table 1, the variance exceeds the mean for all our outcome 
variables. To adjust for over-dispersion, we assume in our estimates that Var(Y|\) = o2E(Y |X), 
and expect oa > 1. Second, the standard Poisson model assumes constant “exposure” across 
units. In our case, zip codes vary widely in area, as shown in the first row of the bottom panel 
of Table 1; thus, we assume a varying exposure, measured by the area in square miles 
(Raudenbush and Bryk 2002:309ff; McCullagh and Nelder 189: 193ff). 

We model d as a function of metropolitan- and zip-level variables, in a hierarchical framework 
(Raudenbush and Bryk 2002). Hierarchical linear models are systems of simultaneously 
estimated equations in which the zip-level (neighborhood) and metropolitan-level (city) 
predictors are modeled independently; in conjunction, they realize the linear predictor 1. Given 
neighborhood / in city /, we specify the connection between the linear predictor and \ with the 
log link, whereby, yj; = log(djj) (Long 1997:257). Our main concern is the effect of poverty. Our 
model takes the following form: 


Level 1 


nj = Boj + By (Neighborhood_Poverty); ae Baj(Proportion_Black);; E B3j(Proportion_ 
Foreign_Born)j e B4jVeighborhood Controls); + ej 


Level 2 
Boj = Yoo ai Yo1(City_Covariates)¢; ae Qj 


Bij = Y10 + AY, 1(City_Covariates) 4; + 4) 


Bai = Y20 
B3j = Y30 


Where Bj is the expected log number of establishments in the average neighborhood in city j: 
By is the average coefficient associated with the neighborhood's poverty level: Bj is the average 
coefficient associated with the neighborhood's proportion black; 3; is the average coefficient 
associated with the neighborhood's proportion foreign born; By; is a vector of coefficients 
associated with neighborhood controls; Yq is a vector of coefficients associated with city-level 
controls affecting the average log number of centers per neighborhood, and 9; Is the deviation 
from that average associated with a particular city. The parameter Y 17 is. a vector of coefficients 
associated with city-level controls affecting the effects of zip-level poverty; M4 is the associated 
random effect. The model also tests whether the average number of organizational resources per 
neighborhood varies significantly across cities (H6); the null hypothesis that it does not vary is 19; 
= 0. The model also tests, independently, whether the coefficient or slope for neighborhoo 
poverty varies across cities (H7); the null hypotheses that it does not vary is 4) = 0. Models are 
estimated using HLM 5.05 software (Raudenbush et al. 2000). 
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Abstract 

Ecological theories linking community characteristics to the level of crime have rarely been 
tested outside the context of the United States and Western Europe. In this study we 
examine the effects of social cohesion and neighborhood disorder on crime using data from 
a survey of neighborhoods in Brazil. We find that lower-income neighborhoods, including 
irregular settlements known as favelas, have higher levels of social cohesion. Contrary to 
the results of research in U.S. urban areas, we find that greater cohesion among 
neighborhood residents is not significantly associated with lower levels of crime, and is in 
fact associated with a higher perceived risk of victimization. By contrast, neighborhood 
social and physical disorder increases violent victimization, but does not affect residents’ 
perceived risk of being victimized. We argue that the effect of social cohesion on risk 
perception is explained by the greater spread of information regarding crimes occurring in 
more cohesive neighborhoods where residents interact more frequently with each other. 


A large body of empirical research has focused on the effect that social and organizational 
characteristics of urban communities have on crime rates. In contrast to criminological 
theories that emphasize individual correlates of criminal offending and victimization, 
ecological theories on which this research is based focus on the type of places where crime 
tends to occur. One of the best-known theories linking community characteristics and crime 
is social disorganization theory. Since the classic formulation by Chicago School theorists 
Clifford Shaw and Henry McKay (1942), researchers have argued that community 
characteristics such as poverty, ethnic heterogeneity and residential mobility disrupt the social 
organization of urban neighborhoods in such a way as to reduce their capacity to exercise 
social control, and are therefore conducive to higher rates of crime and delinquency. (See also 
Bursik 1988; Bursik and Grasmick 1993; Kornhauser 1978; Sampson and Groves 1989.) A 
community’s low socioeconomic status (SES) in particular is thought to reduce civic 
participation and weaken social ties among residents. However, despite the importance early 
social disorganization theorists gave to poverty as a key factor disrupting communities’ social 
organization, the extent to which the effect of neighborhood poverty on crime is mediated 
through community-level factors such as a loss of social cohesion among residents remains 
insufficiently understood. 

Research on the social organization of urban neighborhoods and its effects on crime have 
rarely been tested outside the context of the United States and other advanced capitalist 
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countries. Yet the organization of urban communities in developing countries may challenge 
long-held assumptions regarding the effect of structural characteristics, and specifically 
poverty, on crime. Because of the pattern in which low-income areas in Latin American cities 
were settled, as well as the prominent role of the informal sector in local economies, 
disadvantaged neighborhoods in these cities are often characterized as having dense social 
networks (Leeds 1973; Lomnitz 1977, 1982; Perlman 1976). Yet many of these disadvantaged 
neighborhoods have relatively high rates of violent victimization. In this article we test social 
disorganization theory in the context of urban Brazil. We examine the effect that neighborhood 
characteristics such as poverty and residential stability have on social cohesion, and the effect 
that neighborhood social cohesion in turn has on neighborhood crime rates. We also consider 
an alternative explanation for neighborhood crime that focuses on social and physical disorder. 
According to this perspective, signs of disorder such as abandoned buildings, litter and loud 
music encourage criminal behavior by giving potential offenders the impression that a 
neighborhood is unguarded (Kelling and Cole 1996; Wilson and Kelling 1982). 

After examining the effect of social cohesion and neighborhood disorder on crime we turn 
our attention to the consequences of these neighborhood characteristics on residents’ 
subjective perceptions of the risk of criminal victimization. Researchers have increasingly 
considered perceptions of risk, and consequently fear of crime, as important social 
phenomena in their own right, with potentially serious consequences for personal well-being 
and neighborhood stability (Skogan 1986; Skogan and Maxfield 1981; Warr 1994). 
Surprisingly, studies in U.S. urban areas have found residents’ fear of crime to be only weakly 
related to neighborhood crime rates (Skogan and Maxfield 1981; Taylor and Hale 1986). One 
explanation for this apparent anomaly is that residents receive information regarding the 
relative risk of victimization not only through their direct experience with crime but also 
indirectly through others’ experiences. Greater social cohesion may therefore actually lead to 
a greater perception of risk by facilitating the spread of information even while it may 
contribute to a reduction in crime. Thus, residents in disadvantaged neighborhoods in Latin 
American cities characterized by a dense network of social ties may perceive a greater risk 
of crime than indicated by actual victimization rates. 


Community Social Organization and Crime 


Researchers have for a long time observed a strong relation between neighborhood crime 
rates and structural characteristics such as the level of poverty, ethnic composition and the 
rate of residential turnover. A central tenet of social disorganization theorists is that the effect 
of these exogenous structural factors on criminal victimization is mediated by the 
organizational characteristics of the neighborhoods as a whole. Precisely what are the 
relevant organizational characteristics and which are the most important exogenous factors 
has been a source of some debate. Drawing on Kasarda and Janowitz's (1974) systemic 
model of community organization, social disorganization theorists since the 1980s have 
argued that the effects of neighborhood structural characteristics are specifically mediated by 
the prevalence of informal ties, such as friendship and kinship relations, as well as formal ties 
generated by participation in neighborhood organizations (Bursik and Grasmick 1993; 
Sampson and Groves 1989). According to this view, neighborhoods characterized by dense 
social networks will experience greater trust among residents and cooperation in the 
enforcement of social norms against crime and delinquency. 

Among all the exogenous factors thought to affect residents’ ability to control crime perhaps 
none is as ill-understood as a neighborhood's socioeconomic status (Warner 1999). Following 
the systemic model's emphasis on social ties, many social disorganization theorists assume that 
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the effect of poverty is mediated by the structure of social networks in the neighborhood. 
Disadvantaged neighborhoods are thought to have fewer social ties among residents and lower 
rates of organizational participation. As Kubrin and Weitzer (2003) summarize the prevailing view 
in a recent review of social disorganization theory: “Concentrated disadvantage not only 
deprives neighborhoods of resources that may be mobilized to control crime, but also increases 
social isolation among residents, which impedes communication and interferes with their 
capacity to pursue common values...” (p. 380) 

In practice, the relation between poverty and neighborhood social ties is far from clear. 
While some studies find a negative relation between neighborhood disadvantage and the 
density of social ties, others find the contrary. A clear comparison is further hampered by the 
use of different measures. In their landmark study of social disorganization theory using data 
from the British Crime Survey, Sampson and Groves (1989) find that communities with low 
SES have lower levels of organizational participation and supervision of local youth. Both of 
these factors are in turn associated with lower rates of crime. However, they find no effect 
of SES on friendship ties. Warner and Rountree (1997) also fail to find a significant relation 
between poverty and social ties in a sample of Seattle neighborhoods. They define social ties 
based on the proportion of residents who report borrowing tools or food from neighbors, 
having lunch or helping neighbors with problems. Sampson (1991:55) actually finds that 
community SES is inversely related to density of friendship ties and positively related with 
anonymity. In other words, according to his findings, disadvantaged communities have dense 
friendship networks and less anonymity, both of which may be thought to increase social 
control. By contrast, Bellair (1997) finds that SES positively affects social interaction among 
neighborhood residents in three medium-sized U.S. cities. Greater social interaction, defined 
as getting together at least once a year, in turn reduces the level of burglary, motor vehicle 
theft and robbery. 

Recent ethnographic studies have also challenged the notion that low-income areas are 
characterized by a low density of social ties. Wilson (1996:62-64) argues that residents of 
impoverished neighborhoods in Chicago are often tightly interconnected, yet fail to generate 
the necessary informal social controls to reduce delinquent behavior. Integration into 
neighborhood networks may, in fact, increase delinquency by bringing local youth into contact 
with criminal groups (Browning, Dietz and Feinberg 2000; Pattillo 1998). Findings such as 
these have prompted Sampson and his colleagues to develop a new theoretical approach in 
which social control does not depend solely on the density of social ties, but also on a shared 
sense of trust and willingness to act in order to resolve community problems, which they 
term “collective efficacy.” (Morenoff, Sampson and Raudenbush 2001; Sampson, Morenoff 
and Earls 1999; Sampson, Raudenbush and Earls 1997) In what is perhaps the most rigorous 
test of the social disorganization perspective to date, Sampson and his colleagues use 
multilevel models to measure the effect of concentrated disadvantage, immigrant 
concentration and residential stability on neighborhood collective efficacy and through it, 
crime, while controlling for the characteristics of individual respondents (Sampson, 
Raudenbush and Earls 1997). In their reformulation of social disorganization theory, economic 
disadvantage at the neighborhood level is strongly conducive to lower levels of collective 
efficacy and therefore more crime. 


The Latin American Context 
The lack of appropriate survey data, with representative samples of a sufficiently large 


number of neighborhoods, has until now prevented researchers from properly testing social 
disorganization theory outside the context of a few major cities in the United States and 
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Europe. In this article we use recently collected data from a survey of neighborhoods in a 
large Brazilian city to examine the effect of socioeconomic factors on the density of social ties 
and crime. Latin American urban centers constitute a challenging test case for theories that 
propose a relation between the social organization of neighborhoods and crime. Low-income 
areas of Latin American cities are characterized by particularly dense social networks and 
strong ties among residents, and yet many have high rates of violent crime. 

The unusually high level of social cohesion among residents of disadvantaged neighborhoods 
in Latin American cities is partially due to the way in which these neighborhoods developed. 
Spread out along the outer rings of large metropolitan areas, these neighborhoods varyingly 
known as favelas in Brazil, barriadas in Peru, ca/lampas in Chile and co/onias populares in Mexico, 
were the first destination for rural migrants in the second half of the 20" century. Unable to afford 
formal housing, rural migrants often settled in makeshift dwellings on illegally occupied lands. 
Homes were typically built from substandard materials such as sheet metal and had dirt floors. 
With time, and if the settlement survived government attempts to remove it, the dwellings were 
replaced by better ones and services such as water and electricity provided. Because the survival 
of the settlement depends on the organizational strength of its inhabitants, those settlements that 
survive are characterized by strong social cohesion and a sense of solidarity (Alvito 1996; Leeds 
1973; Perlman 1976; Ward 1999).' The common migratory experience is a key factor in the high 
level of social cohesion. Many of the rural migrants that live in low-income settlements had 
friends or relatives living there or migrated with them (Cornelius 1975; Perlman 1976). Social ties 
among the urban poor in Latin America are further reinforced by their participation in the informal 
sector. Residents of so-called marginal communities in Latin American cities are forced to rely 
on each other to secure jobs, as well as for help in the construction and upgrading of their homes 
(Lomnitz 1977). 

In her detailed study of life in the fave/as and low-income neighborhoods of Rio de Janeiro, 
Janice Perlman (1976) finds that residents have unusually strong friendship and kinship 
networks. More than half of the respondents in her survey reported having their best friends 
and favorite relatives living in the same community. Perlman also finds a high degree of trust 
and cooperation among residents in the communities she studied, most of whom were 
migrants from rural areas. A vast majority of respondents said they could count on friends and 
neighbors in their community if they needed them, and 70 percent thought there was at least 
as much mutual help as in the countryside (pp. 132-134). Contrary to common belief, the 
favelas in Perlman’s study had an “extremely active associational life,” with 68 percent of 
favela residents or favelados belonging to at least one community group. This characterization 
of low-income settlements in Brazil as internally cohesive and organizationally strong is 
markedly different from the description of disadvantaged neighborhoods in U.S. inner cities 
prevalent in much of the literature on social disorganization. The unusually high level of 
integration in low-income areas of urban Brazil documented in detailed case studies such as 
these leads us to expect an inverse relation between neighborhood income and social 
cohesion in the statistical analysis below. 

A different tradition of criminological research linking community characteristics to the 
level of crime focuses on the effect that signs of physical and social disorder have on 
victimization rates rather than the role of local social organization. Signs of urban decay such 
as broken windows, abandoned buildings and graffiti give potential offenders the impression 
that a neighborhood is unguarded and that residents are unwilling to intervene to maintain 
public order and safety, thereby encouraging them to commit delinquent acts (Kelling and 
Cole 1996; Skogan 1990; Wilson and Kelling 1982; but see Harcourt 2001; Taylor 2001). Other - 
signs of social disorder such as loud music, drug use and prostitution may convey the same 
message. Like social disorganization theory, explanations of crime based on social disorder 
focus on neighborhood-level processes. However, because they focus on the physical 
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characteristics of neighborhoods as well as disorderly activity occurring within their 
boundaries instead of social ties and mutual trust among residents, social disorder theories of 
crime provide an alternative explanation for observed differences in victimization rates. 
Moreover, since low-income areas in Brazilian cities are characterized by more signs of 
disorder, such theories lead to the opposite prediction regarding the effect of neighborhood 
disadvantage on crime. 


Fear and Perceived Risk of Crime 


Research on the social ecology of urban areas has focused not only on the effects of 
neighborhood characteristics on crime, but also on the effects of neighborhood 
characteristics on fear and perceived risk of criminal victimization (Covington and Taylor 1991; 
Ferraro 1995; LaGrange, Ferraro and Supancic 1992; Perkins and Taylor 1996; Robinson, et al. 
2003; Rountree and Land 1996a, 1996b; Skogan and Maxfield 1981; Taylor, Gottfredson and 
Brower 1984; Taylor and Hale 1986). The subjective perceptions of crime are important social 
phenomena in their own right since they can have harmful consequences at the individual and 
community levels (Skogan 1986; Skogan and Maxfield 1981; Warr 1994). Greater fear of crime 
may prevent individuals from leaving their homes at night, and lead them to avoid public 
places such as parks and streets. Fear of crime may also encourage residents to move away 
from particular neighborhoods and erect barriers against potential offenders (Caldeira 2000). 
In general, a greater perception of risk and consequently fear, may lead individuals to become 
reclusive, with negative repercussions for entire communities. Fear of crime may also have a 
damaging effect on the local economy by promoting the movement of jobs to areas deemed 
safer (Skogan and Maxfield 1981 :48). 

Researchers have identified a few stable individual correlates for fear of crime. Specifically, 
women, the elderly and non-whites tend to be more fearful according to studies conducted in 
U.S. urban areas (Stafford and Galle 1984; Warr 1994). However, individual characteristics, 
including prior victimization, appear to be insufficient predictors of fear (Skogan and Maxfield 
1981:59-68; Taylor and Hale 1986; Warr 1994:11-20). At the aggregate level, fear is often not 
closely associated with crime rates (Greenberg 1986; Taylor and Hale 1986), suggesting that 
other processes are in play. Two prominent explanations for the discrepancy between fear of 
crime and actual crime focus on the social organization of communities and the level of disorder. 
The two models for fear of crime closely parallel the theories of victimization discussed above 
(Covington and Taylor 1991; Rountree and Land 1996b; Taylor and Hale 1986). 

First, the indirect victimization mode/ holds that “[flear is more widespread than 
victimization because those not directly victimized are indirectly victimized when they hear 
of such experiences from others.” (Covington and Taylor 1991:232) Because greater 
interaction among neighborhood residents will result in more communication of information 
regarding victimization, social cohesion among residents may be conducive to more 
“vicarious Victimization” and therefore a greater perceived risk of crime. Interestingly, the 
positive relationship between social cohesion and perceived risk of crime runs counter to its 
effect on victimization according to social disorganization theory. Thus, while more cohesive 
communities may be better able to exercise social control and prevent crime, they may 
paradoxically exhibit more fear (Skogan 1986). As Bursik and Grasmick (1993:98) point out, 
these two contradictory effects of social cohesion make it difficult to predict its overall effect 
on fear. Empirical results regarding the relationship between social cohesion and subjective 
perceptions of risk and fear of crime are in fact mixed, although they tend to favor a negative 
association. For example, in one of the earliest works on the subject, Skogan and Maxfield 
(1981) found that social integration led to slightly lower levels of fear of crime in a sample of 
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neighborhoods in three U.S. cities. Similarly, Taylor, Gottfredson and Brower (1984) find that 
social ties, based on the level of organizational participation, reduce fear of crime among 
residents in a sample of Baltimore city blocks. In their study of fear of crime in Seattle 
neighborhoods, Rountree and Land (1996a, 1996b) also show that greater social cohesion is 
associated with significantly lower levels of perceived risk. 

The second model of the relationship between neighborhood characteristics and fear of 
crime holds that signs of neighborhood disorder or “incivilities” give residents clues about the 
likelihood of victimization (Covington and Taylor 1991). The /ncivilities mode/ of fear clearly 
resembles the argument regarding the effect of social and physical disorder on victimization 
rates. However, the model hypothesizes an effect of disorder beyond its impact on actual 
crime rates. In this case, the effects of neighborhood ecology on victimization and fear work 
in the same direction such that the overall relationship between disorder and fear may be 
expected to be positive. A relatively large body of work has consistently found neighborhood 
disorder to be positively associated with higher levels of perceived risk and fear of crime 
(Covington and Taylor 1991; Ferraro 1995; LaGrange, Ferraro and Supancic 1992; Perkins and 
Taylor 1996; Rountree and Land 1996a, 1996b; Skogan 1990; Taylor and Hale 1986). 

In the following sections of the article we examine the effect of neighborhood 
characteristics on the level of crime and perceived risk of victimization in a sample of Brazilian 
neighborhoods using multilevel regression models. We begin by describing the data and 
methods used. We then examine the effect of neighborhood characteristics, and especially 
neighborhood disadvantage, on the extent of social cohesion and perceived disorder. Next, 
we analyze the effect of cohesion and disorder on three types of crime: robbery, assault and 
homicide. In the final section, we test the effect of neighborhood cohesion and disorder on 
the perceived risk of criminal victimization. 


Data and Measurements 


Data for this study are drawn from a survey of adults in the Brazilian city of Belo Horizonte, 
located in the state of Minas Gerais 434 kilometers from Rio de Janeiro. With a population of 
2,238,526 residents, Belo Horizonte is the fourth largest city in Brazil. The Victimization Survey 
of Belo Horizonte was carried out by the Center for Crime and Public Safety Studies (Centro 
de Estudos em Criminalidad e Seguranga Publica) of the Federal University of Minas Gerais 
in 2002. Because of the center's interest in examining crime in the city's favelas or slums, and 
especially in those with higher official rates of violence, census tracts categorized as favelas 
in the 1996 Demographic Census were oversampled. According to the results of the census 
carried out by the Brazilian Institute of Geography and Statistics (IBGE), 10.8 percent of 
households in Belo Horizonte are located in fave/as. Out of a target number of 4,000 
interviews, 1,000 were conducted in favelas, 500 of which were considered violent areas 
based on an analysis of official crime reports (Beato et. al. 2001). After selecting the census 
tracts (setores) based on these criteria, 20 households in each tract were selected by random 
sample and a person 15 years or older was interviewed in each household. Sampling weights 
were generated to adjust for oversampling of fave/as and violent favelas. Additional variables 
used in the regression models, including measures of income and household structure at the 
neighborhood level, are obtained from the 2000 Demographic Census. 

Because of the sampling strategy employed there are sufficient respondents within each 
neighborhood to allow the construction of aggregate indicators for the neighborhood ~ 
variables of interest, and the use of multilevel regression analysis. We approximate 
neighborhoods using setores or census tracts, each containing an average of 915 residents 
(257 households). The final sample consisted of 3,873 respondents located in 197 
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neighborhoods (less than the target 4,000 due to non-responses), which means that we have 
an average of 19.7 respondents per neighborhood (for some of the regression analyses the 
average number of respondents in each neighborhood is reduced to 19.1 due to missing 
values). Although relatively low, the number of respondents is similar to those found in 
neighborhood studies in U.S. cities. For example, Raudenbush and Sampson (1999b: 132) test 
their models using an average of 23 respondents per neighborhood cluster where each 
neighborhood cluster has an average population of 8,000 people, in other words, a much 
larger geographical unit than our census tract containing approximately 915 residents. 
Raudenbush and Sampson (1999a:9) report relatively low improvements in the reliability of 
aggregate indicators beyond 20 raters per neighborhood. 

Table 1 lists the survey questions used to measure social cohesion and neighborhood 
disorder, the two main explanatory variables used in the analysis below. The survey items 
corresponding to our measure of perceived risk of crime used in a later section are also listed. 
The questions included in our measure of social cohesion not only tap into the level of 
interaction among neighborhood residents (either in person or by telephone), but also the 
frequency with which they exchange favors such as taking care of each others’ children, 
lending things and looking after each others’ home or car. Because some of these activities 
involve entrusting others with valuable property and the safety of their children, our measure 
of social cohesion captures the degree of trust as well as the willingness to engage in 
guardianship. Our operationalization of the concept of social cohesion is therefore 
comparable to previous studies. (See, for example, Lee 2000; Rountree and Land 1996a; 
Sampson 1991; Sampson, Raudenbush and Earls 1997. 


Table 1: Neighborhood Variables from Victimization Survey in Belo Horizonte 


Social Cohesion ICC = .12 Reliability = .67 

1. How frequently do you visit or receive visits from residents of this neighborhood? 

2. How frequently do you speak personally or by telephone with residents of this 
neighborhood? 

3. How frequently do you and your neighbors exchange kindnesses and favors such as taking 
care or playing with your children, lend stuff or groceries, look after the house or car, etc.? 


Neen eg peepee 
Neighborhood Disorder ICC = .84 Reliability = .75 
1. Are there abandoned buildings, houses or sheds in your neighborhood? If yes, many or 
few? 
2. Is there liter or debris on the street or sidewalks in your neighborhood? If yes, much or few? 
3. Are there empty lots with litter or debris in your neighborhood? If yes, many or few? 
4. Do your neighbors usually listen to loud music, have arguments or hold parties until late at 
night? If yes, do they do this frequently or once in a while? 


aS 


Perceived Risk ICC = .13 Reliability = .69 
Thinking about the things that you do and the places where you go by everyday, does it seem 
to you that the risk is very large, large, medium, small or very small of... 

1. Being robbed or threatened of robbery in your neighborhood? 

2. Being harmed or threatened of harm in your neighborhood? 

3. Being kidnapped in your neighborhood? 

4. Being gravely wounded or killed on purpose in your neighborhood? 
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Our measure of disorder includes questions that capture both the physical and social 
characteristics of a neighborhood that may lead residents and potential offenders to think that 
it is a place with lax enforcement of social norms (Sampson and Raudenbush 1999). 
Questions regarding the level of physical disorder ask residents whether there are abandoned 
buildings or empty lots and whether there is litter in public places. The level of social disorder 
is captured by a question asking respondents to assess whether there are arguments or 
parties until late at night, and whether neighbors listen to loud music. 

We use respondents’ assessments of the risk of being victims of four different types of 
crimes in their neighborhood to construct a measure of the overall perceived risk of 
victimization. We prefer to use the perceived risk of crime rather than fear of crime as a 
dependent variable because the latter involves an emotional response that is more difficult 
to measure (Ferraro 1995:21-39). Perceived risk of crime has been found to be closely 
associated with fear of victimization in previous studies, yet is conceptually and empirically 
distinct from it (Ferraro 1995; LaGrange, Ferraro and Supancic 1992; Rountree and Land 
1996a; Warr 1994). Warr and Stafford (1983) for example, find that fear is a multiplicative 
function of the perceived risk of being a victim of a particular crime and the perceived 
seriousness of that crime. 

For each latent variable in table 1 we computed measures of inter-rater agreement and 
scale reliability at the neighborhood level. Following the work of Raudenbush and Sampson 
(1999a), we measured the extent of agreement among neighborhood residents using the 
intra-neighborhood correlation coefficient (ICC). The ICCs are computed as “the ratio of 
between-neighborhood variance to the sum of between- and within-neighborhood variance, 
where the variance attributable to item inconsistency has been removed.” (Raudenbush and 
Sampson, 1999a:7)° Although they may appear to be low, the ICCs are comparable to those 
presented by Raudenbush and Sampson (1999a) as well as other multilevel studies 
(Browning, Dietz and Feinberg 2000; Raudenbush, Rowan and Kang 1991). The neighborhood 
reliabilities measure the internal consistency of reported assessments of cohesion and 
disorder at the aggregate level. They are preferable to standard individual-level measures of 
reliability such as Cronbach's alpha because the latter confound the reliability of neighborhood 
measures such as social cohesion and the reliability of residents’ perception of cohesion 
within neighborhoods (Raudenbush, Rowan and Kang 1991:311-312). 

In addition to our measures of social cohesion and perceived disorder we constructed 
several neighborhood variables to be used as predictors in the analyses below. First, the level 
of organizational participation is measured by the presence of any organization devoted to the 
problem of violence in the neighborhood, based on the results of the Victimization Survey. 
According to social disorganization theory, organizational participation in a neighborhood 
should be conducive to lower levels of crime. Although participation in any form of 
organization may have the hypothesized effect, unfortunately no information is available on 
the existence of organizations other than those devoted to violence prevention. Second, 
residential stability is defined as the pefcentage of residents living in the neighborhood for five 
or more years, and is expected to increase social cohesion and reduce crime. Third, the 
percentage of female-headed households according to the 2000 Census is also introduced as 
a predictor. A large presence of female-headed households may be expected to reduce social 
cohesion by decreasing the overall level of guardianship at the neighborhood-level (Sampson 
1987; Sampson and Groves 1989). Fourth, we constructed a measure of neighborhood 
disadvantage that takes into account both the income level and the racial composition of a 
neighborhood. Specifically, the index of neighborhood disadvantage is computed as the first : 
principal component of the percentage of household heads earning twice the minimum wage 
or less according to the 2000 Census and the percentage of non-white residents according to 
the Victimization Survey.® Despite popular perception of greater racial equality in Brazil, 
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neighborhood poverty is so closely related to the percentage of non-white residents that we 
are unable to distinguish the effect of these two variables (the correlation between the 
measures of low income and percent non-white is 0.64). Sampson, Raudenbush and Earls 
(1997) and Morenoff, Sampson and Raudenbush (2001) also encounter difficulties 
distinguishing the effect of racial composition and poverty in Chicago neighborhoods and 
construct a similar index of neighborhood disadvantage.° Descriptive statistics for all these 
neighborhood variables are shown in Table 2. 

The models estimating the effect of social cohesion and neighborhood disorder use 
respondents’ answers to questions regarding their victimization during the previous 12 
months. We specifically examine whether respondents were victims of two types of crimes: 
assault and robbery. The victimization variables were coded as 1 if a respondent reported 
being a victim of that type of crime or attempted crime in their neighborhood and 0 
otherwise.’ As shown in Table 2, 3 percent of all individuals in the sample reported being 
victims of a robbery or an attempted robbery, and 5 percent reported being victims of an 
assault or an attempted assault in their neighborhood during the past year. To further examine 
the effect of social cohesion and disorder on crime we also test models using the number of 
homicides reported in official government statistics over the 2001-02 period in each 
neighborhood as the dependent variable and the total neighborhood population as the 
exposure group. Descriptive statistics for both of these variables are also reported in Table 2. 

Finally, some recent studies examining neighborhood social cohesion and disorder have 
considered the feedback effect that crime may have on these two measures of neighborhood 
social ecology. As Skogan (1990) and others have argued, residents may be less willing or 
afraid of forming friendship ties with neighbors and getting involved in community 
improvements in areas with high crime rates (Bellair 2000; Markowitz, Bellair, Liska and Liu 
2001; Sampson and Raudenbush 1999). A full consideration of these reciprocal effects is 
beyond the scope of this study and would ideally involve the use of longitudinal data. 
However, following Sampson, Raudenbush and Earls (1997:922) we introduce a lagged 
measure of the neighborhood crime rate in order to partially control for the effect that the level 
of crime at an earlier point in time may have had on social cohesion and disorder. The prior 
crime rate is computed as the number of thefts, robberies, homicides and attempted 
homicides reported to the authorities per 100,000 residents in 1998, four years before the 
Victimization Survey.® 


Method 


We begin by examining the effect of neighborhood characteristics such as socioeconomic 
disadvantage, residential stability and the prevalence of female-headed households on the 
level of social cohesion among neighborhood residents. Following our theoretical discussion 
above we are especially interested in the relation between neighborhood disadvantage and 
social cohesion. Using similar models we compare the effect of the same neighborhood 
characteristics on the level of disorder. We will then examine the effect that social cohesion 
and neighborhood disorder have on criminal victimization and perceived risk of victimization. 
We employ three-level regression models in which responses to each item making up our 
index of social cohesion and disorder are nested within individuals, which are in turn nested 
within the neighborhoods where they reside (Sampson, Raudenbush and Earls 1997). This 
methodological approach allows us to measure the impact of neighborhood variables while 
holding constant those characteristics of individual respondents that may affect their 
assessments of social cohesion and neighborhood disorder as well as their likelihood of being 
victims of crime? For the purpose of illustration we will focus on the models for social 
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Table 2: Descriptive Statistics for Neighborhood- and Individual-Level Variables 





Variable N Mean sD Min. Max. 
Neighborhood-level Variables 


Organizational participation 197 85 36 0 1 
Neighborhood disadvantage 197 .00 .80 -1.65 1.71 
Percent non-white 197 49.52 23.23 0 95 
Percent low income 197 38.28 23.28 Uae 87.27 
Residential stability 197 76.83 12.99 35 100 
Female-Headed Households 197 34.21 8.44 3.85 63.64 
Prior Crime Rate per 100,000 197 87.87 126.28 00 1219.51 
Favela 197 .26 44 0 { 
Violent Favela 197 n13 34 0 1 
Homicides (2001-02) 197 44 89 0 7 
Neighborhood population 197 915.31 294.40 82 1771 
Individual-level Variables 
Female 3766 53 50 0 1 
Married 3766 39 49 0 1 
Homeowner 3766 at, 45 0 1 
Non-White 3766 50 50 0 1 
Age 3766 38.03 16.43 14 89 
Years in neighborhood 3766 16.13 13.13 0 77 
SES 3766 .00 1.00 -1.94 3.41 
Victimization Variables 
Robbery 3766 .03 18 0 1 
Assault 3766 .05 2 0 1 


“Scale based on z-scores. 


cohesion, but the models for disorder are similar. (See Sampson, Raudenbush and Earls 1997 
and Raudenbush and Sampson 1999b for a further description of the method used.) 


Measurement Model: Our level-1 model captures within-person differences in the latent 
perception of social cohesion. Let Yj, be the response to the /th question measuring social 
cohesion by person / living in rieighbowised k. We model Yiik as a function of the latent 
perception of social cohesion, plus error: 


PI 
Vik = 7 jk + >; 1° Dijk + Sik 
p = 


where TT jk is the “true score” for that particular person; D jjk ‘S a dummy variable that takes 
on a value of 1 if response i is to question p or it is 0 otherwise; and a is a coefficient that 
captures what is often referred to as the “difficulty” of question p in the psychometric: 
literature; here, it gives more weight to those survey questions that best measure social 
cohesion. (One of the three survey items is omitted as a reference group.) The error term erik 
is assumed to be normally distributed. 
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Individual-level Model: Our level-2 model captures the between-person variation in the 
assessment of social cohesion. The latent “true score” of social cohesion TT jz is modeled as 
a function of seven individual characteristics: (1) gender, (2) marital status, (3) home 
ownership, (4) race (1 = non-white, 0 = white), (5) age, (6) the number of years a respondent 
has lived in the neighborhood, and (7) the respondent's SES measured as the first principal 
component of education and income." These individual-level variables are meant to control 
for differences in the extent to which individual respondents are integrated into neighborhood 
social networks. For example, residents who are homeowners and those who have lived in 
the same neighborhood for many years may be expected to have more and stronger ties with 
other residents. All individual-level variables are centered around a grand mean for all 
respondents in the sample such that the intercept of the level-2 model, nz, may be thought as 
the neighborhood level of social cohesion for the average individual: 


7 
Tk = Kt > 8gX aik + Tk 


where X,,; are the individual-level predictors, 8g is the coefficient measuring the effect of each 
predictor on an individual's level of social cohesion, and vik is an individual-level error term. 


Neighborhood-level Model: Our \evel-3 model captures the between-neighborhood variation 
in social cohesion. This neighborhood-level model tests our principal hypotheses regarding 
the effect of neighborhood characteristics on social cohesion net of individual factors used as 
predictors in level 2. As described above, we introduce the neighborhood crime rate in 1998, 
in addition to the neighborhood-level predictors in order to control for the effect that the level 
of crime at an earlier point in time may have had on social cohesion. We also introduce 
dummy variables for fave/as and violent favelas in order to control for over-sampling of these 
types of neighborhoods:" 


Nk =Yoot 01 (disadv) +99 (res. stab.) +y9g (fem head hslds.) + 
Yo (prior crime) +og (favela) +¥qg (viol. favela) + Ugg 


Results 


Table 3 shows the results of the multilevel regression models of social cohesion and 
neighborhood disorder on individual and neighborhood characteristics. The most striking 
finding in table 3 is the positive and significant association between neighborhood 
disadvantage and social cohesion. According to these results, poorer neighborhoods of Belo 
Horizonte actually have greater levels of social integration and exchange of favors among 
residents than wealthier ones. While somewhat surprising given the assumptions of social 
disorganization theory, this finding is consistent with the hypothesis derived from our 
discussion of Latin American cities. As predicted by systemic theory, greater residential 
stability is also conducive to greater social cohesion. The prevalence of female-headed 
households on the other hand, appears not to have a significant effect. The coefficients of the 
individual-level predictors indicate that when all other factors are taken into account, 
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homeowners and those living more years in the neighborhood are more integrated into their 
communities, while older residents are less so.xii The first two findings support systemic 
theory in that residents who own a home or who have lived in the neighborhood for many 
years are more likely to have established close ties with their neighbors. The fact that, net of 
other factors, older respondents report having less social interaction with other residents may 
be indicative of the amount of time they spend outside the home. 

Neighborhood disadvantage is also the most important predictor of neighborhood disorder 
according to the results of the model presented in the second column of table 3. This result is 
consistent with findings in studies of U.S. urban areas: poorer neighborhoods are more likely to 
exhibit signs of abandonment and will lack the resources needed to rebuild and clean up public 
spaces. However, in contrast to the results for social cohesion, residential stability is not a 
significant predictor of neighborhood disorder. The coefficients of the individual-level predictors 
for the models of social disorder indicate that, net of other factors, female residents are more 
likely to perceive signs of disorder while older residents perceive less. These differences may be 
attributed to the relative amount of time these two groups of residents spend in public spaces 
in their neighborhoods as well as different standards of what constitutes disorder. The greater 
level of disorder reported by homeowners may reflect their greater commitment to the 
neighborhood and sensitivity to factors that may influence the value of their property. 

In order to examine the effect of social cohesion and perceived disorder on criminal 
victimization in neighborhoods of Belo Horizonte we tested two-level logistic regression 
models where individuals are nested within neighborhoods. Because the dependent variable 
is based directly on respondent's answers to questions regarding whether or not they were 
victims of a crime and not on latent variable constructs as above, no measurement-level 
models is needed. Our dependent variable Y;; takes on a value of one if respondent / in 
neighborhood / reported being a victim of a crime or an attempted crime during the previous 
12 months and zero if not. We assume that the probability that Yj=1 is distributed Bernoulli 
and the log-odds of the probability, nj, is a function of the same seven individual-level 
characteristics used in the level-1 model above: 


7 
y= B+ Bai Xai * 7 
q = 


where Bgj are the coefficients for each of the individual-level predictors Xi each centered 
around a grand mean such that Bo; may be thought to be the log odds of victimization of an 
average individual in neighborhood /. 

In our level-2 model the log-odds of victimization in a neighborhood is assumed to be a 
function of social cohesion and disorder. The neighborhood measures of social cohesion and 
disorder used for this purpose are computed as the predicted values of three-level models 
similar to those presented above where the individual characteristics of respondents are 
entered in the second level but no neighborhood predictors are included in level 3. These 
predicted values therefore correspond to the levels of cohesion and disorder after individual 
characteristics that may affect respondents’ judgments are controlled. We include 
organizational participation, neighborhood disadvantage, residential stability and the 
percentage of female-headed households as predictors in the regression models in order to 
assess their direct effect on neighborhood crime: 


Boj = Yo0+ Yo1(soc. cohesion)+y99 (disorder)+yqa(org. participation)+yga(disadv) + 
Yo5 (res. stab) +¥9g (fem. headhslds.) +7 (favela) +99 (viol. favela) +Ugk 
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Table 3: Coefficients from the Multilevel Regressions of Social Cohesion and Neighborhood 
Disorder on Individual and Neighborhood Characteristics (X100) Association Between 
Neighborhood Social Cohesion and Victimization 


Cohesion Disorder 
71.964** 97.192** 
Intercept ! (16.214) (12.243) 
Neighborhood-level predictors 
11183"% 1.048** 
Neighborhood disadvantage (3.819) (2.801) 
534™ -.095 
Residential Stability (.180) (.136) 
107 -.079 
Female-Headed Households (.258) (.195) 
.004 .016 
Prior Crime Rate (.017) (.013) 
073 1.047 
Favela (7.438) (5.626) 
-1.807 -6.947 
Violent Favela (8.305) (6.295) 
Individual-level predictors 
-4.987 4.474** 
Female (2.846) (1.663) 
-5.650 2.226 
Married (3.120) (1.833) 
12.443** 4.024** 
Homeowner (3.459) (2.043) 
5.781 .307 
Non-White (3.188) (1.871) 
-.934** -.532™* 
Age (.101) (.059) 
.686** .083 
Years in Neighborhood (.126) (.074) 
-1.232 5.1 s 
SES (1.899) (1.132) 


Sera eS ee 
Note: Numbers in parentheses are standard errors. 
*p <.05 **p <.01 (two-tailed tests) 
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Results 


Table 4 shows the results of the multilevel logit regression models of victimization on social 
cohesion and neighborhood disorder. Contrary to the hypothesis derived from social 
disorganization theory, the level of social cohesion in a neighborhood is not significantly associated 
with a lower probability of robbery or assault. By contrast, greater neighborhood disorder appears 
to increase the likelihood of assault when all other relevant individual and neighborhood 
characteristics are controlled. The magnitude of the effect is indeed large. One standard deviation 
increase in neighborhood disorder is associated with a 25 percent increase in the odds of being a 
victim of assault. With regards to the remaining neighborhood variables, neighborhood 
disadvantage reduces the odds of robbery, which is a property crime, but not assault. This result is 
consistent with routine activities theory in that more disadvantaged neighborhoods will generally 
have less targets for robbery than affluent ones (Cohen and Felson 1979). Although favelas appear 
to have a higher incidence of assault, the positive coefficient cannot be interpreted in substantive 
terms because favelas with higher rates of violence were purposely oversampled. 

The coefficients of the individual-level predictors indicate that women are more likely to be 
victims of assault than men, but less likely to be victims of robbery. Older individuals are also 
less likely to be victims of assault, while non-whites are more likely to be victimized. And finally, 
wealthier and more educated individuals are considerably less likely to be victims of physical 
assault, once all other relevant factors are controlled, but more likely to be victims of robbery. 
Because this last finding holds when controlling for neighborhood characteristics, it indicates 
that more affluent individuals are the targets of property crimes regardless of the neighborhood 
socioeconomic level. This finding is once again consistent with routine activities theory. 

In order to further examine the effect of social cohesion and neighborhood disorder on 
criminal victimization in neighborhoods of Belo Horizonte, we tested additional models using the 
number of homicides reported in official statistics in 2001-2002 for each neighborhood as the 
dependent variable and twice the total neighborhood population as the exposure group. These 
regression models are not hierarchical since individual characteristics of homicide victims are 
unknown and only the aggregate homicide counts for each neighborhood are available from 
official police reports. Individuallevel predictors are therefore removed from the regression 
models. As is customary in studies of homicide, negative binomial regressions are used due to 
the skewness of the dependent variable."* The results of this new set of regression models 
shown in table 5 are consistent with the findings from the multilevel regressions for assault 
because neighborhood disorder — and not social cohesion — is a significant predictor of homicide. 
However, in these new models neighborhood disadvantage is associated with a higher incidence 
of homicide even after the level of social and physical disorder is taken into account. 


Perceived Risk of Crime in Brazilian Neighborhoods 


In the previous section we tested social disorganization and social disorder theories of crime. 
The results generally contradict social disorganization theory in that social cohesion among 
neighborhood residents was not found to be a significant predictor of criminal victimization. 
The findings lend greater support to the hypothesis that neighborhood social and physical 
disorder are conducive to crime, and specifically violent crime such as assault and homicide. 
In this section we turn to an examination of individuals’ subjective interpretation of the risks of 
victimization rather than the actual probability of being victimized. As discussed above, 
research in U.S. urban areas has shown that individuals’ perceived risk of victimization is only 
weakly related to actual victimization. Neighborhood social cohesion may therefore have a 
strong effect on the perception of risk despite its lack of association with victimization. 
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Table 5: Coefficients from the Negative Binomial Regressions of Homicide on Social Cohesion 
and Neighborhood Disorder, 2001-2002 





Variables Model 1 Model 2 - Model 3 
Neighborhood-level predictors 
.205 -.232 
Social Cohesion (.722) (.813) 
ee Oa 2.300" 
Neighborhood Disorder (.739) (.753) 
-.449 -.595 -.594 
Organizational Participation (.314) (.317) (.318) 
-.017 -.014 -.013 
Residential Stability (.012) (.012) (.012) 
.887** B5i"* 863" 
Neighborhood Disadvantage (.197) (.203) (.204) 
024 027 .027 
Female Headed Households (.016) (.015) (.015) 
-8.094** dos. -9.943** 
Intercept 1.247 1.284 1.450 
Chi Squared 24.45 38.6 39.18 
Log-likelihood -135.48 -131.93 -131.89 
n 197 197 197 





Note: Numbers in parentheses are standard errors. 
*p<.05 **p <.01 (two-tailed tests) 


Interestingly, the effect of social cohesion on perceived risk of victimization may be the 
opposite of its expected effect on neighborhood crime. While greater social cohesion in a 
neighborhood is thought to decrease victimization rates by increasing guardianship and a 
willingness to intervene on behalf of others, social cohesion may actually lead to a greater 
perception of risk by facilitating the spread of information regarding criminal incidents 
occurring in the neighborhood. All else being equal, residents of cohesive neighborhoods may 
have knowledge of a greater number of crimes against acquaintances and others simply by the 
fact that they are more closely interconnected. According to the indirect victimization model 
we therefore expect a positive association between our measure of social cohesion and the 
perceived risk of crime, especially after prior victimization is controlled. The social disorder or 
incivilities model, on the other hand, predicts that neighborhood social and physical disorder 
will be associated with a greater perceived risk of crime as residents will interpret signs of 
disorder as evidence that they live in an areas where social norms are not well enforced. 

In the analysis presented in this section we employ a method proposed by Raudenbush and 
Sampson (1999b) specifically intended to assess the direct and indirect effects of aggregate-level 
variables in multilevel designs. This method allows us to determine the extent to which the 
effects of exogenous neighborhood variables such as neighborhood disadvantage, residential 
stability and the prevalence of female-headed households are mediated through their effects on 
social cohesion (indirect effects) and the extent to which they directly affect subjective 
perceptions of risk (direct effects) controlling for the individual characteristics of respondents. This 
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method relies on a transformation of the parameters of the joint distribution of the latent variables 
for social cohesion and perceived risk given the individual- and neighborhood-level predictors to 
obtain total, direct and indirect effects. One critical feature of this method is that it adjusts for 
measurement error of our intervening variable of social cohesion, as opposed to regressions 
using a manifest version of social cohesion given by a simple aggregation of responses to the 
corresponding survey items. Raudenbush and Sampson (1999b:142-144) demonstrate that using 
manifest variables instead of latent neighborhood variables leads to a substantial underestimation 
of the magnitude of their effect and an overestimation of their statistical significance."* 

Table 6 presents the total effects of the neighborhood-level variables, as well as their direct 
effects. The direct effects of neighborhood variables are those that are unmediated by social 
cohesion. As in the regression models for social cohesion above, we enter the prior crime rate as 
a predictor of both social cohesion and perceived risk in order to ensure that we are not simply 
capturing the effects of previous crime. The first thing to note from the results in Table 6 is that 
social cohesion has a strong positive effect on the perceived risk of victimization. This finding is 
consistent with the indirect victimization model because greater social cohesion may be increasing 
residents’ assessments of the likelihood of being victimized by promoting a greater exchange of 
information regarding criminal incidents occurring in the neighborhood. By contrast, neighborhood 
disorder appears to have no effect on perceptions of risk." Second, none of the exogenous 
neighborhood-level variables, including neighborhood disadvantage, have a significant total effect 
on perceived risk of victimization. However, when the effect of neighborhood disadvantage is 
decomposed into its direct and indirect components we find significant opposite effects as 
illustrated in figure 1. Because neighborhood disadvantage increases the level of social cohesion, 
and neighborhood cohesion in turn significantly increases perceived risk, the indirect effect of 
neighborhood disadvantage is positive, while its direct effect is negative. In other words, residents 
of disadvantaged (poor, non-white) neighborhoods perceive the risk of victimization to be greater 
than their counterparts in other neighborhoods because of greater interaction and communication. 
This negative direct effect of neighborhood disadvantage may, in part, reflect a desensitization of 
residents of such neighborhoods to risk, resulting in an overall shift in the scale of their answers 
to the survey questions pertaining to risk. 

The coefficients for the individual-level predictors in Table 6 indicate that residents who 
were victims of a robbery or assault during the past year perceive a greater risk of 
victimization. Consistent with findings in the United States, women generally perceive a 
greater risk of victimization. However, contrary to results from studies conducted in U.S. urban 
areas, older residents perceive a significantly lower risk, while non-whites report similar levels 
of risk as white residents (Skogan and Maxfield 1981; Warr 1994). 


Figure 1. Direct and Indirect Effects of Neighborhood Disadvantage on Perceived Risk of Crime 
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Table 6: Total and Direct Effects of Neighborhood Characteristics on Perceived Risk as Mediated 
by Social Cohesion (X 100) 





Model 1 _Model 2 


Total Direct Total Direct 
Effect Effect Effect Effect 


63.422" 47.023" + —-63.374** ~—-«6.655** 
Intercept | (16.01) (17.425) (16.337) (19.406) 


Neighborhood-level predictors 


.231* .281* 
Social Cohesion (.102) (.115) 
-.307 
Neighborhood Disorder (.187) 
044 -.117 -.164 462 
Organizational Participation (5.733) (5.685) (5.847) (5.866) 
-5.535 -8.026* -5.483 -~6.875f 
Neighborhood Disadvantage (3.484) (3.630) (3.577) (3.801) 
011 -.111 013 -.134 
Residential Stability (.171) (.178) (.174) (.183) 
-.049 -.064 -.047 -.090 
Female-Headed Households (.246) (.243) (.251) (.249) 
015 014 015 .016 
Prior Crime Rate (.017) (.016) (.017) (.017) 
3.370 3.484 3.299 3.566 
Favela (7.080) (7.008) (7.221) (7.151) 
.630 .790 645 323 
Violent Favela (7.921) (7.839) (8.074) (7.998) 
Individual-level predictors 
9.464** 1.489** 
Female (1.817) (2.019) 
-.581 -.385 
Married (2.004) (2.224) 
-1.901 -1.988 
Homeowner (2.240) (2.483) 
-2.248 -2.434 
Non-White (2.043) (2.268) 
133" sO 
Age (.065) (.073) 
-.036 -.041 
Years in Neighborhood (.083) (.092) 
1.042 1.442 
SES (1.246) (1.378) 
21.141 2st 2 
Victim of Robbery (5.236) (5.812) 
27.566** O87? 
Victim of Assault (4.224) (4.690) 


Note: Numbers in parentheses are standard errors. 
tp <.10 *p<.05 **p <.01 (two-tailed tests) 
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Knowledge of Crime and Social Cohesion 


We have argued that the greater perceived risk of victimization in neighborhoods with 
more social cohesion among residents may be explained by the greater spread of 
information regarding crimes occurring in neighborhoods. For two neighborhoods with 
similar crime rates, the one with more communication among residents will likely result 
in a greater (perhaps even exaggerated) assessment of the actual crime rate and 
consequently (individual factors remaining constant) a greater perceived risk of being 
victimized. In order to test this proposition we fitted a new set of regression models where 
knowledge of the occurrence of homicides in the neighborhood during the past year was 
used as the dependent variable. Respondents to the Victimization Survey were asked if a 
homicide had taken place in their neighborhood during the previous 12 months: 32.4 
percent of all respondents reported having knowledge of such an event. Because the 
survey was conducted in January 2002, we used official homicide crime report data from 
2001 to create a variable indicating whether a homicide had indeed occurred during the 
previous year. We entered this variable as a predictor in the regression models to control 
for the actual occurrence. The results of the two-level logit regressions shown in Table 7 
indicate that social cohesion is indeed an important predictor of knowledge of a homicide 
event. Greater communication and interchange among neighbors as indicated by the 
variables used to measure social cohesion is associated with a greater probability of 
knowledge of that particular type of crime. 

Finally, in order to test the effect that knowledge of a homicide has on the perceived 
risk of victimization we fitted a regression model using respondents’ perceived risk of 
being gravely wounded or killed on purpose or by accident as the dependent variable." 
The results of this model presented in Table 8 indicate that knowledge of a homicide 
occurring in the neighborhood during the previous year substantially increases the 
perception that respondents’ themselves could be victims. More importantly, when 
knowledge of homicide is introduced in the regression model in the second column of 
Table 8, the effect of social cohesion is reduced and its corresponding coefficient is no 
longer statistically significant at the .05 level. Although not conclusive, this finding 
suggests that the effect of social cohesion on perceived risk may indeed be due to the 
greater spread of information about criminal incidents occurring in neighborhoods with 
more social cohesion. 


Conclusions 


Over the past several decades sociologists have renewed their interest in the effects that 
neighborhood contexts have on crime and other social outcomes. (See Sampson, 
Morenoff and Gannon-Rowley 2002 for a recent review.) These neighborhood studies 
draw on a rich tradition of sociological theory, the origins of which may be traced back to 
the Chicago School of urban sociology, among others. The availability of better surveys 
and the development of sophisticated methodological techniques have contributed to an 
improvement in the measurement of the social processes involved. However, researchers 
have focused almost exclusively on the context of U.S. urban areas. The organization of 
neighborhoods in the large urban centers of the developing world present a challenge to 
long-held assumptions regarding the effect of community characteristics on crime. In this 
article we have examined the effect of neighborhood organization on crime and 
perceptions of crime in a new social context, namely that of contemporary urban Brazil. 
Contrary to the assumptions of social disorganization theory, disadvantaged 
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Table 7: Coefficients from the Multilevel Logit Regression of Knowledge of Homicides in 





Neighborhood 

Model 1 
-2.903** 

Intercept (.758) 

Neighborhood-level predictors 

7 901* 
Social Cohesion (.448) 

.027 

Organizational Participation (.232) 
638" 

Neighborhood Disadvantage (.143) 

.004 

Residential Stability (.007) 

013 

Female-Headed Households (.010) 
coat 

Homicide in 2001 (.255) 

-.267 

Favela (.281) 
1.005** 

Violent Favela (.309) 

Individual-level predictors 

-.069 

Female (079) 

-.055 

Married (.088) 

142 

Homeowner (.097) 

.010 

Non-White (.089) 
-.014** 

-.002 

Years in Neighborhood (.004) 

014 

SES (.057) 


a 


Note: Numbers in parentheses are standard errors. 


*p <.05 **p <.01 (two-tailed tests) 


1746 © Social Forces Volume 84, Number 3 * March 2006 


Table 8: Coefficients from the Multilevel Regressions of Perceived Risk of Homicide on 
Knowledge of Homicide in Neighborhood 





Model 1 Model 2 
Intercept ‘oo .843** 
(.253) (.250) 
Neighborhood-level predictors 
eed aoe 
Social Cohesion (.136) (.135) 
-.191 -.197 
Neighborhood Disorder (.145) (.143) 
138" oo 
Organizational Participation (.070) (.069) 
.085* .048 
Neighborhood Disadvantage (.038) (.038) 
-.002 -.002 
Residential Stability (.002) (.002) 
-.001 -.002 
Female-Headed Households (.003) (.003) 
.008* .005 
Homicide Rate (2000-01) (.004) (.004) 
Individual-level predictors 
343™ 
Knowledge of Homicide (.035) 
shin aie 
Female (.030) (.029) 
-.003 -.001 
Married (.032) (.032) 
-.057 -.064 
Homeowner (.035) (.035) 
.032 .033 
Non-White (.033) (.033) 
-.004** -.003** 
Age (.001) (.001) 
.001 .001 
Years in Neighborhood (.001) (.001) 
; -.049** -.051** 
SES (.019) (.019) 
.241** kas 
Victim of Robbery (.083) (.082) 
420** 394** 
Victim of Assault (.074) (.073) 


——<—$——— 
Note: Numbers in parentheses are standard errors. 
*p<.05 **p <.01 (two-tailed tests) 
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neighborhoods in Brazil were found to have higher levels of social cohesion among 
residents. We argued that this high level of social cohesion is the result of the specific 
social process by which low-income areas were first settled as well as the strategies of 
economic survival of the urban poor, which include among other things a greater participation 
in informal markets. 

Despite the high level of social integration among residents, disadvantaged 
neighborhoods in Brazil continue to have high victimization rates. In fact, our analysis 
revealed no statistical relation between social cohesion and neighborhood crime. This 
finding contradicts social disorganization theory, at least in its systemic reformulation 
according to which a community's capacity to regulate behavior depends on the density 
of social networks (Bursik 1999). However, a growing number of studies in the United 
States also suggest that social networks by themselves are not enough to generate social 
control and prevent crime (Bursik 1999; Morenoff, Sampson and Raudenbush 2001; 
Pattillo 1998; Wilson 1996). As Sampson and his collaborators have argued, social control 
does not only depend on social ties, but also on a shared sense of trust and willingness to 
intervene (Sampson, Morenoff and Earls 1999; Sampson, Raudenbush and Earls 1997). 
Although the specific survey questions used to construct our measure of social cohesion 
tap into residents’ sense of trust and guardianship of others’ property, it is unclear whether 
our non-significant findings with regards to the effect of cohesion on crime are due to 
measurement differences with Sampson and his collaborator’s index of “collective 
efficacy” or reflect an entirely different social process at work in Brazilian neighborhoods. 

Research by Pattillo (1998) and others suggests another possible explanation for why 
greater social cohesion is not associated with lower neighborhood crime rates: when 
residents who violate the law are tightly integrated into community social networks other 
residents may be less willing to intervene against them. Social cohesion may therefore 
result in a greater tolerance for criminal activity (Browning, Dietz and Feinberg 2000). More 
research into the social networks of Brazilian neighborhoods with high crime rates is 
clearly needed before we can arrive at this conclusion. In contrast to the negative findings 
with regards to the effect of social cohesion on crime, neighborhood physical and social 
disorder was found to be significantly associated with higher levels of assault and 
homicide, even after individual-level predictors of crime were taken into account. These 
results are consistent with a large literature in the United States, and suggest a more 
universal causal connection. 

While social cohesion seems to have no reducing effect on criminal victimization, It 
nevertheless increases the perceptions among neighborhood residents that they are at 
risk.of being victimized according to the results of the second part of our analysis. We 
attributed this perception of risk to a greater level of communication regarding incidents 
of crime occurring in more cohesive neighborhoods where residents interact more 
frequently with each other. However ungrounded such perceptions of risk may be on 
actual victimization rates, they may nevertheless have important consequences of their 
own. Fear of crime has been blamed for a rise in private security guards as well as the 
flight of affluent Brazilians to gated communities resulting in a rise in income segregation 
(Caldeira 1996, 2000). Because social cohesion among neighborhood residents 
encourages greater perceptions of risk, it may be partly responsible for these social 
problems. The findings presented here should therefore serve as a warning against overly 
optimistic conclusions regarding the beneficial effects of greater social cohesion. 
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Notes 


1. In his comparison of new development areas on both sides of the U.S.-Mexican border, 
Ward (1999: 165-198) finds a much greater level of integration among residents in the poor 
Mexican communities than among residents of communities of similar socioeconomic 
background on the U.S. side of the border. He argues that the common experience of 
settling in illegally-occupied lands, and then defending and upgrading these settlements 
leads to the formation of stronger bonds between members of these Mexican communities. 


2. Although they lead to opposite predictions with regards to the effect of neighborhood 
disadvantage in this specific instance, social disorganization and social disorder theories 
of crime are not strictly competing theories. Because they invoke different mediating 
neighborhood variables both could be true. We thank the anonymous reviewers for this 
observation. 


3. Responses to all items are entered into a three-level model similar to that presented in a 
section below where level 1 may be thought to capture the within-person variation in 
assessments of social cohesion, disorder and perceived risk of crime, and level 2 and 3 
capture the within- and between-neighborhood variation respectively. For a detailed 
description of the method, see Raudenbush, Rowan and Kang (1991). 


4. Responses included church-based organizations, as well as residents’ associations, 
political groups, community policing councils, samba schools and others. In all, 85 
percent of neighborhoods were found to have at least one organization dealing with 
violence. Respondents were asked whether such an organization existed, not whether 
they participated in it. The variable was dichotomized because it is impossible to ascertain 
whether a greater percentage of respondents reflects a greater degree of involvement or 
simply a greater degree of awareness of such organizations. 


5. Unfortunately, there is no official poverty line in Brazil, so we are forced to create our own 
measure. Income levels at the census-tract level are reported in multiples of minimum 
wages earned by the head of household. A threshold of two minimum wages 
corresponds roughly to international standards for poverty. For example, the World Bank 
(2002) defines individuals living on less than 1 or 2 dollars per day as poor. Using the 
average household size for Belo Horizonte and the exchange rate at the time of the survey, 
the threshold of two minimum wages corresponds roughly to an income of $1.60 per 
person, midway between the two standards used by the World Bank. Second, using our 
criteria we find that slightly more than 38 percent of households in Belo Horizonte are 
poor. This overall figure corresponds roughly with that found in a more detailed study of 
poverty by Rocha (2003). Using other sources, Rocha estimates that 33.5 percent of 
residents of Belo Horizonte live in poverty. 


6. While the racial composition of a neighborhood is conceptually distinct from poverty, 
which has been the focus of our theoretical discussion, almost all the substantive findings 
discussed below are the same when the percentage of low-income households are used 
in the regression models instead of our index of neighborhood disadvantage. The only 
difference arises in the regression models examining knowledge of a homicide (tables 7 
and 8). When the percentage of low-income households is entered instead of our index of 
neighborhood disadvantage in those models the significance level of the coefficient for 
social cohesion drops slightly below .05, unless robust standard errors are used. 


10. 
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The survey question does not allow us to fully distinguish between assault and attempted 
assault, and between robbery and attempted robbery. 


The inclusion of the prior crime rate may introduce a problem of endogeneity in the 
regression models because both the current and prior crime rates may be functions of the 
remaining set of predictors. To strike a balance between the need to control for a possible 
feedback effect of crime on neighborhood social cohesion and disorder and the risk of 
endogeneity in the multilevel models, we decided to include the prior crime rate as a predictor 
only in those models where social cohesion or neighborhood disorder are explicitly modeled 
as dependent variables (i.e., tables 3 and 6 below). However, we also tested models of 
criminal victimization (tables 4 and 5 below) with the prior crime rate as a predictor. In every 
case, the inclusion of the prior crime rate failed to alter any of the findings discussed. 


Recent neighborhood studies have emphasized the importance of the spatial 
interdependence of crime and other neighborhood characteristics across neighborhood 
boundaries (Morenoff, Sampson and Raudenbush 2001; Sampson, Morenoff and Earls 
1999). Because very few neighborhoods in our sample are contiguous, there is very little 
risk of dependence across boundaries. 


Values for respondents’ income were imputed for 211 missing cases (5.6 percent of the 
sample) using the remaining individual-level predictors and respondents’ employment 
status as predictors in an OLS regression. Although the imputed values resulted in a larger 
sample size, they did not substantially alter the results presented below. 


. The software used to test the multilevel models (HLM 5.04).does not allow neighborhood 


sampling weights to be used in non-linear and three-level regression models (Raudenbush 
et al., 2001). The coefficients of the two fave/a dummies in the regression results cannot 
be interpreted in meaningful terms because they are included merely to compensate for 
oversampling. As discussed in the results section below, introducing the fave/a dummy 
variables has very little effect. In addition, similar models using only two levels with 
sampling weights (omitting the measurement model) yielded similar results. 


As stated above, the coefficients for the dummies for fave/as and violent favelas cannot be 
interpreted in substantive terms because they are meant to control for oversampling of 
these types of neighborhoods. 


Because the statistical package used to test these models allows for neighborhood-level 
sampling weights, the fave/a dummy variables are no longer necessary and are therefore 
also removed from the regression models. 


In order to conserve space, a detailed explanation of Raudenbush and Sampson's (1999b) 
method is omitted. The reader is referred to that original article for details. It is important 
to emphasize that in this method social cohesion is assumed to be an endogenous 
variable and is not simply included with the remaining predictors. 


Neighborhood disorder also failed to have a significant effect on perceived risk when it 
was introduced without social cohesion in separate models not presented. 


16. The average response to the perceived risk of being gravely wounded or killed on purpose 


and by accident was used as the dependent variable. 
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Differential Participation and the Nature of a Movement: 
A Study of the 1999 Anti-U.S. Beijing Student 
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Abstract 

In this article, we develop a quadripartite classification of social movement participants 
based on the participants’ levels of interest in a movement goal and the amount of 
resources that the participants contribute to the movement. We use the latent class model 
to determine the percentage of participants in each of the four categories in our 
classification system for the case of the 1999 anti-U.S. student demonstrations in Beijing in 
the aftermath of the Belgrade Embassy bombing. Finally, we apply the multinomial logit 
regression to investigate the mechanisms behind the mobilization of different types of 
participants during the anti-U.S. demonstrations. Our analysis and empirical findings not 
only show the importance of studying movement mobilization based on a classification of 
movement participants, but also reveal much about the nature of the 1999 anti-US. 
demonstrations. 


In his 1978 classic, Tilly (1978: 87-90) categorizes movement participants into four ideal types: 
zealots, misers, opportunists and run-of-the-mill participants. Based on this classification 
system, Tilly analyzes the mobilization potentials for each type. According to Tilly, zealots will 
spend up to the limit of their resources to fight for certain collective goods. Misers, however, 
value their resources so highly that they join a social movement only when they see a great 
return for their action. Run-of-the-mill contenders resemble the zealots except that they are 
unwilling to pay more than what they expect to get. Finally, opportunists will join a social 
movement as long as they see some personal benefit. 

Tilly's classification of social movement participants is formally defined, not empirically 
derived. His classification and analysis demonstrate that social movement participants differ in 
terms of their levels of involvement in a movement and commitment to the movement goal, 
and that different mechanisms are involved in mobilizing different kinds of movement 
participants. Given these widely acknowledged differences, one may expect that categorizing 
movement participants and examining their composition in a movement would be a major area 
of inquiry in studies of social movement mobilization. This is, however, not the case. Since the 
1970s, with a few exceptions (e.g., McAdam 1986; Passy and Giugni 2001), most empirical 
studies of movement mobilization have focused on how the recruitment process has been 
facilitated or impaired by various conditions such as resources (McAdam and Zald 1973, 1977), 
interpersonal networks and organizations (Fernandez and McAdam 1988; Gould 1991; 
McAdam and Paulsen 1993; Snow et al. 1980), the built environment and the associated spatial 
routines of the potential participants (Gould 1995; Sewell 2001; Tilly 2003; Zhao 1998, 2001), 
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and frames of movement organizers or the media (Benford 1997; Gamson and Meyer 1996; 
Gamson and Wolfsfeld 1993; Molotch 1979; Snow et al. 1986). Yet, because different kinds 
of movement participants may have been recruited into a social movement through different 
mechanisms, without first finding ways to classify and identify different types of social 
movement participants, it will be difficult for empirically-based movement mobilization 
research to develop further after its early success during the 80s and 90s. 

It is not that movement scholars after Tilly have not analyzed social movement mobilization 
by categorizing movement participants. For example, Oliver et al. (1985) and Marwell and 
Oliver (1993) divide movement participants between critical mass and other participants in 
order to understand the roles that the critical mass play in the early stage of a movement. Kim 
and Bearman (1997) base their analysis on a tripartite classification of movement participants 
(critical mass, followers and latecomers), and their simulation model shows that these groups 
differ in terms of the timing when they join a movement and their interest level in the 
movement goal. Yet while these studies have generated great insights, their classification of 
movement participants, like Tilly's analysis, is theoretically defined, not empirically derived. 
However important and central their classification systems, how to identify different types of 
participants in a real life social movement remains an issue. We all accept that individuals 
participate in social movements with different reasons and that the study of differential 
participation is crucial for our understanding of a social movement, so it is surprising that 
almost no systematic empirical efforts have sought to identify and classify movement 
participants in a concrete movement setting and understand the nature of the movement 
mobilization based on an empirically derived categorization of movement participants. This 
article aims to take a step in filling the gap. Empirically, it seeks to understand the nature of 
the 1999 anti-U.S. student demonstration in Beijing triggered by the U.S. bombing of China's 
Belgrade Embassy by identifying different kinds of participants in the protest and 
investigating their motivations in participating in this protest. Theoretically, it represents 
perhaps the first systematic attempt in social movement research to establish an objectively 
measurable system of classification of movement participants and to analyze the nature of a 
movement based on that classification. 

After a section on the development of the 1999 anti-U.S. demonstration and a 
methodology section, this article proceeds as follows. First, we propose a quadripartite 
typology of social movement participants based on the participants’ levels of interest in the 
movement goal and the amount of resources that the participants contribute to the 
movement. Then, using the latent class model, we try to verify the validity of the quadripartite 
typology by applying it to the case of the 1999 anti-U.S. protest and quantifying each of the 
four categories of movement participants. Finally, we analyze the mechanisms behind the 
mobilization of different kinds of participants and the nature of the anti-U.S. demonstrations. 


The Anti-U.S. Demonstration 


On the night of May 7, 1999, more than 40 days after the NATO bombing of Yugoslavia, a B- 
2 stealth bomber launched from the United States fired five missiles at China's Belgrade 
Embassy. Three journalists were killed and more than 20 Chinese diplomats were wounded. 
When the news reached China, anti-U.S. demonstrations surged in major Chinese cities. In 
Beijing alone, between May 8 and 11, thousands of students protested in front of the U.S. 
Embassy. They shouted anti-American slogans, burned American flags and pelted the U.S. 
embassy with rocks and bottles. 

Before the embassy bombing, U.S.-China relations had already experienced a major 
change. In the United States, the collapse of the Soviet Union in conjunction with China’s 
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brutal repression of the 1989 Pro-democracy Movement fundamentally changed China's 
geopolitical importance to the United States as well as Americans’ view of the Chinese 
regime. The U.S. government's attitude toward China turned negative as a result. In China, to 
boost its falling legitimacy after the 1989 Pro-democracy Movement, the government staged 
Patriotic Education Campaigns (Zhao 1998) and deepened its market-oriented reforms. The 
combined effect of the Patriotic Education Campaigns, the success of the economic reform 
and the worsening of the U.S.-China relations provoked anti-American feelings among the 
Chinese. Although these sentiments were exaggerated in the U.S. media, it is certainly true 
that most Chinese in the 1990s no longer held a rosy picture about the United States as they 
had during the 1980s (Gries 2001; Unger 1996; Zhao 2002; Zheng 1999). 

Perhaps because of the above developments in the U.S.-China relations, the 1999 
demonstrations were seen in the U.S. media as an act that was instigated by the Chinese 
government (e.g., Dionne 1999; Forney and Johnson 1999; Kolatch 1999; Lawrence and 
Crispin 1999). The reality, however, was more complicated. The Chinese government 
conducted patriotic campaigns to teach the students to accept the communist government 
as the representative of the nation and national interests (Wang 2000; Zhao 1998); 
spontaneous mass-based nationalistic activities were not part of these campaigns because 
such activities posed a potential challenge to the authoritarian rule.’ During the 1999 anti-U.S. 
demonstrations, however, the Chinese government not only allowed the students to 
demonstrate but also asked the university authorities to organize the protest activities. A 
senior official from a major university (informant no. 62) provided an insider's story. The 
Chinese government was very concerned about the possibility of losing control should a 
large-scale anti-U.S. demonstration break out,? he said, meanwhile government officials knew 
that they had to allow students to demonstrate this time. This is partly because, in 1998, the 
Chinese government repressed a student demonstration triggered by the rape of ethnic 
Chinese women in the Indonesia riot. The government knew that many Beijing students were 
still very upset about the repression. The government also knew that in the eyes of most 
students, the embassy bombing was an incident that was much more serious than the 
violence in Indonesia.’ The government was keenly aware that the students were likely to 
rebel if they were not permitted to demonstrate under such circumstances.* For these 
reasons the government permitted the protest, but tried to manage the event. 

Two other motives were also behind the Chinese government's decision. As the 10" 
anniversary of the 1989 Prodemocracy Movement drew near, the government was concerned 
that the demonstrations would turn into anti-government protests. As several informants (nos. 
39, 41 and 47) recalled during the anti-American demonstrations, some protesters brought up 
the 1989 Movement and others denegrated various government policies. The government was 
also worried that the demonstrators would use excessive violence, which could hurt China's 
economy. During the entire event, demonstrations organized by university authorities were 
much more peaceful than those that were spontaneously initiated. University authorities 
sought to prevent radical actions against American nationals and businesses (no. 61). 

Against this background, on the morning of May 8, the government-sponsored student 
unions and the Communist Youth Leagues in many universities started to prepare for the 
demonstration. That afternoon, several thousand students from 11 Beijing universities boarded 
buses prepared by the student unions to take them to the U.S. and British embassies. 

The demonstrations, however, did not unfold as the government had hoped. Even before 
the government-sponsored demonstration took place, some students posted big-character 
posters on campus and messages on the universities’ electronic bulletin boards. When the 
government-sponsored mobilization started, their efforts actually met with strong resistance 
from some students, who later told us that their past experiences with the state had made 
them suspicious of the government's involvement in the protests against the United States. 
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These students avoided any demonstration controlled by university authorities. Instead, they 
initiated independent demonstrations. By the evening of May 8, more and more protests 
organized by the students broke out. These protests resembled more of a social movement 
like the mass uprisings that marked Mao’s China. Our survey shows that, between May 8 and 
11, a total of 38.1 percent of our respondents had participated in the anti-U.S. demonstrations. 


Data and Methods 


The research was conducted in Beijing between August and September 1999. During the 
research, we interviewed 62 individuals and surveyed 1,211 students. Our 62 informants 
include one high-level university official and one former university student. Otherwise all our 
interviews and surveys were conducted in three major universities in Beijing: Beijing 
University, Qinghua University and People’s University. The three universities were selected 
because they are among the largest and most prestigious with a history of influencing other 
universities and Chinese politics in general. Earlier studies have shown that these three 
universities had a higher level of student activism than most other universities during the 1989 
Prodemocracy Movement (Zhao 2001). They are the vanguard of student activism in China. 

The informants were recruited mainly through a snowball method: after each interview, 
the informant was asked to suggest friends or acquaintances who might be willing to 
participate. To ensure the representativeness of informants and to maintain efficiency in data 
collection, we would decide whether a suggested candidate would be contacted. We tried to 
avoid interviewing two students from the same dormitory room. We also tried to balance the 
number of undergraduate and graduate informants in the sample. We failed to make 
arrangements with several potential informants due to scheduling conflicts. No student, 
however, refused to be interviewed after he or she was contacted. 

The interviews with informants were semi-structured and audiotaped. Before an interview, 
the informant was assured of anonymity. We started the interview by asking the informant to 
recall as detailed as possible his/her personal experiences during the 1999 anti-U.S. 
demonstrations, in particular with regard to what he/she did and saw during that period. Then 
the interview proceeded variously according to the informant's narratives. Meanwhile, we 
always probed into a few issues that we saw as of vital importance: the sources from which 
the informant learned the bombing-related news, the informant's personal reaction to the 
embassy bombing, the informant's personal activities during the demonstrations, the 
mobilization process and the route of a protest in which the informant participated, the role 
played by university authorities in the protest in which the informant was involved, the events 
or big-character posters that most impressed the informant during the protests. 

For the survey of students, we designed a stratified random sample. The survey procedure 
was adopted because most students in these three universities live on campus. In Beijing 
University, for example, the students were living in 35 on-campus dormitory buildings. With 
between six and eight undergraduate students living in each dormitory room, these dormitory 
buildings house more than 10,000 students. At Beijing University (the sampling in the other 
two universities followed a similar procedure), about 400 questionnaires were evenly 
distributed to the regular student buildings (between 11 and 12 samples in each building). 
Because most of these dormitories are five- to six-story buildings, we basically took two 
samples on each floor. Then, to ensure that each student had an equal chance to be surveyed, 
we randomly drew two dormitory rooms on each floor and one student from each of the two 
selected rooms. If a student refused to participate, we would randomly choose another one 
in the same dormitory room. Should the second student also refuse to participate, we would 
select another room following the above procedure. In reality, however, very few students 
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refused to participate in the survey. Once a student was selected, he/she was asked to fill out 
the questionnaire right away. Our assistants would return to his/her dormitory room soon after 
to pick up the finished questionnaire. The return rate for this study is thus 100 percent. 

Our sampling method introduced certain biases. For example, our sample missed married 
graduate students who rented apartments off campus. As a rough test of how representative 
of the entire student population, we were able to compare characteristics of those in the 
sample with students at Beijing University during the 2000/2001 academic year. In general, 
the make-up of the sampled students highly resembles that of the entire student population 
at Beijing University. 

We adopt latent class analysis and multinomial logit regression model as the statistical 
tools for our analysis (Hagenaars 1993; Hagenaars and McCutcheon 2002; McCutcheon 
1987; Yamaguchi 2001). While multinomial logit regression has already been widely used in 
sociology in analyzing nominal dependent variable with more than two categories, latent class 
model has only begun to appear in sociological analysis in recent years (Silverstein and 
Bengtson 1997; Evens and Mills 1998; Macmillan and Gartner 1999; Gerber 2000; D’Unger et 
al. 1998; Yamaguchi 2000). 

We use the latent class model to analyze our data because we want to classify the 
movement participants based on the survey result, yet we do not want to use the 
respondents’ self-identification as the basis of categorization. If we do so, the respondents 
may tend to identify themselves with people in those positively regarded categories, which 
will undermine the study's validity. To avoid the problem, our strategy is to rely on the 
respondents’ answers to some indirect but more objectively measurable statements in the 
survey, then to classify them based on their answers. 


A Typology of Differential Participation 


In the social movement literature, movement participants are frequently labeled, usually in a 
casual manner, as leaders, activists, radicals, opportunists, rank-and-file participants, 
adherents and so on. Behind these commonly used terms, we see two criteria that are often 
used to classify the movement participants: the level of commitment to the movement goal, 
which refers to a participant's commitment to the ideologies or values of a movement, and 
the level of movement involvement, which refers to the amount of resources that a participant 
invests in movement activities. Here resource is broadly defined to include time, money, 
organizational skills, influence and other human/social capitals of the participant. A movement 
participant is labeled as an activist if he or she deeply commits to the movement goal and also 
plays a major role in organizing the movement activities. On the other hand, a movement 
participant is called a rank-and-file member if his or her level of involvement in a movement 
and commitment to the movement are both considerably lower than that of movement 
leaders and activists. Figure 1 illustrates the combination of the commitment and involvement 
dimensions, which result in a quadripartite typology of movement activists, followers, 
opportunists and uncommitted participants. 

Movement activists are highly committed to the movement goal and maintain a high level 
of involvement in movement activities. They play a crucial role in the initial stages of a 
movement by identifying the problems, recognizing and acting on opportunities, providing 
examples of action and converting potential conditions for mobilization into an actual social 
movement (McAdam 1982; Morris 2000). Movement activists’ involvement in consciousness- 
raising activities may greatly increase the level of interest of other people toward the 
movement goal. They offer frames, tactics and organizational vehicles that allow participants 
to actively construct collective identity and participate in collective action at various levels 
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Figure 1. Four Ideal Types of Movement Participants? 
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(Benford and Snow 2000; Morris 2000; Oberschall 1973; Smelser 1962; Snow et al. 1986). If 
their mobilization efforts are successful, a cascade of activism will occur and more and more 
less-committed members will be recruited into the movement (Kim and Bearman 1997). 
Identifying the movement activists and their activities in a movement is crucial to 
understanding the nature of a social movement. 

Movement followers endorse the movement goal, but they bring fewer resources to a 
movement. Compared to activists, they are usually more numerous but play a less important 
role in promoting the movement goal; they also devote considerably less time, money and 
other resources to movement activities. As implied by the label, followers join the movement 
later when the early actions of movement activists have lowered the cost of joining. In the 
process of a movement, some participants will continue to be followers throughout, while 
others will grow stronger identities through their personal participation and their contacts with 
activists (Calhoun 1994; Tilly 2002). Such identity shifts are crucial to sustain a movement (Kim 
and Bearman 1997; McAdam 1982; Snow et al. 1986). 

Uncommitted participants are those who have lower levels of interest in the movement 
goal but, nevertheless, participate. This is a category of participants that has so far received 
little attention in the literature. However, they are everywhere in a real life social movement. In 
comparison with followers, they bring even fewer resources to movement activities and they 
usually don't understand or commit to the movement goals. In many cases, uncommitted 
participants are motivated to join a movement by some personal grievances unrelated to the 
movement goal. Because of this, uncommitted participants are more likely to have a 
significant presence in low-risk and low-cost forms of movements (McAdam 1986). When a 
social movement can overcome its initial hurdles of development, lower the cost of 
movement participation, and create a public consensus that movement participation is an 
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honorable and even a fun activity, uncommitted participants may also flood into a movement 
(Kuran 1991; Zhao 2001). This is a situation that a Chinese proverb vividly captures: “When a 
wall is about to collapse, everybody starts to push it.” Also, when individuals are bound by 
some dense ecological/network conditions to the point that they have to participate in a 
movement in order to maintain their membership in a community, they may also join a social 
movement as uncommitted participants (Gould 1998, 1995; McAdam 1986; Zhao 1998). ° 

Opportunists have a lower commitment toward a movement goal, but at the same time 
can be highly involved in movement activities. When opportunists figure that the movement 
will bring them benefits beyond the movement goals, they will support and join a movement; 
when their personal interests are threatened by the movement activities, they thwart the 
movement. In other words, they are actively involved in a movement because they are 
interested in some private goods associated with the movement participation. Such private 
goods could mean political ambition, monetary benefits unrelated to the movement goal, or 
even an action in order to please some authorities who can confer status. By bringing more 
resources into the movement, opportunists can facilitate movement mobilization. On the 
other hand, they can also withdraw or even invest their resources in the opposing side if 
circumstances change. A movement that involves a large number of opportunists, therefore, 
tends to be less stable. Obviously, opportunists are more easily co-opted by the authorities 
than other types of participants. They are usually not interested in radical claims and actions 
unless they see a clear personal benefit. It is generally the case that less radical claims will 
be made in a movement with a high level of involvement by opportunists (Meyer and 
Staggenborg 1996). Thus the presence and percentage of opportunists will greatly affect the 
nature and development of any movement. 


Results and Analyses 


During our interviews, we found that most informants who had participated in the anti-U.S. 
demonstrations could be sorted into one of the four above-defined categories, even though 
our interview data do not allow us to pin down the exact percentages of the participants in 
each of the four categories. Therefore, relying on our survey data, we will classify the 
participants of the anti-U.S. demonstrations, examine the exact percentages of the 
participants in each of the four categories, and investigate what had motivated each of these 
four groups. 


Identifying the Types of Participants 


The latent class model is used to identify the four types of movement participants during the 
1999 anti-U.S. demonstrations. A participant's commitment toward movement goal and level 
of involvement in the movement are hard to measure directly. However, it is reasonable to 
assume that people who have stronger grievances/interests concerning the problems that a 
movement tries to address are more likely to have a higher level of commitment toward the 
movement goal. The amount of time, money and manpower that an individual spends on a 
social movement will reflect that individual's level of commitment. We believe that each of the 
observed indicators is caused by an unobserved, or latent, variable of interest; we expect 
covariation among the observed measures; and we study the patterns of interrelationships 
among the observed indicators to understand and characterize the underlying latent variable. 

Figure 2 is a path diagram of the latent class model on the typology of movement 
participants. Two kinds of variables exist in latent class models: manifest variables can be 
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directly observed while latent variables are those that cannot. Therefore, manifest variables 
are indicators of the latent variables. In the latent class model, both manifest and latent 
variables are treated as categorical variables. In our quadripartite typology of social movement 
participants, “commitment to the movement goal” and “level of involvement” are both latent 
variables. Q1, Q2, Q3 are indicators for the latent variable “commitment;” Q4, Q5, Q6 are 
indicators for the latent variable “level of involvement.” 


Figure 2. A Latent Class Model with Two Latent Variables 



































LV1: Interest toward Movement Goal 

LV2: Level of Involvement 

Q1: The Chinese Embassy was bombed because China is not strong enough. 

Q2: The bombing of Chinese Embassy was due to China's opposition to 
NATO’s eastern enlargement. 

Q3: U.S. wanted to disturb China by the embassy bombing. 

Q4: | had written posters and slogans for the demonstration. 

Q5: | had actively participated in organizing demonstration(s). 

Q6: | had other forms of protest except joining in the demonstration. 
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Q1, Q2, Q3 are statements regarding the respondents’ opinions about the nature of the 
embassy bombing. Respondents were asked whether they agreed with the following 
Statements: Q1: The Chinese Embassy was bombed because China is not strong enough; 
Q2: The Chinese Embassy was bombed because China had opposed NATO's eastward 
expansion; Q3: The U.S. wanted to induce political instability in China through the embassy 
bombing. The original responses that the survey respondents could choose are “strongly 
agree,” “agree,” “don't know,” “disagree” and “strongly disagree.” In the analysis, we create 
a dichotomous variable by combining “strongly agree” and “agree” into “agree,” and 
“disagree,” “strongly disagree” and “don't know" into “disagree.” These three questions are 
selected because during our interviews they were frequently mentioned by our informants as 
the reasons why the bombing happened. We assume that a survey respondent who agreed 
with each of the three statements would have a higher dissatisfaction toward the United 
States’ China policy. Thus, they were more likely to have stronger commitment toward the 
movement goal. 

Q4, Q5, and O6 are statements regarding the respondents’ promotion behavior. They are 
employed to indicate the respondent's level of involvement in the protest. The three 
statements are O4: | had written posters and slogans for the demonstration; Q5: | had 
actively participated in organizing a demonstration; O6: | had joined other forms of protest 
besides the demonstrations.® We asked the responses to choose “yes” or “no” for these three 
statements, and we assume that the respondent who responded “yes” was likely to have a 
higher level of involvement in movement activities. The cross-classitied frequencies of the six 
variables are listed in Table 1. 


Table 1: Cross-Classified Frequency of Six Manifest Variables 





Q4:Y Q4:N 
Q5:Y Q5:N Q5:Y Q5:N 
O6:Y --Q6:N..Q6:Y~ .Q6:N--—OG:Y -=.06:N: -Q6;¥-—Q6:N 
Q1:A 
Q2:A 
Q3:A 0 { 3 2 { 0 8 11 
Q3:D 0 2 0 2 2 0 { 
Q2:D 
Q3:A 0 0 { 2 0 0 9 8 
Q3:D 2 1 2 1 2 2 5 13 
Q1:D 
Q2:A 
Q3:A { 0 5 2 3 0 4 6 
Q3:D 1 0 4 2 2 1 8 i 
Q2:D 
Q3:A a { 9 6 Z 7 14 26 


Q3:D~ 14 6 24 21 10 11 6Oen« 440 


EEE 
Note: See the text for the definition of variables Q1 through Q6. A = agree; D = Disagree. Y = yes; N = no. 
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We use MLLSA (Maximum Likelihood Latent Structure Analysis) written by the late Clifford 
Clogg for parameter estimation of the latent class models.’ We estimate latent class models 
with one latent class, two latent classes, three latent classes, four latent classes and four 
latent classes with equality constraints. For the four-latent-classes model with equality 
constraints, restrictions on the estimates of conditional probabilities are imposed in order to 
test the hypothesis that has been laid out in figure 1, that is, movement activists and followers 
have similar interest in movement goals, and they differ with respect to their levels of 
involvement in the movement. Likewise, movement activists and opportunists exhibit similar 
levels of involvement; they differ with respect to their interest in the movement goal. 
Opportunists and uncommitted participants are similar in their levels of interest in the 
movement goal, but differ in their levels of involvement. Followers and uncommitted 
participants have similar levels of involvement, but differ with respect to their levels of interest 
in the movement goal. 

Table 2 reports the test results of the model fit. Likelihood ratio chi-squared (L2) is the 
preferred test statistics for the fit of the models. Table 2 shows that the independence model 
(single latent class) and two-latent-classes model fit the model poorly, and can be rejected at 
the .05 alpha level. The models with three latent classes, four latent classes and four latent 
classes with equality restriction all fit the data well. 


Table 2: Test Results of Latent Class Models Without Covariates 


L2 df P 
Independence 153.7196 57 .0000 
Two latent classes 83.3407 50 .0014 
Three latent classes 4e0tl?7 45. 3103 
Four latent classes 40.0117 37 3934 
Four latent classes with equality restriction 53.4187 48  .3163 





Note: N = 449 


The procedure for model selection with latent classes is not yet established.’ However, 
comparing the model with three latent classes and the model with four latent classes, we 
choose the four-class model because it fits our theoretical model. Comparing the model with 
four classes and the model of four classes with equality restriction, because these two 
models are nested with each other, we thus partition the L2 to check the acceptability of 
model restrictions by differentiating respective model 12s. According to the results from table 
2, the addition of the conditional probability generates an acceptable increase in the L2 of 
13.41 (53.4187-40.0117 = 13.4070) with 11 of (48-37 = 11). The increase in 12 is not 
statistically significant (0 = .29), indicating that little fit to the data is lost by accepting the 
equality restriction. Our result thus shows that the four-class model with equality restriction is 
a better model for our data. All our analyses in the following are based on this model. 

The conditional probability estimates and the latent class probability estimates for the 
restricted latent class model of the types of movement participants are reported in table 3. Let 
us first take a look at latent class probability. Latent class probability gives us two pieces of 
information: the number of classes and the proportion of the population in each class. In 
Table 3, we identify four classes of movement participants, among them 10.06 percent are 
of Type I, 25.19 percent are Type Il, 13.51 percent are Type Ill, and 51.22 percent are Type IV. 
Another kind of parameter estimate, conditional probability, enables us to interpret the nature 
of the classes of the latent variables. The first class corresponds most closely to our expected 
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“movement activists;" 52.92 percent of respondents in this class strongly agreed with Q1, 
69.59 percent expressed strong belief in Q2, and 52.09 percent strongly agreed with Q3. Also 
for the three statements that measure the respondents’ levels of involvement in the 
movement, 67.91 percent of these respondents answered “yes” to Q4, 33.97 percent 
answered “yes” to Q5, and 56.47 percent answered “yes” to O6, Thus, these respondents 
expressed strong interest in the movement goal and had actively participated in the anti-U.S. 
protest. Their attitudes and behaviors correspond most closely to the ideal type of 
“movement activists.” 


Table 3: The Proportion and Conditional Probabilities of Responses of Each Latent Class 





LATENT CLASS 1 2 3 4 
(activist) (opportunist) (followers) (uncommitted) 

Proportion .1006 2019 ood 5122 
Response Probabilities: 

Q1 AGREE 5292 .0736 .0292 .0736 
Q1 DISAGREE 4708 .9264 4708 .9264 
Q2 AGREE .6959 .1666 .6959 .1666 
Q2 DISAGREE 3041 8334 3041 8334 
Q3 AGREE 5209 .0617 5209 .0617 
Q3 DISAGREE A791 .9383 4791 .9383 
Q4 Y .6791 6791 .3019 3019 
Q4 N .3209 .3209 6981 6981 
Oe aS 3397 3397 0775 0775 
Q5 N .6603 .6603 9225 9225 
Q5oey 5647 5647 1016 1016 
Q6 N 4353 4353 8984 8984 





Other types of respondents were as follows: Class II corresponds closely to the ideal type 
of “opportunists,” Class Ill to “movement followers,” and Class IV “uncommitted 
participants.” Thus, our survey shows that 10.06 percent of the respondents were activists, 
25.19 percent were opportunists, 13.51 percent were followers, and 51.22 percent were 
uncommitted participants. That means that the majority, 76.41 percent of these students who 
participated in the demonstrations, did not have any serious commitment to anti-U.S. 
ideologies. Our result also shows that the majority of the participants in this protest were 
uncommitted participants. The fact that most of the participants did not have a strong 
commitment to the movement goal tells us a lot about the nature of this anti-U.S. 
demonstration. These percentages, however, don't explain who the students in each group 
were and why they joined the protest. 


Determinants of Differential Participation 
In this section, we assign observations to different latent classes and use the multinomial 


logit regression models to examine the socio-biographic characteristics of the four types of 
participants. This enables us to calculate the differential contribution of each of the four 
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classes to each cell of the observed contingency table, and then, to assign the observations 
in a cell to the latent class with the largest conditional probability. 


Description of the Variables 


In the multinomial logit regression model, the dependent variable is the type of protest 
participant (activist, follower, uncommitted participant and opportunist) plus the non- 
participant. For the independent variables, we include ideational factors, network 
mechanisms and campus environment effects. Based on our understanding of the student 
movements in China in general and this demonstration in particular, we also include the 
following background variables in the regression analysis: student's gender, hometown, self- 
perceived academic standing in the class, CCP membership and whether or not a student 
held a cadre position in the class or in the student union (student cadre membership). Table 
4 presents the descriptive statistics for the variables used in the multinomial logit analysis. 

Two measurements of the ideational factors are included. One is familiarity with some 
major events related to the U.S.-China relations during the 1990s. “Familiarity” consists of the 
average familiarity test scores on five events of U.S.-China relations that were widely covered 
in the Chinese media. ("1" indicates “least familiar” and “5” indicates “very familiar”) These 
events were: (1) America’s opposition to Beijing's bid for the 2000 Olympic Games, (2) The 
Yinhe (“Galaxy”) ship incident in 1993 when it was alleged that the ship was carrying chemical 
weapon precursors to Iran. The ship was searched, but no such items were found. (3) 
Taiwan's president Li Teng-hui’s 1995 visit to the United States, (4) The Chinese espionage 
scandal (the Cox Report) in which former Los Alamos National Laboratory scientist Wen Ho 
Lee has been accused of engaging in espionage for China, (5) China’s illegal U.S. presidential 
donation scandals. Since these events frequently appeared in China’s nationalistic writings as 
evidence of U.S. hostilities toward China, we expect that students familiar with these 
incidents would be more likely to join the anti-U.S. demonstrations. 

The other measurement is whether or not a respondent has read the book China Can Say 
No. This book, authored by Song, Zhang and Qiao (1996), was the first anti-U.S. bestseller in 
China. A major message of the book is that the imperialistic United States has been plotting 
against the rising China in a new cold war. Therefore, it is imperative for China to adopt a 
counter-containment strategy and to stand up and say no to America clearly and loudly. The 
book was published at a time when U.S.-China relations were strained as a result of China's 
repression of the 1989 pro-democracy movement, and China’s booming economy had 
bolstered Chinese confidence. The radical nationalistic mood expressed in the book was 
immediately appealing to some young Chinese. Since its publication, some other anti-U.S. 
books also appeared on China’s bookstands, creating an anti-U.S. wave during the mid to late 
1990s. We expect that the students who had read the book would be more likely to join the 
demonstrations. . 

In Chinese universities, as many as eight undergraduate students or four graduate students 
live in a dormitory room. Roommates for both the undergraduate students and graduate 
students are randomly assigned according to their departments, gender and entrance dates. 
After assignment, roommates seldom change before their graduation. Such a living 
arrangement brings students into a pattern of regular interaction and nurtures strong bonds 
and influences among the roommates. Past studies on Chinese student movements show that 
such a living arrangement greatly enhanced the students’ chances of joining a collective action 
(Chow 1967; Zhao 1998, 2001). In this study, the roommate participation rate variable is 
introduced to measure the effect of the dormitory-based student networks on student 
mobilization. We also introduce another variable — the discussion rate — to measure the impact 
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Table 4: Frequency Distribution of Variables 


SS 


Response Variable: 

Group 
Activists 49 4.12 
Opportunists 56 471 
Followers 47 3.96 
Uncommitted 295 24.83 
Non-participants 741 62.37 

Predictor Variables: 

Familiarity of U.S.-China Relations 
Never Heard Of 32 2.66 
Just Heard Of Zor 19.67 
Somewhat Familiar 473 39.20 
Familiar 349 28.96 
Very Familiar 114 9.46 

Read China Can Say No 
Yes 463 38.33 
No 745 61.67 

Discussion Rate 
Never Rie 1.41 
Very Few Times 35 2.90 
Somewhat Frequently 166 13.74 
Frequently 543 44.95 
Very Frequently 447 37.00 

Roommate Participation Rate .33 (Mean) .96 (Std. Dev.) 

Campus Poster Reading Frequency 
Never 97 8.04 
Very Few Times 131 10.85 
Somewhat Frequently 281 23.28 
Frequently 385 31.90 
Very Frequently 313 20.09 

Cadre 
Yes 429 35.43 
No 782 64.57 

Male 
Yes 883 72.91 
No 328 27.09 

Hometown is Beijing 
Yes 126 10.40 
No 1085 89.60 

Academic Performance 
Far Lower Than The Average 55 4.59 
Lower Than The Average 178 14.74 
Average 405 33.00 
Above The Average 379 31.37 
Far Above The Average 191 15.81 

CCP Membership 
Yes 247 20.40 
No 964 79.60 


cS 
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of some more general student networks on the students’ participation in the anti-U.S. 
demonstrations. The roommate participation rate is measured by dividing the number of the 
respondent's roommates who participated in the protest by the total number of students who 
lived in the room minus the respondent. The discussion rate variable is defined as the amount 
of time that a student spent discussing with other students the issues related to the embassy 
bombing and anti-U.S. demonstrations. Our survey asked the following question: “Did you 
often discuss the embassy bombing event with classmates/roommates/friends?” Responses 
were coded as “1” for “never” and “5" for “very frequently.” 

In Chinese universities, some public spaces are designated for bulletin boards. When a 
demonstration starts, these bulletin boards immediately become the site of big-character 
posters and protest announcements. During the days of the anti-U.S. demonstrations, these 
bulletin boards were full of posters regarding the bombing, including photos of the bloody 
scenes of the bombed embassy, opinion essays, slogans, etc. By visiting these spaces and 
reading the posters, students became familiar with the issues related to the bombing and the 
protests, and might sympathize with the movement goal. Students were also more likely to 
learn when and where a demonstration would be staged. Therefore, we introduce the “campus 
poster reading frequency” variable with the expectation that the students who spent more time 
reading posters were more likely to join the protest. The “campus poster reading frequency” is 
a five-point index measuring the time that a respondent spent reading posters. Respondents 
were asked the following question: “Did you follow the embassy bombing event by reading the 
campus posters?” Responses were coded as “1” for “never” and “5” for “very frequently.” 

During the 1999 demonstrations, the Chinese government-sponsored Communist Youth 
League and the student unions were ordered by university authorities to lead the movement. 
Even though the government's efforts were not entirely successful (Zhao 2003), such efforts did 
engage many student cadres in the protest. We include the “student cadre membership” 
variable in the analysis to examine the level of involvement of the government-sponsored 
organizations in the protest. The effect of student cadres in the movement is measured by a 
dummy variable with “1” for being a member of a student cadre and “0” for not being a member. 


Results 


Table 5 contains the results from the multinomial logit regression analysis. As shown in the first 
four columns, the variable that has the strongest impact on our dependent variable is the 
roommate participation rate. Our result indicates that for any kind of protest participant, whether 
he or she was an activist, follower, uncommitted participant or opportunist, the chances of 
participation in the demonstrations greatly increased with a rising roommate participation rate. 
However, looking at the rest of the columns, roommate participation rate has no significant 
effect on the contrasts among any two of the four types of participants. In other words, 
roommate participation rate does not predict which form of participation a student would take. 

To further explore the roommate effect, we plot the five predicted probabilities against 
different values of roommate participation rate in Figure 3. Some results are obvious. For 
example, the figure shows that when the roommate participation rate rises, the probability for 
a student to be a non-participant sharply declines. What is interesting and not entirely obvious 
is that with the increase in the roommate participation rate, the probability for a protest 
participant to be an uncommitted one increases dramatically and the probability for that 
participant to be a leader, follower or opportunist increases only slightly. In fact, as the figure 
shows, when the roommate participation rate of a particular dormitory room reaches 90 
percent, the probability for a protest participant in that dormitory room to be an uncommitted 
participant is almost 70 percent.’ 
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It is clear that the roommate participation rate is a major factor contributing to the high rate 
of uncommitted participants in the anti-U.S. demonstration. But the question is, with no 
Strong anti-U.S. feelings, why did these students join the protest? An immediate finding of our 
Survey is that, among those who had participated in the protest, 83 percent agreed that other’ 
students had the right not to participate in the protest. With this kind of majority attitude, we 
do not see any coercion among the roommates in joining the protest as Zhao (1998) found 
during the 1989 Prodemocracy Movement. Some of our interview results, however, do give 
us a clue. During the interviews, some informants (nos. 11, 39, 43) claimed that they joined 
the demonstration just because they wanted to have a demonstration experience. As 
informant (no. 39) put it: 


/ heard the news of the embassy bombing-trom the radio around 11 
a.m. on May 8". Initially, | did not feel anything special. | felt as if the 
U.S. had bombed one more building in Yugoslavia... Around 3 p.m. | 
walked by one school building and saw a couple of big buses waiting 
there. | was told that the Communist Youth League was organizing 
students to go on demonstrations. Some banners with slogans, such 
as “Down with the U.S. Imperialism,” were also put up. | went back to 
my dorm, thinking of whether or not | should go. | told my roommate 
about it. He was very eager to go. He said that it was very rare to have . 
such a demonstration experience during our four-year college life and 
we'd better go. So / went with him. 


During the days of the anti-U.S. demonstrations, some students discussed the bombing- 
related issues with their roommates and friends. Some of them even invited their roommates 
and friends to watch or even to attend some of the demonstrations together.? Due to this kind 
of discussion and invitation, many students who did not have strong anti-U.S. feelings 
participated in the protests merely to experience a demonstration or to just “watch for fun.” 
This is what another informant (no. 9) recalled: 


That night (May 8") there were about one or even two thousand 
students marching on campus. They were marching around and 
trying to get more and more people involved. Some of my friends 
and | were taking a walk inside the campus and saw a demonstration 
in the dormitory area."® | thought they were just demonstrating on 
campus to express their anger, so we joined them. We felt very 
curious and wanted to watch for fun. We then marched in the 
dormitory areas, making noises and urging more people to join us. 
The crowd then headed to the teaching building areas to call more 
students who were studying there. We also marched in front of the 
library afterward. When the marching formation had swelled into 
thousands of students, someone proposed to demonstrate in front of 
the US. Embassy. We all agreed and / went with them. 


The predominance of uncommitted participants during the 1999 anti-U.S. demonstrations 
in conjunction with the strong roommate effect show clearly that rather than demonstrations 
of mass-based anti-U.S. nationalism in China, this event was more likely a short-term reaction 
to the embassy bombing facilitated by the campus environment. Of course, we need to 
acknowledge the role of the Chinese government as well. By permitting the protest, the 
government greatly lowered the risk and cost of participation. 
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In addition to the roommate participation rate, student cadre membership proved to be an 
indicator of the type of protestor. Members of this group are more likely to be opportunists 
than non-participants, uncommitted participants or followers. The probability of being an 
opportunist is 2.6 percent higher for a cadre than a non-cadre. To contain the movement, 
university authorities asked the Communist Youth League and student unions to become 
involved. As a result, many student cadres actively participated in the protest not because of 
strong anti-U.S. feelings, but because they were directed to do so. The existence of a higher 
percentage of opportunists clearly reflected the role of university officials such as No. 62: 


Around noontime on May 8" our students’ government was notified 
to organize the demonstrations in the U.S. Embassy area. We then 
started to prepare for them immediately. We first notified the student 
government in each department, asking them to spread the word and 
assemble students who wanted to demonstrate. We also prepared 
slogans and posters and provided buses to transport them to the U.S. 
Embassy. 


Our studies also show that many anti-U.S. demonstrations at that time were 
spontaneously initiated by the students. Even so, university authorities still tried to take 
control. This is what a student union leader (no. 21) recalled: 


In the days when the anti-U.S. demonstrations were going on, we sent 
student government cadres to every corner of our campus. As soon as 
they saw the formation of a crowd of several hundred students, they 
called us immediately... In those days, we stayed at the offices of the 
student government all day and night to make sure that we would be 
there whenever there was a need. /n those days, we did not have 
much rest. All the Chinese were very upset about the bombing, and the 
students wanted to unleash their anger toward the bombing. To protect 
the students and to keep the demonstrations in a better order, the 
cadres in the graduate and undergraduate student government 
decided to follow the students when some demonstration activities 
were initiated. In the process, we tried to take contro/ of the 
demonstration by controlling such resources as the university flags and 
by encouraging students to shout slogans that we had prepared. 


We need to be cautious with many of the claims that this student cadre makes. As the 
recollections of other informants have shown, the government's manipulation and 
management of the demonstrations was rejected by students (Zhao 2003). What is 
unmistakable in this recollection is that many student cadres were actively involved in the 
protest regardless of their feelings toward the United States. 

Some of our other findings are more obvious but, nevertheless, also need to be 
documented. Our results show that students who were more knowledgeable about major 
events affecting U.S.-China relations were more likely to be activists or followers than non- 
participants. It also shows that students who had read China Can Say No were more likely to 
be activists than non-participants or uncommitted participants (even though the result is not 
significant statistically). In particular, the students who had read China Can Say No had a 
higher propensity to be activists; the probability of being an activist was 2.8 percent higher for 
those who had read the book than those who had not. Also more likely to be activists were 
students who frequently read campus posters. 
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Finally, some control variables also show significant effects in our regression analysis. 
Male students were more likely to be followers than non-participants and uncommitted 
participants. Being a member of the CCP seems to have no effect on the type of movement 
participation. Students with poor academic records were more likely to be activists than non- 
participants. Compared to the non-Beijing students, Beijing students were more likely to be 
non-participants than activists or uncommitted participants. The reason for this is because 
most of the anti-U.S. demonstrations took place on Saturday and Sunday when most Beijing 
students were at home with their families. One surprising finding is that female students were 
more likely to be leaders than followers. This might be a result of selection bias as female 
students are generally less likely to participate in the protest. 


Summary 


Empirically based studies of movement mobilization tend to treat the movement participants 
as a single block without considering the following aspects: People participate in the same 
social movement for quite different reasons, and different mechanisms are involved in the 
mobilization of different kinds of movement participants. An investigation of the types and 
compositions of participants in a movement and the different sociological factors behind the 
mobilization of these participants is crucial to understanding the process and nature of a 
particular movement. 

This article aims to establish an objectively measurable system of classifying movement 
participants and to use this classification system to better understand the differential 
mobilization during the 1999 anti-U.S. demonstrations in Beijing. To fulfill the tasks, we 
preceded in three steps. First, we developed a quadripartite classification of social movement 
participants (activists, movement followers, opportunists and uncommitted participants) 
based on two criteria: the movement participant's level of interest in the movement goal and 
the amount of resources that a participant wanted to contribute. Then, we tried to verify the 
empirical validity of our classification system by examining the anti-U.S. demonstration and 
using the latent class model to determine the proportions of the anti-U.S. protest participants 
in each of the four categories. Finally, by applying the multinomial logit regression, we tried to 
understand the characteristics of the mobilizing structures and mechanisms of mobilization 
behind each type of the participants. 

Our results show that uncommitted participants and opportunists were the two dominant 
groups during the 1999 anti-U.S. demonstrations. Together, they constituted 76.41 percent 
of the protest participants. Because neither of these student groups had any serious 
commitment to the anti-U.S. ideologies, the anti-U.S. demonstrations were more likely a short- 
term event triggered by the embassy bombing than a reflection of the rise of anti-US. 
nationalism in China as portrayed in the U.S. media as well as in the political science and 
international relations literature. 

To understand who the students were in each of these four categories and why they 
participated in the demonstrations, we also employed the multinomial logit regression 
analysis, in which the activists, followers, opportunists, uncommitted participants and non- 
participants were treated as values of a categorical dependent variable. Among our findings, 
two are most interesting. First, the probability for a protest participant to be an uncommitted 
member increases greatly with the roommate participation rate. This finding is 
understandable. The increase in the roommate participation rate raised the curiosity and 
interest of the uncommitted students and thus attracted many uncommitted members. This 
had certainly contributed to the high percentages of the uncommitted participants in our 
sample. Second, cadres of the Communist Youth League and student unions were more likely 
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to be opportunists than other types of participants. Based on our interviews, we know that 
the university authorities in Beijing had asked student cadres to involve themselves and take 
control of the anti-U.S. demonstrations. Consequently, many student cadres were actively 
involved in the demonstrations not because they had strong anti-U.S. feelings but because 
they were asked to do so. The existence of a higher percentage of opportunists among the 
student cadres reflects the role of the Chinese government in the demonstrations. In 
summary, our findings show that different participants in the anti-U.S. demonstrations were 
mobilized to protest through very different mechanisms and most participants joined in for 
reasons unrelated to their attitudes toward the United States. 


Notes: 


1. For example, on July 14, 1996, some right-wing Japanese students built a lighthouse on the 
disputed Diaoyu Islands. Following the incident, while the Chinese government condemned 
the action and reaffirmed China's sovereignty over the islands, it also tried hard to prevent anti- 
Japanese protests in mainland China (Zheng 1999, 134). Most recently, we were informed that, 
after the collision of a U.S. spy plane and a Chinese fighter jet near China's Hainan Island, the 
Chinese government tried to prevent large-scale anti-U.S. student demonstrations. 


2. The information provided by the official was confirmed by other interviews. 


3. Under the Chinese law, ethnic Chinese who have acquired citizenships in other countries 
are no longer Chinese nationals. Therefore, while the Chinese people were angry at the 
rapes, many of them considered it Indonesia's internal affair. On the other hand, due to the 
Chinese government's resolute stance in opposing the NATO bombing of Yugoslavia, 
most Chinese believed that the bombing of the Chinese Embassy was a deliberate act of 
the U.S. government or U.S. military aimed at punishing China for its opposition. 


4. During Zhu Rongji’s visit to the United States, the Clinton administration refused to sign a 
WTO deal with China although Zhu had offered huge concessions. Soon after, the United 
States also unilaterally posted the details of China's concessions on the website of the 
Office of the United States Trade Representative. Thus, Zhu went back to China almost 
empty-handed and many Chinese with access to the internet read about his unilateral 
concessions. According to Fewsmith (2001, ch.7), this diplomatic failure played a major 
role in pushing the Chinese government to support the anti-U.S. demonstrations. 
Fewsmith’s analysis is certainly logical and has empirical support, but in our interviews 
few students linked their participation in the anti-U.S. demonstrations with this incident. 


5. If the respondent answered “yes” to this question, they were required to specify what 
those forms were. The answers included “writing articles or opinion essays on the 
Internet/BBS,” “actively participating in the debate or discussions among students,” “or 
“talking to friends or teachers from the U.S. and tried to communicate with them.” 


6. This can be downloaded from the web site of Professor Scott Eliason at 
http://www.soc.umn.edu/~eliason/CDAS.htm. Instructions are given on the website as 
well as in an Appendix to Allan McCutcheon’s (1987) monograph. 


7. For model selection with latent classes, see Yamaguchi (2000) for details. He contends 
that “the identifiability conditions — which are required for the likelihood test, AIC or BIC, 
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and are needed for the asymptotic chi-square distribution under the alternative hypothesis 
— do not hold under the null hypothesis, which specifies certain boundary conditions of 
parameters, for this kind of test.” (pp. 1715) 


8. The predicted probabilities here and in the rest of the section are calculated by STATA 
SPost commands developed by J. Scott Long and Jeremy Freese. See Long and Freese 
(2001) for details. 


9. The existence of this kind of recall in our interviews prompted us to add the following 
statement to the survey: “| joined the demonstration because | wanted to have a 
demonstration experience.” Our result shows that about 41 percent of the students 
who had participated in the demonstrations had given an affirmative answer to this 
statement. 


10. Please note that most friendship networks on Chinese university campuses are 
dormitory based. 
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Abstract 

The human dimensions of greenhouse gas emissions and global warming attract 
considerable attention in macrosociology. However, cross-national analyses generally 
neglect greenhouse gases other than carbon dioxide. The current study addresses this 
paucity through the testing of theoretically derived models for the social structural causes 
of the second largest anthropogenic contributor to global warming: methane emissions 
intensity. The cross-national analyses consider the effects of particular economic activities 
and their social organization as well as other domestic conditions and the environmental 
commitment of nation states. Results suggest that both the intensity and social 
organization of production in different sectors contribute to methane emissions per capita. 
In particular, the production of beef and veal, oil and natural gas, and biomass energy all 
positively affect methane emissions intensity. Evidence indicates that while the level of 
economic development and foreign direct investment in the manufacturing and petroleum 
sectors increases emissions, the level of state environmentalism has the opposite effect. 
These findings illustrate the necessity for social scientists to take more nuanced approaches 
when studying human-caused environmental degradation. 


Introduction 


The anthropogenic driving forces of global warming and climate change are a central topic 
in the social scientific areas of international political-economy and environmental 
sociology. The majority of relevant empirical research conducted by social scientists 
concerns the effects of human activities on carbon dioxide emissions (Dietz and Rosa 
1997; Fisher and Freudenburg 2004; Grimes and Kentor 2003; Roberts and Grimes 1997; 
Roberts, Grimes and Manale 2003; Shandra et al. 2004). However, methane emissions are 
the second largest overall human generated contributor to global warming, and like carbon 
dioxide, methane emissions increased substantially during recent decades’ (Houghton et 
al. 1996: Moore et al. 1998; Oliver 2002). While atmospheric carbon dioxide is 200 times 
more plentiful than atmospheric methane, molecule-for-molecule methane is 10 times 
more effective at absorbing and reradiating infrared energy and heat back to the earth's 
surface, which has an impact on global warming (Christianson 1999:219). The relative 
increase in global methane emissions corresponds with the most recent waves of trade 
and investment globalization as well as the broadening and deepening of global 
commodity chains, energy consumption and uneven levels of natural resource depletion 
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(Chase-Dunn, Kuwano and Brewer 2000; Geriffi and Korzeniewicz 1994; Jorgenson 2003; 
Jorgenson and Kick 2003). 

With few exceptions (e.g., Burns, Davis and Kick 1997; Rosa, York and Dietz 2004; York, 
Rosa and Dietz 2003), social scientists neglect methane emissions in their macro-level 
analyses. The studies that do include methane often apply explanatory models to both carbon 
dioxide and methane gases, and the focus is usually on total levels of emissions rather than 
intensity.? Collectively, population levels and economic affluence are treated as the most 
important factors affecting both forms of greenhouse gas emissions. However, scholars 
working in the earth sciences provide more particular explanations for uneven levels of human 
caused methane emissions — some of which differ from carbon dioxide, but these 
explanations typically lack social scientific context. 

Indeed, the relative size and growth of populations are key human drivers of total 
emissions of greenhouse gases (National Research Council 1999). Moreover, some social 
scientists argue that the absolute size of populations has a proportional effect on total 
methane emissions, and per capita differences do exist that cannot be explained by this 
societal characteristic (Rosa et al. 2004; York et al. 2003). For example, the per capita 
emissions of methane for the United States is more than 12 times larger than that of Burundi, 
and 37 times larger for the United Arab Emirates than for Rwanda (World Resources Institute 
2004). Variation in greenhouse gas emissions intensity and per capita levels of other forms 
of environmental degradation are often tied to a variety of political-economic conditions 
(Jorgenson 2004, 2005; Jorgenson, Rice and Crowe 2005; Roberts 2001; Rosa 2001). Thus, 
nuanced analyses are warranted that address the variation in methane emissions per capita. 

In the current study, | conduct a series of cross-national analyses of the social structural 
causes of methane emissions intensity. | pay particular attention to economic sector-level 
causes, including production of rice and cereal, cattle, oil and natural gas, as well as biomass 
energy (burning of biomass). Drawing on various sociological theories and related studies of 
greenhouse gas emissions, | address the impact of economic development, urbanization, 
foreign and domestic investment, state environmentalism, and democratization. Findings 
from the analyses indicate that the causes of methane emissions intensity are multiple and 
complex. Nations with higher levels of state environmentalism exhibit lower per capita 
methane emissions. And, both the levels and social organization of production in different 
sectors contribute to the generation and emission of methane. Results illustrate the necessity 
for social scientists to adopt more fine-grained approaches when studying human caused 
degradation of the biosphere, and point to the need of international policies that deal more 
directly with the transnational social organization of production. 


Social Structural Causes of Methane Emissions Intensity 


Economic Development and Urbanization 


Most social scientific studies of greenhouse gas emissions address the impact of economic 
affluence (Burns et al. 1997; Dietz and Rosa 1997; Grimes and Kentor 2003; Roberts and 
Grimes 1997; Shandra et al. 2004; York et al. 2003). Political-economic perspectives, including 
the treadmill of production and world-systems theory, argue that more developed countries 
generally consume greater levels of energy, which leads to higher environmental externalities 
(Roberts and Grimes 2002; Schnaiberg and Gould 1994). 

The treadmill of production perspective argues that to maintain profits, producers must 
constantly expand production, which requires additional ecological material inputs 
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(Schnaiberg and Gould 1994). The expansion of production and resource consumption 
increases the level and intensity of waste, largely in the form of emissions. Similarly, world- 
systems theory posits that more affluent nations possess the economic power and capital 
intensity that allow them to consume higher levels of energy, which ultimately increases 
methane production and emissions (Burns et al. 1997; Chase-Dunn 1998). Urban political- 
economy scholars argue that energy consumption, which causes greenhouse gas emissions, 
is typically higher and more concentrated in urban areas (Jorgenson 2003, 2004). Thus, one 
would expect a nation’s level of urbanization to positively affect its domestic level of methane 
emissions intensity. 

In contrast to political-economy perspectives, ecological modernization theorists argue 
that “advanced capitalism” is not in fundamental conflict with the environment (Mol 2001). 
Rather, economic development or “modernization” will lead industries to become more 
ecologically efficient. This involves the reorganization of production processes with a net 
outcome of reduced environmental externalities through more eco-efficient modes of 
production. The majority of empirical work supporting this perspective involves case studies 
of particular industries or regions (Sonnenfeld 1998). Conversely, findings of most cross- 
national studies of greenhouse gas emissions and other forms of environmental degradation 
fail to support ecological modernization theory (Jorgenson 2003, 2005; Jorgenson and Rice 
2005; Roberts et al. 2003; York et al. 2003). 


Domestic Economic Structure and Particular Forms of Production 


The overall structure of domestic economies is often cited as an important factor affecting 
global warming (Grimes and Kentor 2003; Roberts et al. 2003; York and Rosa forthcoming). 
However, many of the most common causes of methane emissions involve particular 
agricultural and manufacturing sector activities, including the production of cattle, rice, oil and 
gas, and the burning of biomass (Moore et al. 1998). 

A large proportion of the world’s livestock are ruminants, such as cattle. These animals 
possess four chambered stomachs that generate methane as a byproduct while breaking 
down food (Kaiser and Drennen 1993; Khalil and Rasmussen 1989; Drake 2000). Five to nine 
percent in weight of what a cow consumes ends up as methane, compared to approximately 
1.5 percent for pigs (Christianson 1999). Moore et al. (1998) argue that methane emissions 
from ruminant animals and their manure in more developed countries arise from high-density 
livestock operations (Duxbury and Mosier 1993; Houghton et al. 1996). The vast amounts of 
waste from high-density feedlots are generally collected in large pools exhibiting rising 
bubbles of methane gas (Christianson 1999). However, large-scale and high-intensity beef 
production has increased substantially in less developed countries, typically for export to 
developed nations (Burns et al. 1997; Jorgenson and Burns 2004). 

Mono-agricultural practices, particularly rice production, generate methane through the 
anaerobic? decomposition of organic matter in the soil (Drake 2000; Khalil and Rasmussen 
1989: Warrick et al. 1990). The methane gas typically reaches the atmosphere through the 
plant's vascular system. Climate, soil characteristics and growing methods also contribute to 
the level and intensity of methane produced and emitted (Moore et al. 1998). Some scientists 
argue that as the human population continues to grow rapidly, the production of rice will need 
to increase at high levels (Drake 2000). This is compounded by the preexisting need for greater 
yields to avert famine in various regions of the world, coupled with unsustainable levels of per 
capita consumption in more-developed countries (Jorgenson 2003). 

Methane emissions from oil and gas occur during exploration, production, transportation 
and use (Houghton et al. 1996). During exploration, emissions often result from blowouts and 
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well testing. Methane is also emitted during oil and gas refining when flaring and venting 
processes are employed. Safety valves are another potential source. These valves enable gas 
to be released to prevent the over-pressurization of equipment, and methane is sometimes 
used as a purge gas in flare and vent systems, which ensures that oxygen cannot enter them 
(Moore et al. 1998). Because of purging and unintended leakages, older pipelines used for 
transportation are also prone to emit higher levels of methane. Oil and gas installations as well 
as various manufacturing processes often use turbines or reciprocating engines for heating 
and power generation (World Resources Institute 2004). These are a significant source of 
methane because they commonly use natural gas as a fuel source, and more outdated 
machinery emits waste due to incomplete combustion. 

The burning of biomass for clearing and for fuel is an additional human activity contributing 
to methane generation and emissions (Drake 2000; Khalil and Rasmussen 1989; Levine 1994). 
Savannah and forest soils act as sinks for methane, but this capacity is lessened after 
clearance burning. Often, in less-developed countries these areas are burned to increase feed 
availability for grazing animals and for the development of large-scale agricultural production 
(Jorgenson and Burns 2004). The associated livestock and crops are typically intended for 
export to higher-consuming countries. Moreover, biomass in the form of wood is a main 
source of household fuel in many less-developed countries (Kick et al. 1996) as well as a 
common energy source for various production processes at different nodes in global 
commodity chains (Princen et al. 2002). 

Thus, it is critical to address the environmental impact of particular forms of production 
and fuel use. However, of equal importance is the potential effect of the transnational social 
organization of production in various economic sectors, particularly in the context of foreign 
and domestic investment. 


Social Organization of Production: Foreign and Domestic Investment 


Foreign capital penetration is the most widely studied form of international dependence in 
macrocomparative research (Bornschier and Chase-Dunn 1985; Dixon and Boswell 1996; 
Kentor and Boswell 2003). This form of dependence refers to the extent to which 
transnational corporations dominate the economy of host countries, which is often 
operationalized as the level of accumulated stocks of foreign direct investment relative to a 
nation’s total gross domestic product. Rooted in the sociological traditions of world-systems 
and dependency theories, the model of foreign investment dependence further asserts that 
the accumulated stocks of foreign investment generally make a less-developed country more 
likely to organize its economy around external trade, largely at the expense of the well-being 
of domestic populations and the environment (Chase-Dunn 1998). 

Findings for recent studies suggest that foreign capital penetration is a structural mechanism 
partly responsible for particular forms of environmental degradation, including carbon dioxide 
emissions, sulfur dioxide emissions and water pollution intensity (Grimes and Kentor 2003: 
Shandra et al. 2004; York and Rosa forthcoming). Foreign investors and transnational 
corporations are more likely to make investments in less developed countries that exhibit lower 
environmental standards and policies. A large proportion of foreign investment in less- 
developed countries finances ecologically inefficient, labor- and energy-intensive manufacturing 
processes outsourced from developed countries. Moreover, power generation in the countries 
receiving foreign investment is considerably less efficient (Grimes and Kentor 2008). This often 
results in increased emissions of noxious greenhouse gases (York and Rosa forthcoming). 

Foreign investment in agriculture tends to positively affect forms of water pollution 
intensity associated with the use of pesticides, many of which are banned in more developed 
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countries (Jorgenson and Burns n.d.). This is often referred to as the “pesticide treadmill” 
(Schnaiberg and Gould 1994). Water pollution is also exacerbated by large-scale cattle 
production, an economic activity largely controlled by a combination of foreign investors and 
domestic elites (Jorgenson and Burns 2004), 

Thus, foreign capital penetration in various economic sectors affects environmental 
degradation. Because these economic sectors are known to cause methane emissions for 
reasons similar to other greenhouse gases and organic water pollution (Moore et al. 1998; 
World Resources Institute 2004), | model and test the effects of foreign investment in 
manufacturing, agriculture, petroleum and extraction sectors on methane emission intensity. 

Many social scientists posit that higher levels of domestic investment are beneficial for local 
social conditions and the natural environment (e.g., Bornschier and Chase-Dunn 1985; Kentor and 
Boswell 2003). For example, profits derived from domestic investment are more likely to be 
reinvested domestically, which generally increases economic development and human capital. 
Issues of scale are relevant for environmental outcomes. When foreign direct investment is 
involved, there is pressure to maximize profits for absentee investors, which tends to favor 
economies of scale at the expense of the local ecology (Gibson et al. 2000). Conversely, domestic 
investment stands a better chance of increased local accountability for more environmentally 
friendly processes coupled with smaller scales of productive focus (Young 1997). 


Democratization and State Environmentalism 


An aspect of ecological modernization theory holds that democratization encourages 
environmental responsibility through increased demand for government activism on behalf of 
the natural environment (Ehrhardt-Martinez et al. 2002). Democratization is expected to lead 
to environmental reforms and more sustainable production processes because it provides 
conditions in which concerned groups and organizations can influence policy development and 
behavior (Shandra et al. 2004). Likewise, sociologists working in the “environmental state” and 
related approaches argue that the institutionalization of environmental policies should lead to 
more ecologically efficient forms of production and consumption with lower emissions of 
greenhouse gases (Goldman 2001; Mol and Buttel 2002). Studies in these areas typically 
examine the official adoption of environmentally friendly policies (e.g., Frank et al. 2000). Yet 
evidence of increasing policy formation or environmental treaty ratification does not necessarily 
lead to improved environmental conditions (Buttel 2000). Ina recent study, Fisher and 
Freudenburg (2004) conclude that environmental institutionalization fails to reduce carbon 
dioxide emissions in OECD nations. However, they do not include an index of environmental 
treaty ratification, which many argue is a more appropriate measure of state environmentalism 
(e.g., Dietz and Kalof 1992; Roberts et al. 2004). 


Analyses 


The primary goal of the current study is to empirically evaluate the extent to which particular 
social structural factors affect levels of methane emissions intensity in the contemporary 
world economy. Using ordinary least squares regression with listwise deletion, | conduct a 
series of quantitative cross-national analyses of methane emissions per capita, 1995*. OLS 
with listwise deletion is the most common regression technique used in social scientific 
cross-national research. | include measures of economic development, urbanization, overall 
domestic economic structure, particular forms of production, sector-level foreign direct 
investment and overall domestic investment, democratization and state environmentalism.° 
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Dependent Variable 


The dependent variable is methane emissions intensity, measured as per capita emissions of 
methane, 1995 (logged).° Measures of total methane emissions in thousand metric tons are 
gathered from the World Resources Institute 2004.’ These measures include emissions in 
thousand metric tons of CO9 equivalent from energy, agriculture, waste and other sources.® 
| transform these data into per capita measures by dividing total emissions by the total 
population of nations (World Bank 2000). 


Independent Variables 


Gross Domestic Product Per Capita, 1990 (logged) measures a nation’s relative level of 
economic development and capital intensity. These data are taken from Maddison (2001), and 
are measured in 1990 international dollars. 


Urban Population as Percentage of Total Population (residualized), 1990 controls for relative levels 
of urbanization (World Bank 2000). To reduce collinearity, | regress these data on per capita GDP 
and use the residuals as measures of urbanization, independent of economic development.° 


Manufacturing as Percentage of Total Gross Domestic Product (residualized), 1990 measures 
the extent to which a domestic economy is manufacturing based (World Bank 2000). 


Agriculture as Percentage of Total Gross Domestic Product (residualized), 1990 controls for the 
extent to which a domestic economy focuses on agricultural production (World Bank 2000). 


Exports as Percentage of Total Gross Domestic Product (logged), 1990 measures the extent 
to which a nation is integrated into the international trading system. These data are taken from 
the World Bank (2000). 


Gross Domestic Investment as Percentage of Total Gross Domestic Product, 1990 represents 
the level of domestic investment in fixed assets plus net changes in inventory levels.'° | obtain 
these data from the World Bank (2000). 


Cereal Production Per Capita (logged), 1990 refers to the per capita amount of cereals 
produced in 1990." Cereals include rice, wheat, barley, maize, rye, oats, millet, sorghum, 
buckwheat, alpiste/canary seed, fonio, quinoa, triticale, wheat flour and the cereal component 
of blended foods. | gather total cereal production in thousands of metric tons from the WRI 
(2004). Using total population data from the World Bank (2000), | convert the measure into a 
per capita score. 


Beef and Veal Production per Capita (logged and residualized), 1990 includes beef and veal 
from bovine animals whether salted, in brine, dried or smoked. The measures include edible 
flours and meals. Beef and veal production is usually reported inclusive of bone and exclusive 
of meat that is deemed unfit for human consumption. | gather measures of total beef and veal 
production in thousand metric tons from the WRI (2004), and convert these data into per 
capita scores with the use of total population measures taken from the World Bank (2000). 


Oil and Natural Gas Energy Production Per Capita (logged), 1990 quantifies production of 
crude oil and natural gas liquids (WRI 2004), Petroleum products such as refinery gas, ethane, 
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aviation fuel, kerosene, diesel fuel and paraffin are also included. These data are measured 
in thousand metric tons oil equivalent. Using total population measures (World Bank 2000), | 
transform these data into per capita scores. 


Biomass Energy Production Per Capita (logged), 1990 measures the per capita amount of 
plant matter (i.e., wood, vegetal waste, animal wastes/materials, sulphite lyes and other forms 
of solid biomass) used directly as fuel or converted into other forms before combustion (WRI 
2004). | transform the measures of total biomass energy production into per capita values 
using measures of total population (World Bank 2000). 


Accumulated Stocks of Foreign Direct Investment in Manufacturing as Proportion of Total 
Gross Domestic Product, 1967. Investment data are gathered from Bornschier and Heintz 
(1979), and GDP data are taken from Maddison (2001). The investment data refer to the 
accumulated stocks of foreign private direct investment in millions of US dollars in the 
manufacturing sector. 


Accumulated Stocks of Foreign Direct Investment in Extraction and Petroleum as Proportion 
of Total Gross Domestic Product, 1967. Sources for these data are the same as for the 
preceding independent variable. The investment data refer to the accumulated stocks of 
foreign private direct investment in millions of U.S. dollars in the resource extraction and 
petroleum production (including crude oil and natural gas) sectors. 


Accumulated Stocks of Foreign Direct Investment in Agriculture as Proportion of Total Gross 
Domestic Product, 1967. Sources for these data are the same as the two preceding 
independent variables. The investment data refer to the accumulated stocks of foreign private 
direct investment in millions of U.S. dollars in the agricultural sector. 


State Environmentalism (residualized) is a weighted index of relative state environmental 
performance. These data are gathered from Dietz and Kalof (1992). The index is a cumulative 
measure of ratification of international environmental treaties, 1963-1987. Dietz and Kalof weight 
these data through a series of principal component analyses of the original measure, which 
consists of a count of identified treaties ratified by a given nation during the 24-year period. 


Index of Democratization, (residualized) 1980 consists of two components: electoral 
competition and popular participation. The first component is calculated by subtracting the 
percentage of votes won by the largest party from 100. The second component quantifies the 
percentage of the total adult population who voted in the election concerned. Both 
components are weighted equally in the construction of the democratization index, which | 
obtain from Vanhanen (1997).' 


Countries Included in the Analyses 


Many similar cross-national studies restrict their analyses to less-developed countries (e.g., 
Grimes and Kentor 2003; Kentor and Boswell 2003; Shandra et al. 2004). The general argument 
is that various macro-level conditions and social processes exist and operate quite differently 
in less developed countries relative to developed ones. However, other social scientists stress 
the theoretical importance and empirical utility for including all countries for which data are 
available (e.g., Chase-Dunn 1998; Jorgenson 2008). Moreover, it is quite likely that this “mixed 
sample” approach will likely underestimate the effects of particular independent variables. | 
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agree with this approach and thus include all countries for which data were available at the 
time of the study. To maximize the use of available data, | allow sample sizes to vary among 
tested models. Using Cook's distance and residual diagnostics, | determine that the reported 
analyses do not include any influential cases or countries with standardized residuals greater 
than 3. Table 1 provides descriptive statistics and correlations for all variables included in the 
analyses. 


Results 


Table 2 presents findings for the regression analyses. Model 1 is treated as a baseline, 
consisting of the most common controls in similar cross-national studies of greenhouse gas 
emissions. Models 2 through 5 are thematically derived, consisting of additional independent 
variables corresponding to each theme. Model 6 is the most fully saturated of the tested 
series, consisting of all but one variable addressed in preceding models. Model 7 excludes all 
non-significant baseline controls included in Models 1 through 6, which reduces relative levels 
of multicollinearity among the remaining independent variables and results in a more 
appropriate number of predictors relative to the reduced sample size. 

Findings for Model 1 indicate that both the level of economic development and 
urbanization positively affect the intensity of methane emissions. Effects of all remaining 
predictors are non-significant. This baseline analysis predicts a relatively low proportion of 
variation in levels of methane emissions per capita (adjusted rsquare = .20). 

Model 2 includes measures of two more specific forms of food production activities: (1) 
cereal production and (2) beef and veal production per capita. The inclusion of these two 
measures lessens the effects of economic development and urbanization, both of which 
become non-significant. Consistent with the arguments of other researchers, | find that both 
cereal production and livestock production positively affect methane emissions intensity, net 
of the baseline controls. The standardized coefficient for beef and veal production is more 
than double that of cereal production per capita. With an adjusted r-square of .53, the 
inclusion of these two variables greatly increases the proportion of variance explained in the 
dependent variable. 

Model 3 addresses the effects of oil/natural gas and biomass energy production, net of 
the baseline control variables. It warrants noting that the sample size for this model drops to 
45 countries. Unlike Model 2, the inclusion of these two variables does not lessen the positive 
effect of economic development, but the effect of urbanization remains non-significant. 
Turning to energy production, | find that biomass and oil/natural gas production positively 
affect methane emissions per capita. The relative effect of oil/natural gas production is greater 
than biomass energy production, and the proportion of variance explained in this model is 
moderate (adjusted square = .45). 

Model 4 examines the impact of foreign direct investment in these economic sectors: (1) 
manufacturing, (2) extraction and petroleum production, and (3) agriculture. The sample size 
for this analysis returns to 68 countries (identical sample as Model 1), and the proportion of 
variance explained in methane emissions per capita increases from the baseline model 
(adjusted square = .26). The effect of economic development continues to be positive and 
statistically significant, while the effects of the remaining baseline controls remain non- 
significant. Turning to foreign investment, | find that investment in manufacturing as well as 
extraction and petroleum both positively affect methane emissions intensity. The magnitude 
of the impact of investment in manufacturing is slightly larger than for investment in 
extraction and petroleum production, and the effect of the latter is only slightly statistically 
significant, p < .10. 
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Model 5 consists of measures of democratization and state environmentalism. Inclusion 
of these two predictors does not temper the positive effect of economic development, and 
a low to moderate level of variation in methane emissions intensity is explained (adjusted r- 
Square = .26). Turning to the two new variables, | find that state environmentalism negatively 
affects methane emissions while the effect of level of democratization is non-significant. 

Model 6 includes all baseline controls as well as predictors with significant effects in 
Models 2 through 5. The inclusion of a higher frequency of independent variables coupled 
with the application of listwise deletion results in a reduced sample size of 39 countries. The 

. effects of all baseline controls are non-significant. It warrants noting that the variance inflation 
factor for gross domestic product per capita elevates to 3.11, and VIFs for the remaining 
baseline controls (except level of exports) are elevated to or slightly above 2. Turning to the 
other independent variables, livestock production (beef and veal), oil and natural gas energy 
production, biomass energy production, and foreign investment in manufacturing all continue 
to positively affect methane emissions intensity. Moreover, the effect of state 
environmentalism remains negative and statistically significant. Unlike models 2 and 4, the 
effects of both cereal production per capita and foreign direct investment in extraction and 
petroleum production become non-significant. This more fully-saturated model explains a 
high proportion of variance in levels of methane emission per capita (adjusted square = .63), 

To further explore the impact of the statistically significant predictors for preceding 
models, | remove all baseline controls except level of economic development from Model 7. 
The exclusion of these variables increases the sample by two countries and greatly reduces 
the VIFs for economic development and beef and veal production per capita as well as the 
remaining independent variables to a lesser extent. Consistent with Models 1 and 3 through 
5, the positive effect for level of economic development becomes statistically significant yet 
lower in magnitude. Effects for the remaining predictors are similar to those reported in 
Model 6. However, the relative magnitude of the effect decreases for beef and veal 
production and increases for state environmentalism, biomass energy production, and oil and 
natural gas production. The standardized coefficient for foreign investment in manufacturing 
stays relatively the same as for Model 6. This reduced model explains a slightly higher 
proportion of variance in methane emissions intensity than the more fully saturated model 
(adjusted rsquare = .69). 


Discussion and Conclusion 


Methane emissions are the second largest anthropogenic contributor to global warming, one of 
the most pressing environmental issues facing all societies. Like other noxious greenhouse 
gases, global methane emissions have risen dramatically over the past decades and century. 
Increased emissions parallel recent upswings in trade and investment globalization as well as 
global commodity production and resource consumption. With few exceptions, social scientists 
ignore methane in their analyses of greenhouse gas emissions, and those studies that do 
include methane often apply explanatory models that neglect important sectorlevel factors 
‘identified by researchers working in the earth sciences. Thus, in this study | provided nuanced 
cross-national analyses of the social structural causes of methane emissions intensity. 

Level of economic development is found to positively affect per capita methane 
emissions. This finding, coupled with the results for other recent studies (Jorgenson 2003, 
2004, 2005; Burns et al. 1997; Grimes and Kentor 2003), supports propositions of political- 
economic perspectives regarding the environmental impact of capital intensity, particularly 
the arguments of world-systems theory and the treadmill of production (Roberts and Grimes 
2002; Schnaiberg and Gould 1994). Conversely, this result challenges neoclassical economic 
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and ecological modernization assertions concerning the ability of nations to lessen their 
environmental impact through economic development (Grossman and Krueger 1995; Mol 
2001). The current study illustrates the importance of careful consideration of scope 
conditions and units of analyses when testing key propositions of ecological modernization 
and other theoretical perspectives (York and Rosa 2003). 

Particular sector-level activities contribute to per capita methane emissions. These include 
the production of beef and veal, oil and natural gas, and the use of biomass for energy. Some 
evidence is found indicating that cereal production, including rice, positively affects methane 
emissions per capita. Export-oriented cattle and cereal production have increased substantially 
in less-developed countries (Burns et al. 1997; Jorgenson and Burns 2004). Paradoxically, the 
majority of goods produced in these sectors are consumed in more-developed countries, 
which highlights the outsourcing of consumption-based environmental costs (Princen et al. 
2002). Overall, these results underscore the necessity for considering the environmental 
impact of particular economic activities in general and production processes in particular. 

Consistent with other studies (Grimes and Kentor 2003; Kentor and Grimes forthcoming), 
robust evidence is found concerning the environmental impact of foreign direct investment in 
manufacturing. Similarly, some evidence is found indicating that foreign investment in extraction 
and petroleum production positively affects methane emissions intensity. These findings 
support sociological theories emphasizing foreign capital dependence (Dixon and Boswell 1996) 
and highlight the environmental implications of the transnational social organization as well as 
levels of production in different economic sectors (Bornschier and Chase-Dunn 1985). 

Nations with higher levels of state environmentalism emit lower levels of methane 
emissions per capita. This result supports arguments concerning the potential environmental 
impact of institutional commitments by nation states (e.g., Fisher and Freudenburg 2004; 
Goldman 2001; Mol and Buttel 2002). However, other cross-national studies of environmental 
degradation fail to empirically validate this proposition (e.g., Jorgenson and Burns n.d.; York 
et al. 2003). Indeed, the relationship between formal environmental commitments of nation 
states and actual environmental outcomes clearly requires additional empirical inquiry. | hope 
the current study will encourage other social scientists to further investigate these 
institutional-ecological dynamics. 

The policy implications for our findings are significant. Foreign capital has a built-in 
incentive to ignore environmental externalities in favor of maximizing profits. Widely accepted 
assertions about the beneficial effects of foreign investment need to be weighed against 
ecological concerns related to the scale, intensity and social organization of production. For 
example, both per capita oil production and foreign investment in this economic sector 
positively affect methane emissions intensity. Ecologically inefficient extraction and 
refinement processes — such as pipeline leakages and purging as well as excessive flaring 
~increase methane emissions (Moore et al. 1998). These conditions are deemed problematic 
for foreign investors if they become economically inefficient. This also applies to the 
transnational social organization of production in the manufacturing sector. Just as it does 
for carbon dioxide emissions, foreign investment in manufacturing affects methane 
emissions intensity. More stringent economic penalties for environmentally inefficient 
manufacturing, extraction and refining processes could increase the likelihood that firms will 
upgrade equipment used in production, resulting in decreases of methane emissions. 

Thus, implementation of stricter penalties or taxation for these kinds of activities could 
force transnational firms to resolve and internalize certain contradictions between economy 
and ecology (Foster 2002). Given the potential for “capital flight” (Robinson 2004) and the 
global scope of greenhouse gas emissions, austere forms of regulation would be more 
effective at the international level. Moreover, considering the double-edged impact of cattle 
production (emissions from cattle and biomass burning to increase grazing areas) and 
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potential health problems associated with the consumption of beef products, community 
health organizations would do well to frame their public campaigns as partial solutions for a 
combination of diet-related health problems and concurrent environmental degradation. 


Notes 
1. Total global methane emissions increased 2 percent from 1970-1995 (Oliver 2002). 


2. Throughout this study | use the terms methane emissions intensity and methane emissions 
per capita interchangeably. 


3. Anaerobic refers to processes not requiring oxygen. 


4. A limitation of the present study is the analyses of cross-sectional data. Reliable national- 
level measures of methane emissions for earlier time points are unavailable at the time 
of this study for a relatively large number of cases. This disallows for comprehensive 
cross-national analyses of the social causes of change in methane emissions intensity. 
Thus, caution is warranted when making connections between this form of static analysis 
and temporal changes in the effects of the independent variables on per capita methane 
emissions. 


5. Additional factors known to contribute to methane emissions include the production of coal 
and decomposition of solid waste in landfills (Moore et al. 1998). Data for these activities 
are unavailable at the time of this study for an adequate number of countries. 


6. Variables that are logged in the present study are done so to correct for skewness. 


7. The WRI (2004) obtains these data from the National Institute for Public Health and the 
Environment's 2001 Database for Global Atmospheric Research [EDGAR], which is 
also available in a published format (Oliver 2002). The methane emissions data are 
calculated using a standardized methodology and reviewed for accuracy by the United 
Nations Framework Convention on Climate Change (WRI 2004). This standardized 
methodology involves EDGAR, using international statistical data sources to estimate 
emissions of methane gas for particular activities as reported by their database (WRI 
2004). EDGAR provides conservative measures of reliability or “uncertainty” for their 
calculated estimates of methane emissions as well as carbon dioxide and nitrous oxide 
emissions. Their methodology is also approved and used by the Intergovernmental 
Panel on Climate Change. Overall, the methane emissions estimates are considered 
more reliable and less “uncertain” than the estimates of nitrous oxide emissions, but 
slightly less reliable and slightly more “uncertain” than the estimates of carbon dioxide 
emissions (Oliver 2002). 


8. Energy sources consist of emissions from the production, handling, transmission and 
combustion of biofuels and fossil fuels. Agricultural sources are comprised of emissions 
from rice production, animals, animal wastes, agricultural waste not intended for energy 
production, and savannah burning. Waste sources include emissions from wastewater 
treatment and disposal, landfills, and waste incineration not intended for energy 
production. Other sources are comprised of emissions from industrial processes, and 
tropical as well as temperate forest fires (WRI 2004). 
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9. To reduce collinearity, other independent variables in the current study are residualized (and 


labeled accordingly) using the same method (residuals from being regressed on per capita 
GDP). 


10. Ideally, | would prefer measures of domestic investment for different economic sectors. 
Nevertheless, those types of data are unavailable at the time of the present study. 


11. | acknowledge that this comprehensive measure might underestimate the effects of rice 
production on methane emissions intensity. However, national-level indicators focusing 
explicitly on rice production are unavailable at the time of the present study. 


12. | acknowledge the temporal limitation of this measure. Yet, more recent adequate 
measures of democratization are unavailable for a large sample of countries. 


13. In a series of unreported analyses, | analyze Models 1-5 and 7 with the dataset used for 
the reported analysis of Model 6 (N = 39). This allows for the assessment of potential 
sensitivity of the parameter estimates to the varying sample sizes in the reported 
analyses. Results differ very little from those reported in the current study with the 
exception of the effect of urban population in Model 1, which becomes non-significant 
as well as the effect of biomass energy production in Model 3, which is slightly lower in 
magnitude relative to the effect reported for Model 3. However, the effect of urban 
population is non-significant in the reported findings for Models 2-7 as well as results for 
all other unreported models. | thank an anonymous reviewer for suggesting the inclusion 
of this form of sensitivity analysis, which further validates the reported findings. 
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Statistical Inference and Patterns of Inequality 
in the Global North 


Timothy Patrick Moran, State University of New York-Stony Brook 





Abstract 

Cross-national inequality trends have historically been a crucial field of inquiry across the 
social sciences, and new methodological techniques of statistical inference have recently 
improved the ability to analyze these trends over time. This paper applies Monte Carlo, 
bootstrap inference methods to the income surveys of the Luxembourg Income Study 
database to identify patterns of distributional change in the global North from 1980 to 
2000. While it is now generally accepted that inequality has increased in the United States 
and United Kingdom during this period, the extent to which other wealthy nations have 
been able to avoid this trend (or not) has generated some debate. This paper presents new 
evidence to address this discussion, demonstrating along the way how the ability to conduct 
formal statistical inference with statistics such as the Gini index provides an effective and 
important new evaluative tool for comparative research. 


In the early 1980s researchers in the United States and the United Kingdom began to notice that, 
after a long period of relative stability, the distribution of income was becoming noticeably 
unequal. The trend continued, and in 1988 the phenomenon was coined the “great U-turn” by 
Harrison and Bluestone in recognition of the historically unique rising levels of inequality in 
wealthy, developed nations. While no single “smoking gun” has been found to explain this trend 
(Gustafsson and Johansson 1999), debates tend to implicate various forms of economic 
restructuring (shifting patterns of trade, increased capital and labor mobility, deindustrialization, 
and the rise of the skill-based economy) that fall under the “globalization” rubric (Alderson and 
Nielsen 2002). Because this restructuring characterizes the global North more broadly, both 
academic and popular interpretations tend to discuss rising inequality as a nearly universal 
outcome of these processes in the 1980s and 1990s (Friedman 2000; Smeeding 2002a). As 
summarized by Ram (1997:577), “[t]he somewhat cheerless distributional position recently 
noted for the United States seems to characterize most of the postwar developed world.” 
Others, however, have begun to question the extent to which trends in the United States and 
United Kingdom were replicated throughout the global North. Observers of these contrasting 
outcomes argue that technological change has been less skill-based in parts of Europe than in the 
United States and the United Kingdom, and that returns to education and skill increased less 
sharply in these areas (because the supply of skilled workers increased faster), leading to “less of 
an increase, or even no change” in wage inequality in these countries (Acemoglu 2002: 1). 
Another line of interpretation has emerged around the general idea that political contexts and 
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institutional configurations matter. Some argue, for example, that European labor policies and 
wage-setting institutions mitigate the tendency toward increasing earnings inequality (Acemoglu 
2002; Blau and Kahn 1996; Freeman and Katz 1995; Nickell and Bell 1996). In particular, many 
studies find that labor union density significantly reduces inequality (Alderson and Nielsen 2002; 
Freeman 1993; Gustafsson and Johansson 1999), and that policy liberalism/leftist government 
strongly drives the redistribution process (Bradley, et al. 2003; Brady 2003; Kelly 2004). 

Using the surveys of the Luxembourg Income Study Project (LIS), this analysis brings new 
methodological considerations to bear on this debate. First, the paper presents the first 
systematic application of recently developed “bootstrap” resampling procedures in the context of 
cross-national comparisons of inequality trends. Whereas traditionally only subjective evaluations 
of differences in inequality indicators are made, these procedures allow us to assess more 
formally the statistical significance of changes in inequality, thus helping to distinguish “real” 
movements in the Gini index from ones better attributed to chance, error and random variation 
in the surveys. Second, changes in inequality over time are situated within the context of cross- 
national levels of inequality, thus contextualizing trends over time within broader macro-structural 
differences. Incorporating these two empirical issues into an historical analysis of inequality thus 
facilitates the detection of three patterns of distributional change in the global North over the past 
20 years: a “Continental” pattern, an “Anglo” pattern and a “Scandinavian” pattern. 


Data and Methods 


The LIS project has pioneered the collection and distribution of cross-nationally comparable 
socio-economic data and is now the primary source of such information for comparative 
analyses (Smeeding 2002b). The effective “harmonization” of international survey data 
originated by the LIS - where microdata from member nations is standardized to maximize 
cross-national comparability — has now led to a database of more than 130 observations 
across 29 nations. Of these, 17 constitute the wealthy countries of the global North." 

While the LIS database has often been mined for inequality trends, this paper presents the 
first systematic evaluation of the LIS income surveys using bootstrap inference techniques. In 
the analysis that follows, inequality is measured by the Gini index as specified according to 
the methodological recommendations of the LIS project (and more fully outlined in Appendix 
A). The unit of analysis is the household, and income is measured as net, equivalent (i.e., 
adjusted for household size using an equivalence scale), disposable income subjected to both 
top and bottom coding. Also based on LIS specifications, the Gini index is weighted by 
“person weights” — the product of the household weight (the weights established by the 
procedure used in the original dataset, not by LIS) and the number of persons in the 
household — to account for differences in household size. 


Statistical Inference via the Bootstrap 


The absence of accessible methods to assess statistical significance has been a glaring 
shortcoming in comparative research more broadly and within the inequality literature in particular. 
Practitioners seeking to quantify inequality have traditionally relied on descriptive devices such as 
the Lorenz curve, quintile income shares and various summary indices. In the absence of well 
established and practically useful analytical error theories for these measures (theories that exist 
for statistics such as the mean), practitioners are left making evaluations based on little more than 
visual or absolute-numerical comparisons or by establishing arbitrary percentage-change scales. 
As Rossi, et al. (2001:905) explain, “[point estimates unaccompanied by any precision measure are 
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the rule rather than the exception, so that tracking the changes of personal distribution through 
time rests to a large extent on scholars’ a prior beliefs.” As a result, error or random variation in the 
samples can produce discrepant portraits of distributional change within countries even as 
scholars build models attempting to explain such changes cross-nationally. Moreover, minor 
changes in Gini coefficients might not warrant directional conclusions at all, perhaps signaling 
instead that “no change” in inequality has occurred (a description rarely used by scholars looking 
for trends). In short, the conclusions we tend to draw in empirical inequality research are “far more 
tentative than has often been realized.” (Mills and Zandvakili 1997: 140) 

Within the past few years, however, significant developments have occurred in both the 
theory and practice of conducting formal statistical inference on inequality measures like the Gini 
index, new methods that seek to nonparametrically estimate the underlying sampling 
distributions of such statistics. Pioneered by Efron (1979), these “bootstrap” methods substitute 
computerdriven simulation for theoretical calculation by employing Monte Carlo resampling 
plans that evaluate the accuracy of a statistic using the empirical distribution as a simple estimate 
of the entire probability distribution found in the population. More specifically, the bootstrap 
procedure seeks to recover the standard error and sampling distribution of a parameter estimate 
numerically (as opposed to theoretically) via repeated random samples drawn with replacement 
from the observed sample distribution, thus allowing the construction of confidence intervals and 
hypothesis tests on inequality and poverty indicators common in quantitative cross-national 
research. Not only do bootstrap techniques increase the application of statistical inference to a 
vast number of new research situations, they also reduce the number of assumptions necessary 
for such analyses, and eliminate the need to work with cumbersome analytical formulae. Thus, 
they are likely to soon proliferate to a wide range of social scientific settings. 

The bootstrap technique is relatively straightforward yet analytically powerful. (For general 
discussions of the bootstrap, see Chernick 1999; Davidson and Hinkley 1997; Efron 1982; Efron 
and Tibshirani 1993.) The method is conceptually based on the “plug-in principle,” where known 
sample values are taken as simple estimates of the entire population. Having observed a random 
sample of size n from a population, the nonparametric, one-sample procedure works as follows:? 


(1) Randomly draw an independent resample of size n with replacement from the 
original sample distribution, so that each case has probability 7" of being selected. 
Replacement here refers to each draw — not each sample. Thus, some observations 
can be counted more than once, some not all in any given resample.* 


(2) Evaluate the statistic for the bootstrap resample, which becomes a bootstrap 
replication of the sample estimate. 


(3) Independently repeat this procedure a large number of B times, (in this analysis B 
= 1000), each time obtaining bootstrap replications of the statistic.* 


(4) Calculate the bootstrap standard error of the sampling distribution, estimated as: 


B 1/2 
es eal Ae _K* 
Sboot = {54 >; Ee: } (1.0) 


b-1 
where 65 represents a bootstrap replication of the original point estimate of an unknown 
population parameter ©, computed from the 6" bootstrap resample, and 


B 
<7 26) 


D>! 
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is the mean of the parameter estimates obtained in the B resamples. Based on the law of 
large numbers, as the number of B replications approaches infinity the bootstrap standard 
error (1.0) converges to the (ideal) estimate of the standard error found in the sample, which 
is still an estimate because it is based on a particular sample and not the population.® 

Several alternative methods have been suggested for assigning confidence intervals to the 
unknown parameter based on the bootstrap distribution. While more technical discussions 
illustrate mathematical reasons to prefer one method over another, in practice there is little 
distinction in the numerical performance of the various confidence intervals. The similarity in 
performance is important in this context because certain interval types are unavailable in 
versions of the statistical software currently on the LIS system (in particular the “bias- 
corrected and accelerated” method that many argue produces the most accurate confidence 
intervals).® In a separate analysis conducted at the LIS, | compared the various methods 
calculable within their system (including the non-accelerated, bias-corrected method) and 
found that the methods had very little practical impact on interval locations. Specifically, there 
was never more than a .001 difference across the confidence interval methods (Moran 
forthcoming). The close conformity of the different methods found in the LIS database is 
consistent with previous performance comparisons in the applied literature. Using various 
forms of income distribution data, for example, Mills and Zandvakili (1997), Biewen (2002), 
Trede (2002) and Van Kerm (2002) all find the same small, one percentage point variation in 
coverage between the various bootstrap confidence intervals. 

Because comparisons, especially in the context of income data, have resulted in little 
practical difference across the interval estimates (and especially as sample size gets large), 
the decision regarding which bootstrap interval to report can be based on more practical 
considerations. Between the confidence interval procedures calculable within the constraints 
of the LIS, most arguments favor what has come to be called the “percentile” method, and 
this method is preferred in the applied literature as well (Mills and Zandvakili 1997). After 
establishing a bootstrap distribution of a statistic over a B number of replications, the 
percentile method consists of establishing confidence interval limits directly from the 
resamples. The method gets its name because the a/2" and 1- a/2" empirical percentiles of 
the bootstrap distribution are taken as two-sided a-level percentile confidence interval 
boundaries. Once the values of ©» are ordered ascendingly, the lower limit of the confidence 
interval is found at the B X a/2" value of the distribution of bootstrapped estimates and the 
upper limit is the B x (1- a/2)" value of the distribution. 

Percentile confidence intervals have much to recommend over other interval methods. First, 
this procedure does not require implicit assumptions (e.g., shape and symmetry of the distribution, 
an unbounded statistic) necessary for intervals established using the normal approximation. 
Second, the percentile method is transformation preserving, meaning that “confidence intervals for 
a transformation of the statistic are identical to the transformed confidence intervals.” (Trede 
2002:268) Third, it provides asymmetrical and range preserving intervals that will never extend 
beyond the permissible range of a bounded statistic such as the Gini index. Confidence intervals 
that are both asymmetrical and range preserving tend to be more accurate and reliable. Fourth, it 
is computationally simple to execute, and thus quite accessible in practical applications. 


Hypothesis Testing via the Bootstrap 


The statistical significance of the difference between two sample estimates (either between units 
or within them over time) can most immediately be assessed by evaluating the width of the two 
confidence intervals. When confidence intervals do not overlap, we can unambiguously assess 
this difference to be statistically significant (at level a). In the presence of overlap, however, it does 
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not immediately follow that the difference is not statistically significant. In this case, an hypothesis 
test is required to evaluate the statistical significance of the difference in parameter estimates. 

With access to the microdata, such procedures are readily available via the bootstrap 
procedures. The conventional method in the literature is to conduct a one-sided test to determine 
if the observed difference is statistically significant or not. This method is preferred because 
specifying the test in this manner allows for the calculation of pseudo p-values. Like conventiorial 
hypothesis testing using the mean, the idea is to find a one-sided p-value defined as the probability 
of finding at least that large of a difference when the null hypothesis is true.” As above, the bootstrap 
method derives a “plug-in” approximation of the distribution of the difference of the two estimates 
through repeated resampling. A bootstrap p-value, p*, can then be estimated as the proportion of 
bootstrap samples yielding a test statistic more extreme than the actual test statistic computed 
from the data. A one-tailed test that rejects above the test statistic can be approximated as: 


B 
1 a * ax 
oz! {aj =0 } (1.1) 


where I denotes the indicator function of the set (i.e., the number of times the bootstrap 
estimate is greater than the parameter estimate). 

Alternatively, one can specify a test that renders a two-tailed decision (one that tests if the 
difference between the estimates is higher or lower), but this test will have no corresponding 
p* values as above. Following the procedure outlined in Efron and Tibshirani (1993), say we 
have two /id samples x (size 7) and y (size ™) taken to represent populations X and Y, and we 
want to test the null hypothesis H: X = Y. The hypothesis test requires a test statistic ds) 
which could be a parameter estimate © but not necessarily. In the cross-national application 
presented in this study, t(s) = Gini, - Giniy, the difference between the inequality estimates 
in one country in two time periods. 

Under the assumption that the two samples are independent, first draw a bootstrap 
resample s,* of size n with replacement from x so that the probability for selection in s,* is 
1/n, and calculate the statistic of interest G.*. Then draw a bootstrap resample s,* of size m 
in the same manner and again calculate the statistic of interest Gy*. Repeat the procedure a 
B number of times, each time evaluating the difference between the Gini coefficients: 


ts)* = G,* - GY (1.2) 
Construct a bootstrap confidence interval of the differences in the Gini index, for example by 
using the percentile method. The difference originally found between the two samples is then 
said to be statistically significant (at a) if zero is not contained in the interval. 

The two-tailed procedure does not render p* values as in in 1.1. Doubling the values from 
1.1 has the same interpretive meaning as two-sided p-values, but these will only be 
approximations because we cannot assume that the bootstrap distribution of the test statistic 
is symmetrical. For this study | conducted hypothesis tests using both procedures, and even 
though theoretically the two methods need not correspond, in every case the conclusion 
regarding statistical significance was the same — decisions reached by doubling the one-sided 
p*-value always corresponded with the two-tailed test decision. 

In contexts where we can assume that the bootstrap replications are drawn from 
independent populations, the testing procedures outlined above are straightforward. In 
dependent data conditions, however — such as the LIS surveys employing panel designs — 
bootstrap resampling should be drawn simultaneously (as opposed to separately) so that the 
resample inherits the correlations of incomes between time ty and tg of the panel. This task 
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turns out to be difficult in the LIS database where there is no clean way to track households 
over time. The overall closeness of the various bootstrap methods and the relatively minor 
effects of various bootstrap corrections found across the econometric literature, however, 
suggest that these refinements are unlikely to produce very different intervals than reported 
here. (See, for example Trede 2002.) Nonetheless, the reader should be aware that Ireland, 
Luxembourg, Germany and the Netherlands involve tests where both samples come from the 
same panel, and that these tests are likely to be slightly biased in a conservative direction. 


Distributional Change in the Global North: 1980-2000 


As discussed at the outset, recent debates have challenged the extent to which rising 
inequality is a universal characteristic of the global North in the 1980s and 1990s. Using the 
inference procedures outlined above across the 17 nations examined here, however, three 
distinct patterns of distributional change are evident — a “Continental” pattern, an “Anglo” 
pattern and a “Scandinavian” pattern.’ The three patterns and the specific trends in inequality 
found in the constituent countries are now described in turn. Historical trends in inequality for 
individual nations are presented in a series of figures (Figures 1a - 3) that chart the movement 
in the Gini index enclosed within the bootstrap 95 percent confidence interval. For various 
sub-periods within the 20-year period (marked by two vertical arrows), the magnitude of 
changes in the Gini index are evaluated using the following format: 





.008 «~ Absolute change in the Gini coefficient 
2.8 «— Percentage change in the Gini coefficient 
.016 « One-sided p*-value of the null hypothesis test (if the intervals overlap) 


a. The Continental Pattern — Steady States 


Looking at the bootstrap results in Figures 1a through 1c, the prevailing trend in Continental 
Europe is one of little distributional change — or change that at least is not large enough to 
reach statistical significance. The Continental pattern is characterized by relatively moderate 
levels of inequality that remain essentially unchanged throughout the period, although in a 
few cases inequality actually declined. 


Exemplars of the Continental Pattern 

Seven countries form what can be called the exemplar Continental group — experiencing 
essentially no change in (or even declining) levels of inequality over the period. France shows 
statistically no change in inequality since 1979 (and significantly less inequality than in 1984); 
Germany is just the opposite, showing significantly less (although substantively maybe the 
same) inequality than in 1973, with no change since 1984; the Netherlands shows no change 
since 1983; Italy and Canada illustrate the influence of beginning and end points in defining 
historical trends. Taking 1986 as the starting point for Italy, the trend shows significantly more 
inequality, but since 1987 Italy shows no change (and also no change since 1993). The last 
two data points for Canada (1998 and 2000) come from a different survey source than the 
rest, so the observed uptick in inequality may be attributed in part to survey inconsistencies. 
However, even using the 2000 figure, Canada shows significantly less inequality than in 1971, 
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Figure 1a. The Continental Pattern of Inequality 
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Figure 1b. The Continental Pattern of Inequality 
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Figure 1c. The Continental Pattern of Inequality 
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a span of some 30 years. Using the 1997 figure, Canada shows no change since 1975. 
Switzerland also shows a statistically significant decrease in inequality; a Gini coefficient 9.4 
percent smaller in 2000 than in 1982. Spain has only two data points in the LIS database, but 
shows no change over the 1980s. 


A Caveat Country 
lreland represents a “hybrid” pattern where overall levels of inequality are more on par with 
the Anglo pattern (i.e., relatively high), but shows no change in inequality levels since 1987, as 


is the prevailing trend in the Continental pattern. 


A Caveat Group 
Three countries form a caveat group within the Continental pattern. Each basically exhibits the 


overall pattern, but each also experiences a significant increase in inequality in one survey in 
the 1990s. Luxembourg shows no change in inequality from 1985-1994, but an increase in 
inequality between the 1994 and 1997 surveys means inequality is significantly higher in 1997 
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Figure 1d. The Continental Pattern of Inequality 
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than in 1994. The same is true for neighboring Belgium which shows no change from 1985- 
1992, but a significant increase between the 1992 and 1997 surveys means inequality is 
significantly higher in 1997 than in 1992. Austria shows a significant increase between the 
1987 and 1994 surveys, but then has a statistically significant decrease in Gini coefficients 
from 1994 to 2000. The change in inequality from 1987 and 1994 may also be partially 
attributed to changes in survey sources for Austria, and in this case the two surveys have 
noticeably different sample sizes.® 


b. The Anglo Pattern — Divergence From the Continent 


As discussed in the introduction, the Anglo pattern is now a well known and much analyzed 
phenomenon. Often overlooked, however, is the fact that at the start of the 1980s, levels of 
inequality are in line with those found in Continental Europe. Beginning sometime in the late 
1970s through the early 1980s inequality then begins to increase, rapidly in some cases, leading 
to the continuous separation of these countries from inequality levels found on the continent. 


Exemplars of the Anglo Pattern 

Three counties in the LIS database illustrate the pattern (see Figure 2). For the United Kingdom, 
inequality levels remain constant until the 1980s, and at the start of the decade are lower than in 
France, Canada, Spain and Switzerland. Inequality then begins to steadily increase beginning in 
1979 and continues over the course of the period. Most evidence shows that inequality was also 
fairly stable in the United States prior to the 1980s, and at the start of the decade inequality in 
the United States is lower than Spain and Switzerland. Beginning in 1980, inequality begins to 
increase and also continues throughout the period. (The down-turn seen in the last survey is not 
statistically significant). The LIS has less data for Australia, but here the pattern is also repeated — 
continuously rising inequality throughout the period. 


c. The Scandinavian Pattern — Convergence to the Continent 


The Scandinavian pattern is characterized by relatively low levels of inequality at the start of 
the period that consistently increase throughout the 1980s and 1990s as they converge to 
levels of inequality seen on the Continent. This increase, however, still leaves these countries 
with perhaps the lowest levels of inequality in the world. 


Exemplars of the Scandinavian Pattern 

Three Scandinavian countries illustrate the pattern (see Figure 3). In Norway, Sweden and 
Finland, Gini coefficients at the start of the 1980s are as low as .197, and then increase by 18 
percent or more throughout the period. Although in Sweden, the Gini index in 2000 is still not 
higher than in 1967. 


d. Summary of Patterns in the Global North 


As opposed to a universal tendency toward rising inequality, the prevailing pattern in the global North 
(prevailing in that more countries fit this pattern than the other two) is the one found in Continental 
Europe (and Canada). This pattern is characterized by moderate levels of inequality that are generally 
maintained throughout the period (with a few countries experiencing declining levels of inequality). 
The other two patterns are both characterized by rising levels of inequality, but should be described 
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Figure 2. The Anglo Pattern of Inequality 
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Figure 3. The Scandinavian Pattern of Inequality 
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in relation to the Continental pattern — the context of the Anglo Pattern of rising inequality is 
divergence from Continental levels of inequality, while the context of the Scandinavian pattern of 
rising inequality is convergence to these levels. Data for circa 1980, 1990 and 2000 cross-sections 
are presented in Table 1, and the average Gini indices for the three patterns are plotted in Figure 4. 


Figure 4. Patterns of Distributional Change in the Global North, 1980-2000 
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As evidenced here, there are large differences in inequality trajectories for rich nations, and it 
seems that the tripartite typology of welfare-state regimes famously argued by Esping-Anderson 
(1990) still has descriptive power today. As Bradley et al. (2003) recently found, the different 
institutional configurations and political traditions underling Social-Democratic, Christian- 
Democratic and Liberal welfare states play a decisive role in determining distributional outcomes. 
Given that these countries face a common set of socio-economic phenomena — continued de- 
industrialization, aging populations, immigration from the global South — and experience them from 
the same advantaged locations within the global wealth hierarchy, the fact that distributional 
outcomes vary suggests that “globalization” has not usurped the importance of national policy or 
led to the insignificance of the state; world-economic processes do not compel any single national 
trajectory. This finding is consistent with recent literature, discussed at the outset, which 
emphasizes the importance of national political processes in counteracting market-driven 
inequality. Strong leftist government (Bradley, et al. 2003; Brady 2003; Kelly 2004), high levels of 
democratic participation (Mueller and Stratman 2003), and low public tolerance for inequality 
(Lambert, Millimet and Slottje 2008) are all associated with more equal income distributions. As 
summarized by Smeeding (2002a:28), “[t]he overall distribution of income in a country depends on 
the domestic political, institutional, and economic choices made by those individual countries.” 


Conclusions 


This project applied bootstrap resampling techniques to the income surveys of the LIS 
database in an effort to interpret patterns of distributional change in the global North over the 
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past 20 years. The utility of this evaluative tool for comparative research was evident, 
illustrating in particular that for a sizable number of countries, representing a “Continental” 
pattern of distributional change, inequality levels at the end of the period were not statistically 
distinguishable from levels at the start. Where previously small changes in the Gini index, 
such as those observed here, would lead scholars to make normative determinations of 
inequality trends, statistical inference provides new grounds to substantiate conclusions that 
“no change” in inequality has occurred.° It also suggests why others, evaluating inequality 
estimates in absolute terms, simultaneously argue that inequality has increased universally 
across the global North. These findings provide empirical support to recent arguments 
emphasizing the impact of political contexts and collective social forces on distributional 
outcomes, and in particular that such contexts have mitigated the tendency for rising 
inequality in Continental Europe. 

In the end, the ability to conduct formal statistical inference improves the quality of our 
empirical descriptions, which in turn helps focus theoretical explanations of distributional 
variation across countries and over time. As Efron and Tibshirani (1993:1) argue, the 
bootstrap provides accessible methods for “finding a real signal in a noisy background” 
while also facilitating checks against the over interpretation of random patterns. The 
descriptions presented here, for example, can be useful in reorienting future studies of 
inequality in the global North. They can move researchers from specification of the general 
relationship between globalization and inequality toward uncovering the processes by 
which specific institutional configurations and power arrangements can (1) modify the 
extent to which different sectors within a population are included/excluded from global- 
economic processes; and (2) shape the distribution of the gains and losses that result from 
such economic change. 


Notes: 


1. The other LIS member nations include the Czech Republic, Estonia, Hungary, Israel, 
Mexico, Poland, Taiwan, Romania, Russia, the Slovak Republic and Slovenia. Denmark is 
also a member country, but recent surveys were pulled from the database for quality 
control reasons. 


2. This procedure is called the “nonparametric bootstrap” because the bootstrap 
approximation of the standard error is based on the sample distribution, which is taken 
to be a nonparametric estimate of the unknown population distribution. 


3. The bootstrap procedures in this study employ resampling plans in the case of an 
independent and identically distributed (iid) sample of fixed size n from an unknown 
population. Although relatively undeveloped in the statistical literature, some have 
explored extensions of the /id bootstrap to data collected under complex sampling 
designs (Biewen 2002; Rao and Wu 1988; Jantti 2003). Due to the constraints of working 
with cross-national aggregations of survey data, however, the LIS does not allow us to 
evaluate the usefulness of these extensions. 


4. Efron and Tibshirani (1993:52) find that “very seldom are more than B = 200 replications 
needed for estimating a standard error” and “even a small number... say B = 50 is often 
good enough to give a good estimate...” To establish bootstrap confidence intervals, most 
acknowledge that B must be much larger, however the literature offers little formal 
guidance on establishing B for confidence interval estimation. 
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5. The simulations can also be used to establish the bias of a statistic, approximated as: 
N 
iN 1 A* 6 A¥ A 
gé= Be s aia =0 -98 


6. The LIS employs a remote access system, where user programs are sent to the LIS batch 
machines over the internet. 


7. Results for other LIS member nations can be obtained by contacting the author. 


8. The inequality trend for Austria further demonstrates the evaluative potential of the bootstrap 
hypothesis testing procedures. The decline in inequality from 1994 to 1997, for example, is 5 
percent overall in the Gini index, but a better conclusion would be “no change” in inequality 
as the p*-value for the difference is .041 (results not shown in the figure). It is not until the 
decline at 2000 reaches more than 7 percent in the Gini index is the difference statistically 
significant (0*-value of .008). 


9. While the bootstrap procedures are effective tools when one has access to the microdata, 
the findings presented here are potentially useful to researchers attempting to interpret 
differences and changes in Gini coefficients in situations where this access is limited or 
not possible. Specifically the hypothesis tests conducted here across various sample 
sizes suggest the following guidelines when comparing two Gini coefficients over time: 


* less than 2.5 percent difference = the difference is likely not statistically significant 


* between 2.5 and 5.0 percent difference = conclusion is ambiguous and likely determined 
by sample size and amount of sampling error, hypothesis test probably required 


* more than 5.0 percent difference = the difference is likely statistically significant 
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Appendix A: Estimating the Gini Index 


Gini coefficients estimated in this study are based on the following methodological 
recommendations of the LIS Project. 


Definition of Income — The unit of analysis is the household, and income is measured as net 
disposable household income: gross income - direct taxes - employee social security 
contributions 


Weights — Used to make adjustments in the relative influence of households in the survey to 
account for biases in the characteristics of the groups of non-respondents. The LIS does not 
carry out weighting procedures, they are post-weights constructed by the various survey 
sources themselves, so the exact impact of the weight variable varies throughout the 
database. 


While the bootstrap algorithm draws independent bootstrap resamples (i.e., equal probability 
selection), the Gini indices themselves are estimated using person weights — the household 
weight multiplied by household size — to spread the weight over the household members and 
thus account for differences in household size. The weights are standardized so that the 
weighted sample size equals the number of households instead of the population. The LIS 
standardizes person weights to handle deleted observations that nonetheless have non-zero 
weights (if one wishes to drop certain observations, for example, than the remaining weights 
will be adjusted accordingly). This procedure in no way affects the bootstrap results, which 
are identical with or without weight normalization. 


Equivalence Scale — Used to adjust household income for economies of scale in households 
with different numbers of people (i.e., because a household's economic needs are related in 
part to its size). The equivalence scale used in this study is the square root of household size. 


Top and Bottom Coding — Used to correct for possible measurement errors in the database. 
Incomes at the top of the distribution are capped at ten times non-equivalized median 
income. Incomes at the bottom are limited to one percent equivalized mean income. For 
comparable cross-national evaluations, bottom and top codes can not be set differently 
across the countries, the lowest top code and highest bottom code must be applied to all. 
These particular limits represent the lowest top code (Canada) and highest bottom code 
(Australia) of all the countries in the landmark LIS study (Atkinson, Rainwater, and Smeeding 
1995), and have been applied to all LIS datasets since. 


Cross-National Differences in the Skills-Earnings 
Relationship: 
The Role of Labor Market Institutions 


William Carbonaro, University of Notre Dame 





Abstract 

This study examines cross-national differences in returns to literacy skills and explores 
possible explanations for such differences. Data from the International Adult Literacy 
Survey (IALS) are analyzed using multilevel models. Returns to literacy skills are higher in 
liberal market economies (LMEs) than in social market economies (SMEs). Collective 
bargaining coverage emerges as the most potent predictor of cross-national variation in 
returns to skill. Cross-national differences in workers’ skill demands, skill profiles and 
welfare state regimes do not explain the observed differences in returns to skill. 


Numerous scholars have examined how structural factors shape the allocation of social and 
economic rewards in society (e.g., Brieger 1995 and Kerckhoff 1995). Examples include 
research on how returns to skill vary across dual labor markets (Dickens and Lang 1988: 
Doeringer and Piore 1971), occupations (Carbonaro 2005; Stolzenberg 1975), internal labor 
markets (Sorensen and Kalleberg 1981), firms (Baron and Bielby 1980; Sakamoto and Chen 
1991; Stolzenberg 1978), and boundaries of social class (Leicht et al. 1993; Wallace et al. 1993; 
Wright and Perrone 1977). Collectively, these studies have demonstrated how both individual 
characteristics and structural attributes work together to generate income inequality. 

Recently, political economists and some sociologists have focused their attention on even 
larger, more macro-level forces that might shape social and economic inequality in Western 
democracies. Numerous researchers have focused on the role that labor market institutions — 
“enduring legal and normative frameworks that establish how the price of labor is 
determined” (Western 1998:228) — play in the wage-setting process. Much of this research 
has focused on how institutional arrangements affect macroeconomic performance (e.g., 
Kenworthy 2002; Western and Healy 1999), as well as the overall level of income inequality 
in society (e.g., Rueda and Pontusson 2000; Wallerstein 1999). A small number of studies 
have shown that the importance of both education and cognitive skill vary across nations 
(Blau and Kahn 2001; Devroye and Freeman 2001), but these studies have failed to link how 
cross-national differences in institutional arrangements affect the process by which wages 
are allocated to workers. 

The main goal of this paper is to deepen our understanding of how macro-level, 
institutional characteristics affect the attainment process. In particular, | will examine how 
different institutional mechanisms governing the wage setting process affect how a key 
individual attribute — cognitive skill — is related to earnings. My findings suggest that one 
specific institutional attribute - the prevalence of collective bargaining — is a particularly 
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powerful predictor of variation in returns to skill across nations, even after controlling for 
numerous other institutional and skill demand factors. 


Variation in Income Attainment across Nations 


Educational credentials and cognitive skills are critically important determinants of earnings in 
modern industrialized nations (Farkas et al. 1997; Kerckhoff et al. 2001). However, the relative 
importance of education and/or cognitive skill for earnings varies across nations. For example, 
the earnings gap between college-educated and non-college-educated workers increased 
dramatically in the United States in the past 30 years (Bound and Johnson 1992; Murphy and 
Welch 1992), but this gap has either remained constant or increased less dramatically in other 
industrialized nations during the same time period (Freeman and Katz 1994; Morris and 
Western 1999). Several studies of the International Adult Literacy Study (IALS) have 
documented that the effects of both education and literacy skills vary across industrialized 
nations (Blau and Kahn 2001; Devroye and Freeman 2001; OECD 1997, 2000). 

Human capital theory is a helpful theoretical framework for understanding why returns to 
skill vary across nations. According to human capital theory, more skilled workers are more 
productive, and employers reward more productive workers with higher wages (Becker 1964). 
Institutional mechanisms within the labor market may weaken this relationship and reduce the 
skills-earnings relationship by curtailing the discretion of employers in setting wages. When 
employers agree to rules about compensation for broad classes of workers, they lose the 
ability to reward more productive (and by implication, more skilled) workers. Thus, it is critically 
important to understand how institutional mechanisms might affect the productivity-earnings 
link, and in addition, to understand how they vary across industrialized market economies. 

Studies of industrialized market economies suggest that capitalism does not function as a 
monolithic system of production; rather, there are “varieties of capitalism.” Hall and Soskice 
(2001) distinguish between two types of production systems. In /iberal market economies 
(LMEs), “firms coordinate their activities primarily via hierarchies and competitive market 
arrangements” and consequently their decision-making is often driven by considerations of 
short-term profitability. In contrast, firms in coordinated market economies (CMEs) “depend 
more heavily on non-market relationships to coordinate their endeavors with other actors .. .” 
(p.8). In describing these contrasting ideal types, Hall and Soskice focus on institutional 
“complementarities” across numerous dimensions, including corporate governance, access to 
capital, training systems, interfirm cooperation/competition and wage setting norms and 
practices. Rueda and Pontosson (2000) modify Hall and Soskice's typology slightly to focus 
more heavily on labor market dynamics. In their scheme, LME nations are contrasted with social 
market economies (SMEs) along three dimensions: welfare state provision, government 
regulation of employment conditions, and institutionalization of collective bargaining and wage 
coordination. In practice, these two conceptual schemes are virtually equivalent.’ In this paper, 
| will rely on Rueda and Pontosson’s terminology because it is more narrowly focused on labor 
market processes, which are most relevant for understanding income attainment processes. 


Labor Market Institutions and Income Attainment 


Labor market economists and economic sociologists have emphasized (both theoretically and 
empirically) the role of labor market institutions in governing the process by which rewards 
are allocated to workers (Freeman 1994; Western 1998). Nations with the least regulated labor 
markets generally have the highest returns to education and cognitive skill (Blau and Kahn 
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2001; Devroye and Freeman 2001). While this research supports the claim that “institutions 
matter,” it is less clear exactly which institutional attributes of labor markets actually affect the 
process by which rewards are allocated to workers. There are three main institutional 
characteristics that deserve attention: wage coordination, union density ane the prevalence 
of collective bargaining agreements. 

First, wage coordination, defined by Kenworthy (2001b:75) as “the degree of intentional 
harmony in the wage setting process,” might affect the allocation of rewards among workers 
by standardizing pay-setting norms across firms, industries and/or sectors of the economy. At 
one end of the continuum, wage bargaining is fragmented when it occurs at the plant or firm 
level; at the other, wage bargaining is coordinated when the state, unions and employers 
work together in setting wages. How might wage coordination affect the importance of skills 
and education for earnings? When wages are set at the firm or plant level, employers have 
more discretion to set both wage levels and criteria for allocating rewards to workers. This 
decentralized wage-setting process creates greater earnings inequality at the job level, as 
some job-seekers find and/or are selected into better paying jobs. The more that employers 
rely on skills to select workers for jobs, the stronger the skill-earnings relationship will 
become. If employers assume that skills increase productivity, they should prefer skilled labor 
over unskilled labor. In short, less coordination in wage setting should increase returns to skill. 

Second, /abor unions play an important role in standardizing outcomes for workers. In their 
classic study of labor unions in the United States, Freeman and Medoff (1986) found that unions 
decreased inequality in earnings among workers in two distinct ways. First, they found that 
unions managed to decrease overall earnings inequality by reducing earnings inequalities 
among white- and blue-collar workers. Thus, unions lowered inter-occupational inequality in 
earnings. Given that education and skill levels are related to entrance into different occupations, 
the importance of these attributes in determining earnings should decrease as well. Unions also 
decreased earnings inequalities in a second way: unions sought and implemented single-pay 
rates for similar workers with similar work conditions. Typically, such single-pay-rate systems 
standardize pay scales and reduce managerial discretion in allocating rewards. In particular, 
rewards are attached less to individual productivity in such a system; instead, they are more 
strongly linked to other attributes such as seniority.” Research indicates that returns to education 
are lower among unionized workers (Dickens and Lang 1988; Leicht et al. 1993), and Carbonaro 
(2005) found that returns to cognitive skills were lower in occupations with higher union density. 
In sum, higher union density should bring about more uniform wage setting and therefore 
should lower income returns to more educated and skilled workers. 

Finally, the prevalence of collective bargaining agreements is a third institutional 
mechanism that may be related to cross-national differences in returns to skill and education. 
Collective bargaining arrangements also tend to limit employers’ discretion in setting 
differential pay rates both between and within occupations. Because unions are usually key 
players in collective bargaining agreements, such agreements are likely to have standardized 
arrangements that resemble single-pay-rate compensation schemes (Chamberlain and Kuhn 
1986; Sisson 1987). The greater the percentage of workers covered by such agreements, the 
less discretion employers have in the wage-setting process. In addition, collective bargaining 
coverage may have more explanatory power than union density because collective bargaining 
agreements often extend beyond union members to non-union members as well. Thus, 
greater coverage by collective bargaining agreements should reduce returns to skill. 

Together, each of the three institutional mechanisms mentioned above has the potential to 
affect how skills and earnings are related. These factors may work independently of each 
other, or they may work as part of a larger causal chain. For example, higher levels of union 
density may indirectly decouple skills and earnings by increasing the prevalence of collective 
bargaining in the labor market. Alternatively, union density may succeed in reducing the skill- 
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earnings relationship only when institutional mechanisms such as collective bargaining are 
well-established. 


Alternative Explanations for Differing Returns to Skill across Nations 


While labor market institutions are undoubtedly important in the wage-setting process, other 
institutional differences might also affect how skills and earnings are related. Two main 
alternative explanations — differing skill profiles across nations and differing welfare state 
regimes — deserve consideration. If these alternative explanations can be eliminated, then the 
case for the importance of labor market institutions is strengthened. 

First, greater returns to skill in LME nations could be explained by differing skill profiles in 
LME and SME nations. Estevez-Abe et al. (2001) argue that SME nations increase the value of 
industry and/or firm specific skills in the labor market by offering greater “wage protection” 
(via greater employment and unemployment protection).? By wage protection, Estevez-Abe et 
al. (2001) mean that workers have some assurances that their skills will not lose their value 
due to chance market fluctuations. Lower levels of wage protection in LMEs make it riskier for 
both workers and firms to invest in industry- and/or firm-specific skills. Consequently, in LMEs, 
general skills garner the highest rewards. In contrast, by providing greater wage protection, 
SMEs reduce the risks involved in investing in industry- and firm-specific skills, and as a result, 
those skills reap greater rewards than general skills. In short, Estevez-Abe et al. (2001) predict 
that general cognitive skills should better predict earnings in LME nations while specific skills 
should have greater value in SME nations. 

Second, differences in welfare state regimes may also affect how skills and earnings are 
related. While nations differ in their socialspending, Esping-Andersen (1990) argued that 
differences in welfare state regimes can also be measured in terms of “decommodification.” 
Decommodification decreases the dependence of individual workers on the market for their 
well being. By providing more generous benefits and establishing them as entitlements linked 
to citizenship, decommodification strengthens the hand of the employee at the expense of the 
employer in the wage-setting process (p.22). By altering this balance of power in the labor 
market, decommodification could alter the skil-earnings relationship by disproportionately 
improving the bargaining power of less skilled workers. Since less skilled workers are more 
easily replaced than higher skilled workers, decommodification may provide more extensive 
alternatives to the labor market for low-skilled workers than high-skilled workers. Consequently, 
employers might be forced to make less skilled jobs more attractive by raising wages for these 
positions. Increased wages for less skilled workers would ultimately reduce returns to skill in 
the labor market by shrinking the earnings gap between low- and high-skilled workers. 


Research Questions 
To summarize, the main goal in this paper is to examine how macro-level institutional 
characteristics might affect the association between cognitive skills and earnings across 


nations. The three main research questions examined in this paper are: 


To what extent does the literacy skill-earnings relationship vary across 
LME and SME nations? 


How do institutional characteristics governing labor market processes 
affect cross-national variation in returns to skill? 
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Do differences in skill profiles and welfare state regimes affect cross- 
national variation in returns to skilled workers, and do these 
differences account for any observed relationships between 
institutional characteristics and returns to skill? 


Examining these questions will broaden our understanding of how structural aspects of labor 
markets affect the income attainment process. 


Data 


The analyses used data from the IALS. Data collection occurred in three waves: 1994 (Canada 
[French/English], Switzerland [German/French], Ireland, the Netherlands, Poland, Sweden and 
the United States), 1996 (Belgium [Flanders], New Zealand, Great Britain and Northern 
Ireland), and 1998 (Switzerland [Italian], Chile, the Czech Republic, Denmark, Finland, Hungary, 
Italy, Norway and Slovenia). This study will focus on western industrialized nations, and thus 
data from Poland, Hungary, the Czech Republic, Slovenia and Chile are not included in the 
analyses. Each nation drew its own random sample and data were collected using 
comparable measurement instruments in the vernacular(s) of each nation. 


Analytical Sample 


In the analyses that follow, data from two different units of analysis (or “levels”) were used: 
individuals and nations. Descriptive statistics for all variables used in the analyses are 
provided in Table 1. There are 18,628 individual workers in the analytical sample. The analytical 
sample included only full-time workers between the ages of 25-65 who had valid data for both 
income and occupation. Most of the excluded cases were individuals who were either not 
working full-time, or not asked the question regarding their work status by the interviewer. 
Fifteen Western industrialized nations comprised the second level of data. Each nation had a 
reasonably large sample (Table 2): the average was 1,151, with a low of 556 for Belgium and 
a high of 1,634 for Denmark. 


Dependent Variable 


In the analyses, the outcome of interest is the individual's position in the national distribution 
of earnings. Each respondent reported his or her gross income (before taxes) from a job; no 
other source of income was reported (e.g., interest, dividends, government assistance, etc.). 
These are important qualifications because nations vary markedly in both their taxation 
systems and the income assistance provided to workers by the state (Smeeding, O'Higgins 
and Rainwater 1990). The IALS does not provide the respondent's report of his/her income;* 
instead, it provides a measure that classifies an individual's earnings into one of five quintiles 
(i.e., the lowest 20 percent through the highest 20 percent of earners). This measure of 
income indicates an individual's re/ative position in a wage-earning hierarchy. 

There are some limitations in using income quintiles rather than a continuous measure of 
the respondent's income. The loss of precision in measuring income is an obvious limitation. 
While it would be preferable to use either a continuous measure or one with finer gradations 
of the income distribution (e.g., deciles, twentieths, etc.) in the analyses, quintiles still reveal 
the broad contours of the system that allocates economic rewards among workers. As we 
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Table 1: Descriptive Statistics — Means and Standard Deviations for Individual and Nation Level 
Variables 





Individual Level Variables 
LME SME Total Range 
(n=9,322) (n=9,306) (n = 18,628) Min Max 


Income 3.60 3.65 3.63 | 5 
(1.20) (1.16) (1.18) 

Age 40.75 41.11 40.93 Da 65 
(10.09) (9.82) (9.96) 

Age Squared 1,762.82 1,778.02 1,774.91 625 4.225 
(869.58) (835.67) (852.87) 

Weeks Worked 49.31 49.68 49 49 i 52 
(per Year) (8.62) (8.03) (8.34) 

Female 39 39 39 0 1 

Urban 62 64 63 0 1 

Native Born 87 .96 91 0 1 


Father’s Education 
ISCED 0/1 (Omitted) 


26 roe 29 0 1 
ISCED 2 “0 29 23 0 1 
ISCED 3 21 26 24 0 1 
ISCED 5 07 04 .05 0 1 
ISCED 6/7 .09 .09 .09 0 1 
ISCED 9 01 .003 01 0 1 
Respondent’s Education 
ISCED 0-2, 9 (Omitted) 30 24 27 0 { 
ISCED 3 Of At 36 0 1 
ISCED 5 ie 12 aye 0 1 
ISCED 6-7 ees 24 23 0 1 
Literacy Skills .26 45 36 -3.9 2.95 

(.91) (.74) (.84) 
Literacy Skill Demands head, 6.87 Indy 0 20 


4.38 4.21 4.31 
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Table 1: Descriptive Statistics — Means and Standard Deviations for Individual and Nation Level 
Variables (continued) 


a 


Individual Level Variables 


LME SME Total Range 
(n =9,322) (n=9,306) (n=18,628) Min Max 
Occupation 
Managers ato 13 14 0 1 
Professional 19 ah 18 0 1 
Technical 11 22 att 0 { 
Clerks 14 10 12 0 1 
Service 10 lA ai 0 1 
Skilled Agric. & Fishing 04 .03 03 0 
Craft 12 whe 12 0 1 
Machine Operators & 
Assemblers 11 07 .09 0 1 
Elementary (Omitted) .04 .05 .05 0 1 
Nation Level Variables (n = 15) 
LME SME Total Range 
(n =7) (n = 8) (n = 15) Min Max 
Wage Coordination 2.00 4.12 3.13 { 9 
Score (1.73) (.64) (1.64) 
Union Density 32 06 44 1p .90 
(.12) (.24) (.22) 
Collective Bargaining 42 84 .64 18 .95 
(.15) (.09) (.25) 
Unemployment Rate .081 .082 .082 .03 14 
(.034) (.029) (.030) 
~ Per Capita GDP 25.696 27.953 26.900 14.96 36.25 
(in $1,000s) (7.31) (5.02) (6.07) 
pe camimnediieston 21.82 32.66 27.60 13.80 39.10 
Score (5.12) (5.51) (7.60) 
Wage Protection .30 .69 00 3 85 
Ag a) 20 


Note: The numbers in parentheses below the means are standard deviations. An F-test 
indicated that LME-SME differences in means were statistically significant at .05 level for all of 
the variables in the tables, with the exception of female, age squared, and “craft” occupation. 
Categories may not add up to 100% due to rounding. Per capita GDP was computed in 1997 
dollars. 
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shall see, the findings reported here are wholly consistent with studies that analyze the 
limited cases in IALS with continuous data on income (Blau and Kohn 2001: Devroye and 
Freeman 2001). It must also be noted that SME nations have less overall variation in income 
than LME nations (Wallerstein 1999). Thus, in abso/ute terms, the meaning of the income 
quintiles differs across nations. My analyses do not capture these absolute differences: 
however, the income measure does reveal cross-national differences in how individuals aré 
sorted within the wage-earning hierarchy in each nation. As a result, the analyses probably 
provide a conservative estimate of cross-national differences in returns to skill and education. 


Independent Variables 


The IALS measured three different types of literacy that workers might use at work. Prose 
literacy is “the knowledge and skills needed to understand and use information from texts.” 
Document literacy is “the knowledge and skills required to locate and use information 
contained in various formats [e.g., tables, forms, graphs, maps, tables, etc.].” Finally, 
quantitative literacy is “the knowledge and skills required to apply arithmetic operations .. . 
to numbers embedded in printed materials [e.g., figuring out interest on a loan, filling out an 
order form] (OECD 1997:14.) Each of these three literacy skills was assessed using a pencil 
and paper test. The three types of literacy were averaged and served as my indicator of 
cognitive skill.° Literacy is clearly a genera/ rather than a specific skill. This distinction is 
important because, as already noted, the expectation is that general and specific skills will not 
garner the same rewards in LME and SME nations. 

Educational attainment is measured using a series of dummy variables based on the ISCED- 
76 coding scheme provided by the IALS. The ISCED-76 uses “levels” and “stages” to characterize 
all educational systems and serves as a valid measure of educational attainment across nations. 
Respondents were grouped into four main categories: ISCED 0-2, ISCED 3, ISCED 5 and ISCED 
6-7, which in the American system, are equivalent to “less than high school,” “high school 
degree,” “vocational degree” and “B.A. or more.” Two different measures were created to capture 
differences in job-level skill demands and other relevant aspects of workers’ jobs. First, ALS 
contains detailed information on how frequently respondents were required to perform literacy- 
related tasks at work. Eleven items on work tasks covered three main areas: reading, writing and 
math tasks.® A principal components exploratory factor analysis revealed that these 11 items 
loaded highly on one factor. An additive scale measuring workers’ job-level literacy skill demands 
was created from the items, and it had a reasonably high reliability (Cronbach's a = .784). Second, 
the respondent's occupation was also included in the analysis. Eight dummy variables to denote 
the respondents’ occupation were used in the analyses. By controlling for a workers’ occupation 
and allowing the effect of occupations to vary across nations, it is possible to account for other 
differences in skill demands and other factors that might affect the skil-earnings relationship. 

Finally, several background controls were used in the analyses. Female, “urban residence” 
and “native-born” were coded as dummy variables and father’s education was coded as a series 
of dummy variables where ISCED 0/1 is the omitted category. “Weeks worked” during the past 
year was also included to account for possible differences in labor market participation. The 
respondent's age and age squared served as controls for labor market experience. 


Nation-Level Indicators 


Two types of nation-level variables were used in the analyses. The first divides nations into 
two broad categories - LMEs and SMEs —- based on a wide range of institutional 
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characteristics. Rueda and Pontusson (2000) used data from Western industrialized nations to 
devise a “varieties of capitalism” measure. In their analysis, they measured three main 
aspects of market economies: (1) the magnanimity and comprehensiveness of a nation’s 
social welfare system, (2) the degree of government regulation designed to standardize 
employment conditions and job security, and (3) the degree to which collective bargaining and 
wage coordination is institutionalized within the labor market. LMEs have lower levels of 
social provision and less labor market regulation compared with SMEs. Table 2 lists which 
IALS nations were grouped into which categories. 

The second set of nation-level indicators focuses on cross-national variation on a wide range 
of institutional characteristics. Three different types of indicators are examined in the analyses: 
labor market characteristics, macroeconomic performance and welfare state provisions. 

First, labor market economists and economic sociologists have devised a myriad of ways to 
measure the institutional attributes of nations’ labor markets (Blyth 1987; Bruno and Sachs 
1985; Calmfors and Driffill 1988; Cameron 1984; Esping-Andersen 1990; Freeman and Gibbons 
1995; Kenworthy 2001a, 2001b; Schmitter 1981). Because | am interested in the particular 
institutional mechanisms that might affect labor market outcomes, | focus on three key 
institutional attributes: (1) wage coordination, (2) percent of the workforce covered by unions 
(a.k.a., “union density”), and (3) percent of the workforce covered by collective bargaining 
agreements. Wage coordination scores were borrowed from Kenworthy (2001a). In keeping 
with their most recent use in empirical analysis (Kenworthy 2002), the variable was treated as a 
continuous variable. Kenworthy’s measure of wage coordination focuses on the expected 
degree of coordination in wage setting in the labor market. The lowest degree of coordination 
involves “fragmented wage bargaining, confined largely to individual firms and plants,” and the 
highest degree of coordination involves “centralized bargaining by peak confederation(s) or 
government imposition of a wage schedule/freeze with a peace obligation . . .; informal 
centralization of industry level bargaining by a powerful monopolistic union confederation . . .; 
[or] extensive, regularized pattern-setting and highly synchronized bargaining coupled with 
coordination of bargaining by influential large firms” (Kenworthy 2002:374). 

“Percent union” and “percent collective bargaining” were taken from the OECD labor market 
indicator data file.’ For union density, the data are within one or two years of the time at which the 
IALS data were collected. For the data on collective bargaining coverage, information was only 
available for 1994. This measure is in perfect synch with the nine nations in the first IALS wave 
of data collection, but unfortunately is two years too early for nations in the second wave and four 
years too early for nations in the third wave. A comparison of the 1990 and 1994 measures of 
collective bargaining coverage reveals very little difference between them, thereby suggesting 
the 1994 measure is a reasonably good proxy for the unobserved 1996 and 1998 measures. 

The next set of indicators measure variation in macro-level economic performance. The 
two measures — percent unemployed and per capita GDP — were taken from the OECD labor 
market data file. Finally, two additional measures of welfare state provisions were included in 
the analyses. First, to account for possible differences in skill profiles across nations, | created 
a measure of “wage protection” which is an average of Estev-Abe et al.’s (2001) measures of 
employment and unemployment protection. In order to account for differences in welfare 
state regimes across nations, | used Esping-Anderson’s (1990) measure of labor market 
decommodication to denote workers’ protection from market forces. 


Missing Data 


There was no missing data at the nation level, but there was some minimal item non-response 
in the individual-level data. For indicators of age, father’s education, respondent's education level 
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and literacy skill demands, | used mean substitution with a “missing data” dummy flag. Father's 
education had the largest amount of missing data (8.8 percent of cases were missing), while the 
other indicators had less than 2 percent of cases missing. Although multiple imputation is 
preferable to mean substitution (Allison 2002; King et al. 2001), the amount of missing data is so 
minimal that the results are the same regardless of the missing data strategy Used. 


Methods 


In this study, cross-national variation in individual level relationships is the main focus. More 
specifically, | am interested in examining a series of cross-level interactions where the 
regression slope at the individual level becomes the outcome to be explained by nation-level 
predictors. Multilevel models, where individuals are nested within nations, provide an elegant 
and economical method for examining how individual level relationships vary across nations 
(Raudenbush and Bryk 2001). The dependent variable will be treated as a continuous variable 
in the analyses since the underlying distribution is continuous. Although other alternatives are 
possible, | decided to use linear regression models to analyze the data. 

For nation-level coefficients, standard errors do not have their conventional meaning 
because level-2 units are not a random sample of nations. Thus, | focus on the practical 
significance of the estimates and ignore the issue of statistical significance for nation-level 
indicators in my models. Because individual-level data were gathered through random 
sampling techniques, the standard errors retain their conventional meaning and were 
interpreted accordingly. | chose not to adjust for design effects when computing the level-1 
standard errors because the coefficients of interest were highly significant at the .000 level, 
and would not approach the .05 level after adjusting for the design effects. 

One potential problem with the analyses that must be addressed is mulitcollinearity among 
nation-level variables in the analyses. Since the nation-level measures were chosen precisely 
because they distinguish between LME and SME nations, it should not be surprising that many 
of them are fairly well correlated with one another. Agresti and Finlay (1986) discuss several 
problems raised by mulitcollinearity. The inflation of standard errors for the coefficients is the most 
commonly acknowledged problem associated with high levels of mulitcollinearity. Because we are 
interested in nation-level variables, and thus not relying on the standard errors, this is not an urgent 
problem. A more pressing concern is that a strong intercorrelation between independent variables 
renders the notion of statistical control in a regression analysis problematic. A very high correlation 
between X, and X> makes it difficult to separate the independent association of each variable 
with Y because there are so few cases where X, changes, but X9 does not. When presenting the 
results, | will discuss whether and how this issue affects the interpretation of the results and note 
various strategies for handling potential problems. 


Results 


Differential Returns to Skill and Education across LME and SME Nations 


The first task is to examine how the relationships between literacy skills, education and 
earnings vary across the two varieties of capitalism examined here. In model 1 (displayed in 
Table 3), individual-level controls for sex, family background (father’s education), experience 
(age and age squared), native-born, urban and weeks worked were included as predictors of 
income along with literacy skills and education. The only level-two predictor is the SME 
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Table 3: Labor Market Characteristics and Variable Returns to Literacy Skill and Education across 
Varieties of Capitalism 





Model 1 Model 2a Model 2b Model 2c Model 3 
Outcome: Literacy 


Slope 

Intercept (LME) Oe roe Ca roth et 
(.019) (.019) (.019) (.020) (.021) 

SME -.100* -.103 -.100 -.115* — 

Labor Market 

Characteristics 

Union Density 012 .018 

Collective Bargaining .010 -.190 

Wage Coordination .007 .002 

Outcome: Education 

ISCED 3 Intercept 

(LME) 202 a .196*** 2 ie 
(.030) (.030) (.031) (.030) (.031) 

SME -.030 -.030 -.035 -.028 -.052 

ISCED 5 Intercept 

(LME) 441*** 444*** .434*** .439*** .439*** 
(.040) (.040) (.039) (.040) (.039) 

SME -.051 -.050 -.059 -.057 -.102 

ISCED 6-7 Intercept 

(LME) .647°** .649*** SOU .646*** 657% 
(.056) (.056) (.057) (.055) (.058) 

SME -.100 -.108 -.120 -.099 -.195 

Variance Component 

Level-1 Intercept 851 851 852 851 852 


Level-2 Intercept (U,) “200 248 aia eres .236""* 
Literacy Slope 3.7E-03™  4.2E-03  3.8E-03*  4.0E-03"  5.0E-03** 


————— 
Note: Models 1-3 all include level-1 controls for gender, native born, weeks worked (per year), 
age, age squared, urban, and father’s education dummy variables. Level 1 variables were 
uncentered, while level 2 predictors were grand mean centered. Coefficients are 

unstandardized, and numbers in parentheses are standard errors. 

"p<.05 “p<.01 ™ p<.001 (two-tailed tests) 
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dummy variable; it is included as a predictor of the level-one intercept (mean earnings), the 
literacy skill slope and the education slopes. The level-two intercepts for literacy and 
education slopes were allowed to vary randomly in the model (as is the case for all 
subsequent models). For each slope, the intercept is the main effect and the coefficients 
below it are the interactions. 

In both LME and SME nations, literacy and education had significant, positive associations 
with earnings. However, as expected, the association between literacy skills, education and 
earnings varied across LME and SME nations. In model 1 in Table 3, the intercept for the 
literacy slope is .270. Because the SME indicator is entered as a predictor in the equation, the 
coefficient for the intercept represents the association between literacy and earnings in LME 
nations. The SME slope for the literacy-earnings relationship is calculated by subtracting the 
SME coefficient (because it is negative) from the main effect (.270 - .100 =.177). Thus, the 
relationship between literacy skills and income was only about two-thirds as large in SMEs 
when compared with LMEs (.177/.270 = 66 percent). Thus, the relationship between literacy 
skills and earnings is highest in nations with the least labor market regulation as prior research 
suggests. The same pattern holds for returns to education, but the difference in the 
education-earnings relationship is not as large: returns to education in SME nations are about 
85-90 percent as large as returns in LME nations. 

Before examining the main questions raised in the paper, it is important to eliminate one 
possible confounding factor from consideration in the models. Some studies indicate that 
two indicators of macroeconomic performance — unemployment and GDP — differ across 
LME and SME nations (Kenworthy 2002; Western and Healy 1999). Do differences in 
macroeconomic performance explain the differences between LME and SME nations 
observed in model 1? As Table 1 shows, the mean unemployment rate and per capita GDP 
are roughly the same in SME and LME nations. In addition, Table 2 indicates that the 
underlying distributions for unemployment and GDP are fairly similar across LME and SME 
nations. When these variables are included (either separately or together) as level-two 
predictors of mean earnings, literacy and education slopes, the results indicate that (a) 
unemployment and per capita GDP are unrelated to either returns to literacy or education, and 
(b) the difference in the literacy and education slopes across LME and SME nations is 
unchanged from model 1 (results not shown).° In short, the results suggest that we can 
eliminate differences in macro-level performance as a likely explanation of the LME-SME 
differences in returns to skilled workers. 


Returns to Skill and Institutional Characteristics 


Model 1 demonstrates that both returns to skill and education vary across market economies. 
The next set of analyses examines whether institutional characteristics of labor markets are 
related to returns to literacy, and whether they account for the observed differences across 
LME and SME nations. The correlations between labor market characteristics are reasonably 
strong (.406 through .477), but the variation across measures is sufficient to allow 
consideration of their independent contributions to returns to skill."° 

Models 2a-c add the three indicators of labor market characteristics separately to predict 
mean earnings and the literacy slope. Model 2a examines the role of union density in shaping the 
wage-setting process. The practical significance of the union density coefficient is roughly zero in 
model 2a: very large differences in union density across nations are associated with minute 
changes in the returns to literacy. Model 2a also indicates the LME-SME difference in returns to 
literacy skill remains largely unchanged after controlling for union density. Models 2b and 2c yield 
similar conclusions: neither the level of collective bargaining coverage nor the degree of wage 
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coordination in the labor market is related to the literacy-earnings relationship (net of SME status). 
In addition, neither indicator explains any of the LME-SME difference in returns to literacy skills. 

Model 3 includes the three indicators of labor market characteristics in the model 
simultaneously in order to account for the intercorrelations among the measures. The SME 
dummy is excluded from the analysis, because the three indicators collectively represent the 
defining features of LME-SME differences in labor market governance, and including it would 
be redundant. Union density and wage coordination are almost completely unrelated to the 
skills-earnings relationship in model 3. Collective bargaining is clearly the strongest predictor 
of returns to literacy skills across nations, with nations with the greatest collective bargaining 
coverage having the lowest returns to literacy skills. To illustrate the magnitude of the 
relationship, we can compare the literacy-earnings relationship for the nations with the lowest 
and highest levels of collective bargaining coverage: the United States (with a coverage rate 
of 18 percent) has coefficient of .236, while Germany (with 92 percent of workers covered) 
has a coefficient only 40 percent as large (.095)." 

In sum, the findings displayed in Table 3 suggest that collective bargaining is the best 
predictor of cross-national variation in returns to literacy skills. Some caveats are in order 
regarding these findings. First, of the three labor market characteristics examined in this paper, 
collective bargaining is the best predictor of SME status. The mean difference in collective 
bargaining coverage is about 40 percent, and the distributions for LME and SME nations do not 
overlap, with Ireland representing the high for LMEs (66 percent) and Denmark representing the 
low for SMEs (69 percent). (See Table 2.) Thus, it is entirely possible that collective bargaining is 
merely serving as a proxy for some other defining characteristic of SME nations that affects 
returns to literacy skill. | will address this issue in analyses presented in the next sections. 

Second, if collective bargaining coverage is primarily responsible for lowering returns to literacy 
skills, this relationship should be present not just between LME and SME nations, but within each 
group as well. To examine this issue further, | created separate samples for LME and SME nations 
and re-ran model 2b (excluding the SME dummy). Overall, the results (not shown) indicated that 
collective bargaining was unrelated to returns to skill in LME nations, but (as expected) greater 
coverage lowered returns for skills in SME nations. One possible explanation for this difference is 
that collective bargaining may only lower returns to literacy skills when it surpasses some critical 
threshold (e.g., 65 percent). It is also possible that some other critical ingredient that makes 
collective bargaining relative to lower returns for skills is missing from labor markets in LME 
nations. However, with so few cases and so little variation in collective bargaining within each 
group of nations, it is difficult to draw firm conclusions from these supplementary analyses. 


Cross-National Variation in Skill Demands and Returns to Literacy Skills 


The foregoing analyses have focused exclusively on labor market characteristics of LME and 
SME nations. Some scholars have noted that LME and SME nations also differ in their “skill 
profiles,” with LMEs rewarding general skills and SMEs rewarding industry- and even firm- 
specific skills (Estev-Abe et al. 2001). No studies have accounted for the role of skill demands 
in creating the differential returns to skill among LME and SME nations. Table 1 indicates that 
literacy skill demands are slightly higher in LME nations than in SME nations. (An ANOVA 
indicates that this difference is statistically significant at .000.) 

The difference in skill demands among LME and SME nations could affect the literacy- 
earnings relationship in two ways. First, because workers in LME nations are required to perform 
literacy-related tasks more frequently at work, literacy skills might be more strongly linked to their 
levels of productivity and consequently make literacy skills a stronger predictor of earnings than 
in SME nations. Second, some research suggests that more skill-intensive jobs and occupations 
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Table 4: Skill Demands, Occupations and Variable Returns to Literacy Skills across Varieties of 
Capitalism 


—— 


Model 4 Model 5 
(Plus Skill — (Plus 
Demands Occupations 
Outcome: Literacy Slope 
Intercept (LME) Oe 183*** 
(.015) (.013) 
SME ae or -.103** 
Outcome: Education 
ISCED 3 Intercept (LME) aot .099*** 
(.024) (.024) 
SME -.023 -.043 
ISCED 5 Intercept (LME) 3407. oe 
(.034) (.035) 
SME -.034 -.055 
ISCED 6-7 Intercept (LME) .549*** 346"** 
(.050) (.051) 
SME -.105 -.099 
Outcome: Literacy Tasks Slope 
Intercept (LME) .046*** {038*** 
(.003) (.002) 
SME .010 .008 
Variance Component 
Level-1 Intercept 818 882 
Level-2 Intercept (U,) 2. 285" 
Literacy Slope 2.2E-03* 1.1E-03 
Literacy Task Slope 8.0E-05*** 5.0E-05* 





Note: Models 4 through 5 include level-1 controls for gender, native born, weeks worked (per 
year), age, age squared, urban, and father’s education dummy variables. Level 1 variables were 
uncentered, while level 2 predictors were grand mean centered. Coefficients are 
unstandardized, and numbers in parentheses are standard errors. 


*p<.05 “p<.01 ™ p<.001 (Two tailed tests) 


have rewards attached them above and beyond the skills and education of the workers holding 
them (Carbonaro 2005; Farkas et al. 1997). It is possible that varying returns to literacy skill actually 
reflect varying rewards attached to job-level skill demands across LME and SME nations. 

To account for these possible confounding factors, model 4 includes the literacy skill demand 
measure as a predictor of earnings in the analysis. In addition, the SME indicator is included as a 
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predictor variation in returns to literacy skill demands across nations. The results (shown in Table 
A) indicate that literacy skill demands are positively related to earnings. In addition, literacy skill 
demands are a stronger predictor of earnings for workers in SME nations by about 20 percent. 

In order to understand whether job level skill demands play a role in LME-SME differences 
in returns to literacy, models 1 and 4 must be compared. Controlling for literacy skill demands 
reduces returns to skill in both LME and SME nations. However, differences in literacy skill 
demands account for less of the literacy skill-earnings relationship in LME nations (19 
percent= .220/.270) than in SME nations (38 percent=.113/.176). Thus, the LME-SME 
difference in returns to literacy skill is not explained by differing skill demands faced by 
workers. In fact, the findings indicate that when workers with the same literacy skill demands 
at work are compared, the LME-SME difference in returns to literacy actually grows larger, 
from a ratio of roughly 3:2 to a ratio of more than 2:1 in model 4. 


Occupational Attainment and Cross-National Variation in Returns to 
Literacy Skills 


The next set of analyses focus on whether occupational sorting plays a role in differing returns 
to skill in LME and SME nations. Differing skill demands are a defining feature of occupations 
(Rothman 1985). Skill demands other than literacy skill demands may also be important in 
determining the importance of skills for earnings. Estev-Abe et al. (2001) argued that industry- 
specific and firm skills matter more in SME nations than in LME nations. By comparing workers 
in the same occupations, and allowing occupational effects to vary across nations, it is possible 
to better account for the impact of differing skill profiles in LME and SME nations in the analysis. 

Model 5 (Table 4) is a fixed-effect model that estimates returns to literacy skills among 
workers in the same occupation. By adding controls for occupational attainment, any factors 
across these nine occupational groups that contribute to the literacy-earnings relationship are 
held constant. In addition, the occupational effects on earnings are allowed to vary across 
LME and SME nations in the model. Thus, if occupational skill demands or any other 
occupational factor affecting the wage setting process varies across LME-SME nations, these 
differences will be accounted for in the model. 

The findings suggest that about 20 percent of the literacy-earnings relationship is due to 
factors associated with occupational sorting. However, the LME-SME difference in returns 
to literacy skills remains roughly the same after controlling for occupational attainment. Thus, 
while occupational sorting partly explains why literacy skills are related to earnings, it does not 
explain why returns to literacy skills vary across LME and SME nations. 

In sum, models 4 and 5 account for differing skill profiles across nations by comparing 
returns to skill for individual workers who hold jobs with similar skill profiles. The findings 
suggest that LME-SME differences in returns to skills are not due to differing skill profiles 
among workers in LME and SME nations. The next set of models explores this alternative 
hypothesis with additional nation-level data. 


Welfare State Provisions and Cross-National Variation in Returns to 
Literacy Skill . 


The final set of models examines whether other institutional characteristics explain cross- 
national variation in returns to literacy skill and the robust relationship between collective 
bargaining and the skills-earning relationship. First, as discussed above, nations with more 
regulated labor markets have differing skill profiles. In addition to the controls used in models 
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4 and 5, Estev-Abe et al. (2001) suggest that wage protection is a good predictor of a nation’s 
skill profile. If returns to literacy skills vary across LME and SME nations because of their 
different skill profiles, then wage protection should predict this variation. Second, the degree 
of decommodification provided by the welfare state may alter how rewards are allocated in 
the labor market. By increasing the power of workers in SME nations to stay out of the labor 
market, greater decommodification may allow low-skill workers to improve their chances of 
moving up in the income distribution. 

Multicollinearity poses some major challenges for the analyses that follow. The correlation 
between the wage protection and decommodification measures is quite strong (r = .826), and 
together these variables explain roughly half of the variation in collective bargaining (adjusted 
R squared = .558). Separating the independent associations between the two welfare state 
provision measures and the outcome will be difficult due to their strong intercorrelation. For 
collective bargaining, the problem is not as severe: the welfare state measures leave about half 
of the variation in collective bargaining unaccounted for, so it is possible to get some sense of 
how collective bargaining is related to returns to literacy net of welfare state provisions. 

In model 6 (see Table 5), wage protection is included as a predictor of returns to literacy 
skills along with the SME indicator. The results indicate that wage protection is unrelated to 
cross-national variation in returns to literacy skills. The SME indicator is practically unchanged 
from model 5. Model 7 replaces the wage protection measure with the decommodification 
measure. Not surprisingly, given the strong correlation between wage protection and 
decommodification, model 7 mirrors the results seen with model 6: decommodification is 
unrelated to returns to literacy skills across nations, and the LME-SME difference in returns to 
literacy is unchanged. Thus, it appears that neither a nation’s skill profile nor its welfare state 
regime explains LME-SME differences in returns to literacy skills. 


Table 5: Institutional Characteristics and Variable Returns to Literacy Skill and Education across 
Varieties of Capitalism 





Model 6 Model 7 Model 8 
Outcome: Literacy Slope 
Intercept (LME) Gd Selene .186*** 
(.014) (.014) (.016) 
SME -.104* -.094* ---- 
Institutional Characteristics 
Wage Protection 5.8E-04 -.057 
Decommodification -8.2E-04 5.2E-04 
Collective Bargaining -.086 
Variance Component 
Level-1 Intercept 178 118 118 
Level-2 Intercept (U,) 241 222 .240 
Literacy Slope 1.2E-03 1.2E-03"* 2.2E-03 





Note: Models 6-8 all include level-1 controls for gender, native born, weeks worked (per year), 
age, age squared, urban, father’s education dummy variables, literacy skill demands, ; 
respondent’s occupation, and respondent’s education. Level 1 variables were uncentered, while 
level 2 predictors were grand mean centered. Coefficients are unstandardized, and numbers in 
parentheses are standard errors. 

p<.05 “p<.01 “p<.001 (Two tailed tests) 
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The final model (model 8) includes the measures of the three institutional characteristics 
of LME and SME nations in the model. The results are similar to prior analyses in that 
collective bargaining coverage is a strong predictor of returns to skill across nations. However, 
collective bargaining coverage is less important in predicting the literacy slope in model 8 than 
in model 3, where skill demands and welfare state characteristics are not held constant. 
Model 8 also indicates that, as predicted, greater wage protection is related to lower returns 
to skill. Thus, literacy skills are more important for earnings in nations with more general skill 
profiles, less important in nations with more specific skill profiles. Because wage protection 
and collective bargaining coverage are strongly correlated, they work together to lower the 
literacy-earnings relationship in SME nations. Finally, decommodification remains unrelated to 
the literacy slope. 

Interestingly, the results for model 8 do not fit the typical pattern of an analysis that is 
hopelessly affected by mulitcollinearity, where coefficients and t-statistics change dramatically 
from prior, reduced models (Agresti and Finlay 1986). In addition, it should be noted that model 
8 also controls for skill demands and occupations, thereby allowing us to make controlled 
comparisons among workers with similar occupations and jobs when accounting for 
institutional differences across LME-SME nations. 


Discussion 


Understanding Differences in Returns to Skill in the Cross-National Context 


Prior research in the field of stratification has focused on the importance of both individual 
characteristics and structural attributes of labor markets in the attainment process. This paper 
adds to that literature by examining how cross-national differences in macro-level factors 
affect the distribution of rewards among workers with varying skill levels. Overall, the analyses 
reported here suggest both differences and similarities across nations in how rewards are 
distributed to workers. Both LME and SME nations reward individual attributes such as 
education and cognitive skills and structural characteristics such as the skill demands of 
workers’ jobs. However, the relative importance of each of these characteristics differs across 
nations. Most prominently, in SME nations, literacy skills are less important in sorting workers 
into different spots in the income distribution. 

Overall, the findings suggest labor market institutions shape the process that links 
workers’ skills and economic rewards in the labor market. Furthermore, the analyses provide 
some evidence that one specific institutional practice — collective bargaining — is critically 
important in explaining why returns to skill vary across LME and SME nations. Differing skill 
profiles across nations also predicted cross-national differences in returns to skills, but the 
alternative hypotheses examined did not change this main finding. 

Some caveats are needed when interpreting this finding. It is overly simplistic to identify 
collective bargaining as the “cause” of lower returns to skills. As Hall and Soskice (2001) point 
out, there are many “institutional complementarities” within LME and SME nations, and 
collective bargaining is just one component of this complex web of factors. It is possible that 
collective bargaining coverage indicates the successful implementation of other institutional 
strategies — e.g., wage coordination, single pay rates advocated by unions, etc. — in shaping 
the wage-setting process. Thus, collective bargaining may be a proxy for the prevalence of 
other institutional practices that diminish employer autonomy in the wage setting process. 

The findings have important implications for human capital theory, as well as “structural” 
accounts of labor market stratification. Skills clearly matter for earnings in a wide variety of 
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capitalist nations, as human capital theory predicts. However, the findings reported here suggest 
human capital theory should be nested within a larger theory that accounts for the ability of 
institutions to change the “rules of the game” when allocating rewards among workers. This 
conclusion is consistent with a large number of studies in the stratification literature that have 
provided such a correction. (See Kerckhoff 1995 for a review.) This study adds a new layer of 
complexity to those accounts and broadens our thinking about how structure matters. 

The findings reported here also deepen our understanding of how stratification processes 
operate in different social and political contexts. The findings regarding the greater 
importance of skills for earnings in LMEs take on added importance when we consider that 
income inequality is greatest, low earnings are most common, and welfare state provisions 
are most miserly in LME nations. Hence, less skilled workers in LMEs are doubly punished for 
their low-skill levels by (1) having an enhanced probability of being at the lower end of the 
income distribution, and (2) being objectively worse off than their counterparts in the same 
relative position in the earnings distribution in SME nations. In short, not only do the rules of 
the game regarding “who gets what” differ across varieties of capitalism, the stakes of the 
game vary as well. 

One might conclude that LMEs are more meritocratic than SMEs because cognitive skill 
and (to a lesser extent) education are stronger determinants of earnings in such nations. Such 
a conclusion is premature for two main reasons. First, while it is true that both education and 
skills are characteristics that are achieved rather than ascribed, opportunities for skill 
development and educational attainment are not equally distributed across industrialized 
societies. If educational success is strongly related to ascribed characteristics in LME nations, 
meritocracy may be somewhat illusory. Second, cognitive skills are by no means the only 
determinant of workplace performance; studies of American workers suggest that cognitive 
skills make up only a small part of workplace performance (Wigdor and Green 1991). Thus, we 
cannot safely conclude that SME nations fail to reward worker performance more heavily than 
LME nations; rather, SME nations may simply weigh other non-cognitive skills related to 
productivity (e.g., soft skills, firm-industry specific skills, etc.) more heavily when allocating 
rewards to workers. 

It might be also be tempting to view SME nations as less “efficient” than LME nations 
because institutional mechanisms “interfere” with market forces that would otherwise tighten 
the link between skills and earnings. This interpretation of the findings is misguided. The 
“varieties of capitalism” framework highlights the fact that cross-national differences in 
institutional arrangements impose different constraints upon actors within the economy (Hall 
and Soskice 2001). Since SME nations are less constrained by short term profits, employers 
have more freedom to allocate economic rewards in ways that deemphasize skill differences 
(relative to LME nations). Thus, it is wrong-headed to judge LME-SME differences in returns 
to skill on the basis of an abstract notion of “efficiency.” Instead, future research should focus 
on examining how institutional characteristics are related to differences in stratification 
systems across nations. 

One fruitful way in which future research could improve our understanding of how income 
attainment processes differ across nations would be through detailed cases studies of selected 
nations. These case studies would focus on the legal and normative practices that govern the 
wage-setting process. In particular, they would examine whether and how macro-level 
processes (such as collective bargaining and wage coordination) are linked with firm-level 
decision making about wages in selected LME and SME nations. How are workers’ general 
skills assessed and used to sort workers into different jobs? How is the productivity of individual 
workers assessed and/or rewarded? How does collective bargaining constrain employers when 
allocating economic rewards to individual workers? The answers to these questions would 
greatly enhance our understanding of how inequality is structured differently across nations. 
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Notes 


ihe 


Switzerland is the discrepant case in the two schemes. Hall and Soskice (2001) classify it 
as a CME, while Rueda and Pontusson (2000) label it an LME. The discrepancy most likely 
stems from the greater attention devoted to labor market and welfare state characteristics 
by Rueda and Pontusson (2000) in contrast with the broader array of characteristics 
considered by Hall and Soskice (2001). 

It is possible that the preference for single-pay rates might reduce overall productivity by 
undermining managers’ efforts to reward individual workers’ performances. Freeman and 
Medoff's (1986) findings suggest that unions actually raise productivity by giving workers 
greater voice and encouraging better management practices. 


“Employment protection” is institutionalized employment security: “the higher the 
employment security, the less likely a worker will be laid off even during economic 
downturns.” (Estev-Abe et al. 2001:150) “Unemployment protection” is “protection from 
income reduction due to unemployment.” (Estev-Abe et al. 2001: 150) 


Though not publicly available, continuous measures income data are recoverable for the 
U.S., Canada, Germany, the Netherlands, Poland, Switzerland and Sweden. Blau and Kahn 
(2001) argue that data for Germany are not usable because they are not comparable to the 
recovered data in the other nations. 


To avoid “ceiling effects” in the test, and that the literacy measure made more fine-grained 
distinctions in people's literacy skills, the IALS had individuals respond to a small number 
of representative items drawn from a larger pool of 165 items. The items on the test are 
classified in terms of five levels of difficulty from requiring respondents to perform very 
basic tasks, such as finding one piece of information that literally matches what is asked 
in the question, to requiring respondents to use high-level inferences, process 
“conditional information,” and/or perform multiple operations sequentially. Item response 
theory (IRT) was then used to generate five plausible values for each of the three types of 
literacy. Following prior research (Raudenbush and Kasim 1998; Kerchoff, Raudenbush 
and Glennie 2001), | decided simply to average the five plausible values for each type of 
literacy to get one unbiased estimate of each individual's prose, document and 
quantitative literacy skills. After creating this measure, | standardized it so that it has a 
mean of zero and standard deviation of one. 


For reading tasks, respondents were asked about the following six activities: “How often 
do you read or use: (a) letters or memos, (b) reports, articles, magazines or journal, (c) 
manuals or reference books, including catalogues, (d) diagrams or schematics, (e) bills, 
invoices, spreadsheets or budget bills?, and (f) directions or instructions for medicines, 
recipes or other products as part of your main job?” Four items indicated respondents’ 
writing related task: “How often do you write or fill out (a) letters or memos, (b) forms or 
things such as bills, invoices or budgets, (c) reports or articles, and (d) estimates of 
technical specifications as your part of a main job?” Finally, for math tasks, one item was 
used in the scale: “How often do you use mathematics as part of your main job to 
calculate prices, costs or budgets?” 


This data file is a compilation of information collected from (a) reports and statistical abstracts 
or (b) best estimates provided by academic experts from each OECD nation. One exception 
is the estimate of collective bargaining in Ireland, which was unavailable in the OECD data. 


oO 


—s 
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The best available estimate (collected in the year 2000) comes from the Employment in 
Europe 2003 report published by the European Commission. These data were provided by 
the European Industrial Relations Observatory (EIRO) and are generally consistent with other 
descriptive accounts of collective bargaining practices in Ireland during the mid-1990s. 


Since the variable is actually an ordinal variable that approximates a continuous variable, 
there are several different available modeling strategies that could be used. The three main 
possibilities are: (1) treating the outcome as a continuous variable and using linear 
regression, (2) treating the outcome as ordinal and using an ordinal logistic regression, and 
(3) treating the data as categorical and using multinomial logistic regression. Each approach 
has its advantages and disadvantages. An ordinal logit regression would be appropriate 
given the dependent variable, but the parallel regression assumption is rarely met when 
using the ordinal logit (Long and Freese 2001). A multinomial logit is another viable option 
with an ordinal dependent variable, but the results are often difficult to interpret and convey 
to the reader, especially with five categories (as is the case here). In order to get a sense of 
how each method affected the findings, | analyzed the data using each of the three methods 
using the same dependent and independent variables. Although making comparisons 
across these three types of models is somewhat cumbersome, the results led to very similar 
conclusions about the data. Given the consistency of the results across these three 
methods, | decided to use the linear model because it is the simplest and least problematic. 


The correlation between unemployment and GDP growth is reasonably strong (-.531), but 
the correlations between these measures and the other nation-level measures range from 
small (.057) to modestly strong (.387). When included as predictors of the other nation 
level predictors, neither measures increases the adjusted R-squared much, thereby 
suggesting mulitcollinearity is not a pressing problem with these variables. 


. Alternatively, when the other two labor market indicators are used as predictors of 
variation in the remaining labor market measures, the adjusted R? ranges from .124 to .18. 
This suggests that each variable has a substantial amount of unexplained variation after 
accounting for the other two. 


. The literacy slopes for each nation are calculated by multiplying the collective bargaining 
coefficient (-.19) by the level of collective bargaining coverage for each nation and then 
subtracting that from the overall literacy slope (.270). 
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Ecological-Evolutionary Theory: Principles and Applications 
By Gerhard Lenski 
Paradigm Publishers, 2005. 251 pages. $67 (cloth); $28.95 (paper) 


Pursuing general principles at the broadest scale of societal comparison provides the most 
ambitious stage on which seekers of a socio-logic of history confront historical particularism. 
In Ecological-Evolutionary Theory, eminent-emeritus sociologist Gerhard Lenski provides a 
lucid capstone statement of his own distinguished efforts to draw lessons from features and 
fates of human societies past and present, large and small. The scope of what Lenski means 
to address using ecological-evolutionary theory (hereafter EET) can be seen from the four 
applications provided in the second half of the book: “The Origins and Early Development of 
Ancient Israel,” “The Rise of the West,” “Trajectories of Development among Societies,” and 
the failures of “Marxist Societies.” While some historians undoubtedly will be distressed to 
see each of these topics dispatched in about 20 pages, Lenski’s endeavor is to understand 
the patterns visible from exactly such a high-altitude view, and he offers compelling 
arguments that a larger and explicable structural storyline really does exist in each case. 

Indeed, the brevity of the applications belies Lenski’s view that a major contribution of EET 
is that it actually offers a more complicated and nuanced view of society than other theories 
with similar ambitions. At one point, Lenski presents a very incisive table that compares EET 
to [10] other theories in terms of the attention given to six “determinants of social system 
characteristics (ideologies).” (p. 128) In other words, everything competitors postulate as 
being important for understanding societal composition and change EET also regards as 
important — if not always as important — while arguments against particular competitors often 
turn on their inattention to something EET emphasizes. 

EET seems pitched most against those who would give ideas, culture or social 
organization too much autonomous credit in the determination of history, emphasizing 
instead the continual exertion of material constraints on societal possibility. Human societies 
are constrained first by the behavioral dispositions encoded in our genes, then by the 
available resources in society's physical environments, and finally by the technologies 
available to extract resources. Genetic constraints are used to explain what humans can do 
given environmental and technological constraints. Environmental features are invoked to 
explain differences across societies with similar technological sophistication, while 
technological change carries the main explanatory weight for explaining social change. 
Although not a technological determinist, Lenski is an enthusiastic proponent of the centrality 
of technology for understanding social change. His outline of promising directions for future 
research is mostly preoccupied with expanding appreciation of the influence of technology, 
especially in such seemingly idea-centered domains as religion and the arts. 

As technology opens up possibilities for social change, human nature is depicted largely 
as providing inviolable counterweights that keep societal history in check. Sociology’s 
pathological history of histrionic, credibility-torching resistance to even relatively mild 
invocations of “biology” makes any serious acknowledgment of its importance welcome. Still, 
arguing that human nature is a necessary part of theories of human society goes nowhere 
by itself, and the real theoretical work is in articulating a conceptualization of human nature 
that is both biologically and psychologically accurate and theoretically fecund. Here, Lenski’s 
discussion of human nature is disappointing and devolves mostly into vague axioms: “human 
action is governed by emotions as well as reason” and “humans economize most of the 
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time.” (p. 48) Proponents of EET might be better served by closely considering the ideas 
about the co-evolution of human psychology and society presented in Boyd and Richerson’s 
Not by Genes Alone, which seems otherwise quite resonant with Lenski’s thought. 

Like many others, Lenski proffers the failures of Marxist societies as the chief cautionary 
tale about the disciplining of idealistic visions by human nature. Whatever its merits, the 
cogency of the example is likely less for those younger social scientists who have trouble 
understanding exactly how so much of the academy took the more utopic flights of Marxism 
so seriously anyway. Indeed, a larger problem with Lenski’s legacy may well be that the world 
has shifted so congenially toward his ideas as to make it hard to appreciate their novelty. 

Regardless, the idea of the limits imposed by human nature has severe limits of its own. As 
our understanding of human psychology and physiology improves, so too does the potential 
for technology to intervene on human nature. While | do not suspect that such interventions 
will be used to overthrow capitalism, technology could lead human beings to exhibit a good 
deal more malleability that suits capitalist imperatives than anything a pure reading of our 
genome would ever suggest. Perhaps it already has. More generally, just as innovation 
increases the resources that can be intentionally expropriated from physical nature, so too 
does innovation transform what can be intentionally expropriated from human nature. 

Strangely, despite the centrality of technology to EET, Lenski gives little consideration to 
how new technologies arise. Others might regard a crucial feature of contemporary 
developed societies to be precisely their intentional, almost obsessive, pursuit of innovation. 
Lenski is likely right in his skepticism toward certain claims about the role of ideas and social 
organization in history, but he may simultaneously underestimate the importance of ideas and 
organization in the contemporary development of innovation. Innovations in the production of 
innovations may be precisely what now have set us careening along unprecedented 
discontinuity from anything human history has before observed. 


Reviewer: Jeremy Freese, University of Wisconsin-Madison and Harvard University 





The Social Logic of Politics: Personal Networks as Contexts for Political Behavior 
Edited by Alan S. Zuckerman 
Temple University Press, 2005. $72.50 (cloth); $25.95 (paper) 


With few exceptions, a clean but unfortunate division of labor characterized political research 
in the past three decades of the 20" century. The discipline of political sociology took on the 
interface between political institutions and social institutions, characterizing social 
movements, considering the sources and consequences of the development of welfare 
states, developing critical theories of power and hegemony, fighting an exhausting battle 
between new institutionalism and power elite theory, and examining the implications of 
interlocking directorates for coordinated corporate political action. Political science, on the 
other hand, maintained disciplinary jurisdiction over the study of formal political institutions 
such as legislatures, courts, executive leadership, political parties and elections. 

As Alan S. Zuckerman reminds us in his historically engaging and thematically rich 
introduction to the edited volume The Social Logic of Politics, this division had not always been 
observed. Indeed, modern election studies were grounded in the work of Paul Lazarsfeld and 
his colleagues in the Columbia University Department of Sociology. Lazarsfeld recognized that 
political institutions are social institutions at heart, in the Simmelian sense of interdependent 
individuals interacting in a web of overlapping group affiliations. Zuckerman recounts 
decisions by political scientists in the 1950s to reject the sociological model on 
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methodological grounds (hypotheses describing interdependence are incompatible with 
Statistical tests demanding independence), strategic grounds (if political behavior is merely 
one instance of social behavior governed by sociological rules, political science becomes 
merely one instance of sociology), and theoretical grounds (politics is not a matter of 
influence through interaction, but rather is driven by rational calculators with preferences). 

The Social Logic of Politics convincingly demonstrates that these grounds for rejecting 
interaction-based explanations of political preference and behavior are no longer valid. A variety 
of modeling techniques able to handle instances of autocorrelation, including network effects, 
hierarchical linear and structural equation models, are employed to useful ends to test 
hypotheses regarding the impact of interdependent interaction on political behavior. Across these 
and other methodologies, in cross-sectional and longitudinal analyses, social context is 
consistently found to be associated with expressions of political preference and political behavior 
such as voting, even when controlling for the social and economic position of individuals. 

Refutation of strict individualism in political theory might have sufficed two decades ago 
to grab our attention. Since then, however, a number of studies have demonstrated the 
impact of social ties on the course of political phenomena such as social movements, 
corporate political behavior, legislative coalitions, and the international diffusion of policy 
innovation. Zuckerman wisely focuses on a subject relatively untouched by recent sociology: 
electoral preference and behavior. Contributions to the volume not only establish the 
relevance of a variety of social ties (friends, spouses, parents, neighbors, coworkers), but also 
attempt to explain variation in the strength of the force of social ties on politics. The answers 
here can be strikingly counterintuitive. Robert Huckfeldt, Paul Johnson and John Sprague 
counter Solomon Asch’'s comforting lesson that the presence of one dissenter can dissipate 
conformist pressures; in political communication networks, they find that views of lone 
dissenters are often ignored. Laura Stoker and M. Kent Jennings demonstrate that even as 
gender gaps in political preference widen in the aggregate, they narrow between particular 
married couples over time. Jeffrey Levine finds that casual acquaintances exert no weaker 
an influence on political preferences than close friends do. 

Although the volume focuses most strongly on personal discussions held between 
respondents and those directly tied to them, Ron Johnston and Charles Pattie as well as 
James Gimpel and J. Celeste Lay bring meso-sociological concerns to light, examining the 
impact of neighborhood and community characteristics on voting behavior and perceptions 
of political efficacy. Christopher Anderson and Aida Paskeviciute incorporate macro-level 
context, finding that greater political heterogeneity in a country and greater political 
marginality within that country lead to greater political discussion among individuals. 

Given the volume’s strong sociological orientation, it is stunning to see that not one of the 
contributors claims a primary home in a department of sociology. Indeed, nearly all of the 
contributors are in departments of political science or government. Although this cross- 
disciplinary work encouragingly demonstrates that sociological ideas have a home outside 
the formal bounds of the discipline, some duplication of effort has resulted. Huckfeldt’s 
rediscovery of the concept of “social-space” might, for example, have been better informed 
by the contemporary ecological work of Miller McPherson, Michael Hannan and John 
Freeman. Gimpel and Lay's study of neighborhood effects on efficacy would be enriched by 
variables of socioeconomic context included in the work of Robert Sampson, Mario Luis 
Small and Katherine Newman. 

But these are minor quibbles regarding work of major importance. 7he Social Logic of 
Politics lays out a course for the application of sociological theory to formal political behavior. 
Is it a course that sociologists will follow? 


Reviewer: James Cook, Duke University 
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Freedom Is a Constant Struggle: 

The Mississippi Civil Rights Movement and Its Legacy 

By Kenneth T. Andrews 

University of Chicago Press, 2004. 265 pages. $50 (cloth); $21 (paper) 


Everyone knows the standard account of the civil rights movement: After a string of innovative 
protest campaigns in the South, the movement prompted the passage of the Civil Rights and 
Voting Rights acts. But soon it lost support and purpose in the wake of urban uprisings and more 
intractable socio-economic goals in the North. The movement has been studied so thoroughly 
that nowadays scholars can write entire monographs about it without doing any original research. 

Kenneth Andrews shuns both conventional wisdom and current practices in Freedom /s a 
Constant Struggle. Andrews, an assistant professor at the University of North Carolina, argues and 
demonstrates that in Mississippi the movement followed up on its gains in voting registration 
by creating new political leaders and influencing the programs of the War on Poverty and 
educational desegregation. His reinterpretation rests not on protest activity, but on extensive new 
research regarding the political and policy consequences of the movement. 

Freedom Is a Constant Struggle brings keen rigor with a multidimensional approach to 
explaining the consequences of social movements, a subject that has moved to the forefront of 
movement research. Andrews focuses on Mississippi counties, employing multivariate analyses 
across several different issues of interest to civil rights activists. He unsurprisingly addresses the 
electoral issues that initiated the movement. But he also goes beyond them to confront the 
movement's engagement in Johnson administration War on Poverty initiatives, such as Head 
Start and the Community Action Program, and struggles over desegregation in school 
desegregation. Historical understanding is sometimes seen as opposed to methodological 
stringency. But Andrews’s deep knowledge of the movement in Mississippi, based on both 
archival work and interviews, heightens the validity of his quantitative analyses. 

He finds that the movement's infrastructure is of crucial and continual significance, after 
other possible influences on the outcomes are controlled for. By movement infrastructure he 
means its leadership, organizational structure and resources. Andrews argues that a 
movement with a strong infrastructure will be able to employ different strategies to meet the 
needs of different situations. He also argues that movements can have influence in a number 
of different ways, using different forms of action. In this way, Freedom /s a Constant Struggle 
helps to reorient theoretical thinking in a field that often is mired in rehashing the possible 
results of disruptive events such as urban riots. And although Andrews relies on statistical 
analyses to make his case, he fills in the numerical picture by following the trials of the 
movement in three counties, Holmes, Madison and Bolivar. In each one, he traces the 
struggles by which the movement sought to exert influence on the issues. 

No book is perfect, even one as excellent and compelling as Freedom /s a Constant 
Struggle .| wanted, for example, to see Andrews directly confront the bearers of conventional 
wisdom and make more of the interpretive shift that his research implies. Was Mississippi 
simply different, with its peculiarly virulent racism and violent resistance and its movement 
innovations such as Freedom Summer and the Mississippi Freedom Democratic Party? Or 
was its experience largely replicated across the South or Deep South? If Mississippi's 
experience was common, we need to alter our understanding of the movement. In addition, 
| wanted to see the idea of movement infrastructure unpacked to address the usefulness of 
its component parts for other studies of the consequences of social movements. | was 
hoping, too, to hear about a county in which the movement was unable to sustain activity. 

Minor criticisms aside, Freedom /s a Constant Struggle is must reading for scholars of social 
movements, the civil rights movement and the consequences of social movements. 
Andrews's keen attention to key issues and outcomes that fall below the national level is a lead 
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that can help free scholars of social movements from the methodological binds they often find 
themselves in. His informing his quantitative analyses with historical understanding is a model 
for anyone who does historical quantitative research, whatever the topic. And his ability to shed 
new light on a topic that is so well studied and important is an inspiration to us all. 


Reviewer: Edwin Amenta, New York University and University of California, Irvine 





The Two Reconstructions: The Struggle for Black Enfranchisement 
By Richard M. Vallely 
University of Chicago Press, 2004. 330 pages. $58 (cloth); $22.50 (paper) 


“Democracy” and universal suffrage have not always moved together. In most cases, the rise 
of democratic governance did not ensure a universal right to participate. Women were usually 
excluded from the vote process at the time democracy was established. And in the early 
phases of democratic consolidation, efforts by the powerful to exclude other social groups 
from participation almost always limited the franchise. But one near-universal fact about the 
history of democracy around the globe. After a group establishes its right to participate, it is 
never taken away as long as democratic governance persists. 

There is, however, one exception to this broad generalization. In the aftermath of the Civil 
War, an extraordinary coalition came together to bring about the enfranchisement of African 
American men in the United States. Previously excluded from voting in all southern and most 
northern states, the adoption of the 14" and 15" Amendments (in 1868 and 1870 respectively) 
and the creation of a series of federal enforcement mechanisms imposed on the former 
Confederacy ensured, for the first time, that black men would be able to exercise the 
franchise. And exercise it they did. With high levels of participation, the new African American 
voters worked to create Reconstruction governments across the South. Numerous African 
Americans were elected to political office in the late 1860s and 1870s. Although hampered by 
the difficulties of the post-war environment and high levels of white violence, the 
Reconstruction governments developed a startling number of initiatives that began to lay the 
foundation for a new, racially-egalitarian social order. 

The Reconstruction agenda was initially supported and fostered in Congress, where 
radical Republicans and their moderate allies fashioned programs for substantial federal 
intervention in elections to ensure that the new electoral coalition could compete fairly, even 
in the face of white violence. Some important and promising national civil rights initiatives 
were adopted. Even the federal courts occasionally (and mostly only briefly) got in on the act. 
In the immediate aftermath of the Civil War, some courts surprisingly endorsed new initiatives 
with innovative rulings that, at least briefly, pushed in the direction of racial equality. 

Yet it would take a century before the promise of the brief Reconstruction era would finally 
be realized. After 1872, Congress largely abandoned the southern reform movement. White 
violence (most notably the practice of lynching), electoral fraud, the decline of the Republican 
majority in Congress, the bitter hostility of white voters to Reconstruction (and the Republican 
Party which sponsored it), and (after 1890) legal disenfranchising measures adopted by 
southern state governments gradually stripped the right to vote from African Americans. By 
1910, few blacks could vote anywhere in the South. The disenfranchising measures also 
reduced participation among poor whites. Led by the U.S. Supreme Court, federal courts 
began to reject the direction cautiously charted in the immediate post-1865 environment. 
Democratic control over southern governments was briefly threatened by the rise of populism 
(and Populist-Republican coalitions) in the late 1880s and early 1890s, and occasional threats 
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to restore federal oversight by the remaining civil rights liberals in the Republican bloc in 
Congress. But these were crushed through a variety of different legal or illegal means. 

The result was a single-party South that created the Jim Crow system to perpetuate virulent 
racial inequality for the first six decades of the 20th century. With their seniority assured, 
southern Democrats would dominate Congress for much of this period, defining the shape of 
the national party system and taking race entirely off the political agenda until the rise of the 
civil rights movement and the dramatic crumbling of Jim Crow in a “second” Reconstruction. 
In his masterful comparison of the two periods of reform, Richard Vallely tells the story of how 
the first Reconstruction rose and fell, while the second (in the 1960s) succeeded. Drawing 
upon a rich secondary literature and an especially close reading of key Supreme Court rulings, 
Vallely covers familiar and unfamiliar ground with grace and theoretical sophistication. 

The argument he advances is a pointed one, offering a firm causal story. Vallely asserts 
that changes in public opinion, the political economy of the South, or the vastly different roles 
played by the federal governmental in the 1960s vs. the 1860s cannot account entirely for the 
difference in outcomes. Instead, he develops what he calls a “rational choice-historical 
institutionalist” account in which law and party building are causally central. The first 
Reconstruction failed because of the logic of the party system and a legal infrastructure that 
could not support sustained reform. The United States in the post-Civil war era was, to 
employ Stephen Skowronek’s oft-quoted formulation, a state of “parties and courts.” Only if 
both party building and “jurisprudence building” had supported Reconstruction, Vallely 
argues, could it have fully taken root after the Civil War. The Republicans briefly saw black 
enfranchisement as a viable political strategy for influence in the South, but supporting black 
civil rights was unpopular in the North and bitterly opposed by whites in the South. By 
contrast, in the 1960s, the Democrats gained electorally from supporting civil rights among 
northern black voters while maintaining enough white southerner votes to hold large chunks 
of the South. Similarly, the evolution of civil rights liberalism in American jurisprudence by the 
1960s had reached the point where strong and explicit federal intervention was embraced 
by federal courts who had rejected such measures a century earlier. 

As a broad, synthetic account of a vitally important question in American political 
development, 7he Two Reconstructions can teach us a great deal about both the state of 
historical social science and the history of democracy. | would, however, raise two concerns, 
both of which grow out of my peculiar interests. First, the disenfranchisement of current or 
former criminal offenders, virtually the first legal blow against African American voting rights, 
is Ignored in the book. It was precisely felon disenfranchisement that first showed how 
racially-neutral laws that eliminated black votes could be constitutionally viable even in the 
face of the 15° Amendment. 

Second, while Vallely mentions the possibility that changes in public opinion were an 
important source of difference between the two periods, he produces no actual evidence that 
would rule out a critical causal role for public opinion. For too long, political sociologists and 
political historians have been reluctant to broach the possibility that mass opinion matters in 
accounting for policy and political change. In the late 1860s, after the Civil War, five states 
voted on referenda that would have enfranchised African Americans. In each case, these 
measures were defeated. The 15th Amendment survived in part because it was a way of 
punishing the South, which was even more popular among northern whites than racial 
equality was unpopular. By the 1960s, however, northern whites had slowly come to endorse 
at least equality of opportunity (including the right to vote) for African Americans. If they had 
not, it is hard to believe that Democratic Party elites would have pushed civil rights legislation 
and federal judges would have endorsed it. A public opinion-based account is both 
parsimonious and powerful. At a minimum analysts should systematically consider this 
possibility in examining the sources of the civil rights revolution of the late 1960s (one 
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example of such an approach can be found in Paul Burstein's work on equal opportunity 
employment legislation, in his Discrimination, Jobs, and Politics). 

These concerns aside, | find much to embrace in both the broad historical sweep and 
analytical details of The Two Reconstructions. Books such as this one, and the related work of 
John Skrentny (7he Minority Rights Revolution), J. Morgan Koussar (Colorblind Injustice), and 
Philip Klinkner and Rogers Smith (7he Unsteady March) provide the seeds of a comprehensive 
understanding of the single most important change in 20" century American political history. 


Reviewer: Jeff Manza, Northwestern University 





Heartwood 
By Wendy Cadge 
University of Chicago Press, 2005. 268 pages. $55 (cloth); $22.50 (paper) 


Using participant observation and in-depth interviews, Wendy Cadge studied two Theravada 
Buddhist communities in the United States. The research is unique because the author 
systematically compares a “convert” community (i.e., a European-American group) with an 
“immigrant” community (i.e., a Thai-American group). Cadge is especially interested in how 
Theravada Buddhism is being adapted to the American context. 

There are important similarities between the two groups. In both, members are mainly 
middle- or upperclass women. Both communities are non-residential centers for lay people 
who want to learn about or practice Buddhism. 

The immigrant community is interested in both teaching Buddhism and transmitting Thai 
culture. Most of the participants are first-generation immigrants from Thailand. The temple is 
hierarchically organized, although the monks who run the group are more dependent on the 
laity than are monks in Thailand. Many of the female participants are professionals, and 
perhaps for this reason have more responsible roles than would be the case for women in 
Thailand. While in Thailand, rituals are usually arranged in relation to the phases of the moon, 
the Thai-American group holds regular Sunday meetings. Many of the rituals at the temple are 
meant for all members. Moreover in many cases whole families come to the temple. Not 
surprisingly, the members think of themselves as one large group or family. 

The immigrant group holds a variety of activities — chanting sessions, meditation classes and 
such. The monks discourage magical practices such as fortune telling that are common in Thai 
temples overseas. Lay participants venerate the resident monks, and especially the deceased 
monk who discovered the type of meditation taught at the temple; some people pray to this 
monk for help when they are in need. However the main activity is merit making such as by 
donating food, time or money to the temple. The laity want a better life in their next birth. 

The convert community is quite different. It is a lay-led group whose primary activity is 
practicing mediation; in addition, talks about Buddhist practice occur every Wednesday evening. 
Many of the members are single. Life is organized around small groups; most members are 
deeply involved in only one such group. Although the form of Buddhism taught at the center 
was developed by Theravada monks, participants use practices from various strands of 
Buddhism. People meditate not to amass merit but to gain nibbana (nirvana) in this lifetime. 

The most interesting chapter, for me, concerns Buddhist identity. Many people in the 
immigrant group have an “ascribed” Buddhist identity, that is, they were taught as children 
that they were Buddhists, and this identity was accepted without much thought. Interestingly 
some of these same people have added an “achieved” Buddhist identity; because of 
exposure to other religions, and in some cases of ethnic intermarriage, these individuals 
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examined why they were Buddhists and chose this religious identity. People with achieved 
identities often separate Buddhism and Thai culture. Cadge found a variety of achieved 
identities among the people in the convert group. Some thought of themselves as Buddhists, 
others simply as students of meditation, others as followers of several non-Christian 
traditions, others as having dual religious identities - as Jewish and Buddhist or Christian and 
Buddhist, while others did not choose any religious identity. 

Cadge uses her findings to criticize Christian Smith's sub-cultural identity theory of religious 
growth. Smith argues that religions with strong boundaries and a sense of conflict with the larger 
society will grow. In contrast, Cadge claims that the groups she studied attract people because 
they are weakly bounded (see pp. 151-2, 164, 240). “It is because of these loose boundaries that 
Christian Smith's sub-cultural identity theory does not account for the religious growth and 
strength of these two organizations or of Buddhism in America more broadly.” (p. 170) 

In a few places, Cadge describes the changes within the Buddhist groups as examples of 
Americanization. But similar developments are occurring among Buddhists in the United 
States, other Western countries and in Asia. Modernization theory could have been used to 
distinguish global changes in Buddhism in response to modernity from adaptations 
specifically to American culture. 

Heartwood is a well-written book that could easily be read by undergraduates or others 
with some minimal knowledge of Buddhism. Besides being required reading for those 
engaged in the study of Buddhism, it will also be of interest to students of new religions or 
those concerned about explaining religious growth. 


Reviewer: Joseph B. Tamney, Ball State University 





Elusive Togetherness: Church Groups Trying to Bridge America’s Divisions 
By Paul Lichterman 
Princeton University Press, 2005. 331 pages. $65 (cloth); $21.95 (paper) 


Why are some civic groups better able to connect diverse social actors and forge connections 
between them than others? What kinds of resources do groups cultivate or draw upon, so 
that some are more successful than others to “bridge America’s divisions?” Paul Lichterman's 
new ethnography of nine Christian volunteering and advocacy groups in the pseudonymous 
“Lakeburg” argues that an important piece of the answer resides in groups’ social and 
discursive habits, traditions and customs. Elusive Togetherness is a serious, thoughtful and 
detailed ethnography that suggests that so many civic and volunteer groups fail to build 
bridges between communities not because of participants’ lack of interest, desire or motives 
but rather because of the contexts in which they work, which reinforce taken-for-granted 
ways of interacting that preclude or Jimit reflexive understanding of social situations. 
Lichterman demonstrates the importance of studying daily speech acts and interactions in 
understanding political and civic cultures, and likewise suggests that some the most powerful 
resources we have for building robust civil societies (or standing in their way) reside within 
ways of talking, interacting and engaging each other on a daily level. 

Lichterman introduces his theoretical and methodological frame in two chapters that address 
the limitations of recent “social capital” approaches to evaluating community groups’ social 
impact, he offers an alternative method and theory based on interaction and socially shared 
norms and customs. E/usive Togetherness is thus firmly engaged in recent debates about how 
civil groups “bridge” or “spiral outward” to bolster and strengthen community and social ties 
across diverse sectors of society, but steers away from approaches that emphasize the 
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importance of quantifiable “skills” favored in the social capital literature to focus on the less-easily 
quantified customs, interaction styles and discourses that might lead to greater social reflexivity. 

Drawing judiciously on the political writings of Jane Addams and John Dewey, Lichterman 
argues throughout this volume that the way to understand how social democracy works depends 
on our ability to listen to how people talk to each other about things that matter. “[Pleople need to 
know the complexity of modern society, to talk reflexively about it, if they [are] going to organize 
democratically” he argues (p. 252). Lichterman’s second chapter outlines this approach, 
articulating how he assesses the ways discursive customs, speech styles and group dynamics 
work to shape the spaces and contexts where bridging and social reflexivity take place. 

These introductory chapters are followed by empirical chapters that find Lichterman a 
participant observer in nine Christian civic groups, where he trades off playing with children at 
a community center and carrying food for “Adopt-a-Family” parties with asking questions of 
leaders and group members and (above all) closely observing what is said and isn't said. 
These chapters demonstrate Lichterman’s skill at reading social situations, and also 
demonstrate the value of his methodological and theoretical apparatus. Lichterman draws 
readers into both liberal and evangelical Christians’ conversations and debates the redress 
of social ills and how to build stronger communities in Lakeburg, showing us time and time 
again (indeed, with only one exception) how groups’ social customs place strong limits on 
their ability to make real, human connections with the people they are serving. Lichterman 
demonstrates how deeply embedded yet varied social customs, largely taken for granted and 
set in motion from the groups’ beginnings, undermine or weaken the potential for social 
reflexivity and bridge building. Lichterman learns that very few groups cultivate the kinds of 
social customs and “settings that allow people to think and talk about spiraling outward 
[beyond their own limits] without threatening the group's own togetherness.” (p. 18) 

The single group that did succeed, Lichterman tells us, took on projects that broke them 
out of several norms of “volunteering” and “helping” and put them in a position of 
“partnership” with the neighborhood they wished to positively affect. Nonetheless, the 
relationships and ties that developed this group developed remained “fraught and tenuous.” 
(p. 174) Building bridges across social divisions — while arguably necessary to our society's 
sense of itself — is much more difficult, and takes much more work, than simply volunteering 
and participating from time to time in community action. 

In addition to addressing central issues in recent research on civil society, Lichterman also 
argues for a closer and more nuanced approach to understanding the place of religious (in 
this case Christian) civic groups in American public life. Lichterman’s focus on religious social 
customs, speech and practice should make this a necessary read for those interested in the 
place of religion in American society, as Lichterman demonstrates that “religion” means much 
more than theology and ideological differences. The numerous stories in this volume 
complicate easy portraits of differences between “liberal” and “evangelical” approaches to 
volunteering and social services, even as they suggest that various differences in social 
customs and speech styles continue to mark these traditions as quite different. 

The numerous fascinating examples and vignettes that comprise the majority of E/usive 
Togetherness work together to demonstrate how important it is to attend to social customs 
and discursive styles when evaluating Americans’ failures and successes in bridging social 
differences. The empirical chapters are lively and convincing. That said, Lichterman goes to 
some lengths to answer potential critics and situate his positions within several distinct 
bodies of research, at times in ways that obscure the book’s main argument. Likewise, 
Lichterman’s ethnographic focus, while greatly beneficial and at time quite powerful, 
nonetheless raises questions that it does not answer about the traditions, histories or 
genealogies of the customs that he identifies. Can we identify these traditions using historical 
data? How recognizable would they be to non-Christians or within other American publics? 
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These questions are best left for future investigation, and do not diminish Elusive 
Togetherness’ contributions to our understandings of civil society. Paul Lichterman shows us 
why it is necessary to attend to social customs and how to do so. This is a story well worth 
learning and learning from. 


Reviewer: Courtney Bender, Columbia University 





Hollywood Goes to High School 
By Robert C. Bulman 
Worth Publishers, 2005. 191 pages. $23.95 eo 


Robert Bulman seeks to bring much-needed sociological sensibility to the study of the high 
school film. Observing that most studies treat the genre “monolithically,” Bulman 
differentiates urban public, suburban public and private school films. The book argues that 
Hollywood's portrayal of adolescence and high school rests on social class distinctions and 
that the films reflect and reinforce a “middle-class cultural hegemony” that has to do with 
“individualism, self-sufficiency, free expression, hard work and fair play.” (p. 7) 

Bulman insightfully unearths consistent differences in film subgenres, each with its distinctive 
view of adolescence and individualism. Urban public school films revolve around class struggle 
between the “morally-challenged urban poor” and their “virtuous middle-class teacher-heroes” 
(p.119) where academic achievement and utilitarian individualism are the avenues out of poverty 
for lower-class youth. In the suburban school films, middle-class students battle one another; 
heroes lurk among them; school is not a place of learning, rather it is a metaphor for oppressive 
society, populated with corrupt and incompetent adults. Students fight a culture of conformity 
and find their sense of self through expressive individualism. In the private school films where 
class struggle is “full-blown,” the “integrity and merit of working and middle-class heroes is 
contrasted with the moral and emotional bankruptcy of wealthy antagonists.” (p. 23) Each of 
these subgenres reflects a perspective that encompasses the cultural anxieties and tensions, 
prejudices, preoccupations and fantasies of middle-class Americans. 

Differences between subgenres are seen to correspond to real-world asymmetries in 
which the poor are expected to exercise utilitarian individualism to better their lot while 
middle-class youth are given the freedom to exercise expressive individualism in their search 
for meaningful identity. In this linking of class, culture and varieties of individualism, Bulman’s 
argument finds some agreement with author Adrie Kusserow’s identification of “soft 
individualism” as opposed to “hard individualism” in parenting and school culture in three 
New York neighborhoods. See Kusserow’s American /ndividualism, Child Rearing and Social 
Class in Three Neighborhoods published in 2004 by Palgrave. 

The study is a think piece, somewhere.between a ‘cinematic ethnography’ and analysis of film 
as popular myth. In framing the discussion to address American individualism, identifying change 
agents as ‘cowboy heroes’ who are outsiders (reminiscent of Will Wright's classic study of the 
Western), noting that characters overcome social barriers and constraints and that the narrative 
resolves contradictions toward neat and happy endings, Bulman highlights characteristics shared 
by a variety of film genres. The resulting analysis is general enough to reach beyond the high 
school film while being somewhat removed from the richness of the films’ social world. 
Ethnographically-significant observations for example, that school is a place for articulating 
difference and hierarchy, for expressing intergenerational conflict, violence, deceit and deviance, 
where education is seen as stifling conformity and where, on occasion, teacher-heroes are 
military veterans or out-of-work mercenary soldiers are made, but do not drive the analysis. 
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Bulman convincingly demonstrates that high school films are not about schooling, rather they 
are social settings where identity issues, rite of passage and the grand drama of adolescence is 
played out. Would it then not have been better to reframe the study to address these broader 
themes? A reframing together with a discussion of the translatability of the high school genre to 
non-American films would have better suited the comparative dimension of the study and the 
inclusion of 41 foreign films, not all of which relate to high school we learn as few foreign films 
were to be found that could be categorized as ‘high school films.’ The question then arises if the 
high school film is an American genre, does it lend itself to cross-cultural comparison? 

There are other methodological questions. For example, the criteria for selecting the 185 
films which form the sample are not clear. Neither is it clear which films form the backbone of 
the analysis. Individual chapters present a discussion of the synopsis of selected films with 
many film references in footnotes and asides. Assuming a hypothetical and homogeneous 
audience, the analysis does not separate out films that are box-office successes, films that flop 
and made-for-TV movies. 

Bulman critiques the Hollywood film as American myth for ignoring structural obstacles to 
mobility, supporting a culture of poverty thesis, providing over-dramatized ‘solutions’ and its 
focus on individuals, relationships and emotional appeals. An analysis that took into account the 
parallel world of film and the institutional culture of (Hollywood) entertainment which influences 
film content would have allowed a more pointed critique. One could argue that popular Indian 
cinema is also about individuals, relationships and emotions where happy endings predominate. 
We live in an infotainment age where even news media provide thrilling stories, and for 
filmmakers and audiences movies are about individuals and their transformative journeys, 
relationships and emotions, according to author Strauss Zelnick (“Twentieth Century Fox” in 
Making and Selling Culture, published in 1996 by Wesleyan University Press). 

A greater familiarity with how popular cinema works would have helped tease out the culture 
and workings of film from the broader culture in which it is embedded. However, even with these 
questions, the book will be useful for students seeking to understand sociology through film. 

While Bulman notes that the high school film genre took off in the 1980s, he does not link 
the analysis of films to any historical, economic or political context. He does not take into 
account the cultures that influence content, the audience or the film's box-office performance. 
By choosing to critique Hollywood Bulman steps away from a more powerful social critique. 
Is the high school genre distinctively American? Does the high school film and the high school 
experience have a cultural resonance in America that is not translatable to other societies?) 
These issues would have better suited the comparative dimension of the study and the 
inclusion of 41 foreign films. 


Reviewer: Lakshmi Srinivas, Wellesley College 





Host Societies and the Reception of Immigrants 

Edited by Jeffrey G. Reitz 

Center for Comparative Immigration Research, San Diego, CA: University of California, 
2003. 550 pages. $25.95 (paper) 


This edited collection comprises 19 chapters written by scholars from the United States, Canada 
and Europe for a 2001 Harvard University conference. Six of the chapters have been previously 
published as an issue of an international journal. The collection’s major focus is how features of 
‘host (i.e., receiving or settler) societies determine the incorporation of immigrants. The editor's 
introductory chapter elaborates four intersecting dimensions of this research trajectory that 
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serve to organize the book. (1) prior ethnic or race relations within host societies (2) variance in 
labor markets (3) effects of immigration and other state policies, and (4) the shifting of 
international boundaries and globalization. Many of the subsequent chapters within these 
sections adopt a comparative approach and employ sophisticated quantitative analysis. In 
addition, a fifth section contains two chapters that have a more historical orientation. 

This is an interesting collection of articles that explores diverse aspects of immigrant 
integration in receiving societies, including educational attainment, residential segregation, 
housing shortages, labor market incorporation and income levels, entrepreneurship and political 
engagement. It effectively showcases varied approaches — sociological, economic, and historical 
—within the study of immigration. If there is a single revelation flowing from this book, it is that the 
incorporation of immigrants is far more complex than previously acknowledged. The chapters 
reveal a process wherein the characteristics of settler societies are as important for integration as 
immigrants’ attributes and one that correspondingly enlists a range of Social institutions in its 
service, institutions which themselves are seen to change significantly over time. 

This complexity of immigrant incorporation is exemplified in three chapters from different 
sections in particular. In the section on pre-existing ethnic or racial relations, the chapter by 
R. Alba, J. Logan and B. Stultson on residential inequality and segregation reveals that spatial 
assimilation, place stratification nor the newer ‘ethnic-community’ model anticipate the 
identified patterns, thereby “indicating that the settlement of contemporary immigrant groups 
is indeed complex and involves elements of all three models.” (p. 144) 

G. Borjas’s chapter, in the section on state policy, centers on immigrant participation in 
welfare programs and shows the complex effects of efforts to reduce such participation in 
the form of the federal Personal Responsibility and Work Opportunity Reconciliation Act of 
1996. Welfare reliance is found after 1996 to have declined among immigrants relative to U.S.- 
born individuals nationally, but upon closer analysis it is revealed this supposed national 
change Is entirely a result of reductions in welfare participation in California alone (p. 291). The 
reasons for such a dramatic shift in only one state can only be speculated upon. 

Finally, in the section on globalization, D. DeVoretz, J. Ma and K. Zhang introduce a 
triangular human capital transfer model that problematizes the widely-held assumption in the 
neo-classical model wherein the immigrant enters the host society and, if disappointment 
results, returns home. By referring to the case of Hong Kong and Canada these authors paint 
a far more complex picture that involves a third nation (i.e., a triangle) to which human capital 
can migrate and thus three movement options for the immigrant. 

One significant consequence of the complexity of the incorporation process revealed in 
these and other chapters is to starkly expose how policies, especially those deploying crude 
point selection systems or excluding all immigrants from welfare programs, are unlikely to 
effect positive change in immigrants’ experiences in the labor market or in immigrants’ 
institutional integration more broadly. The presence of a range of topics and approaches such 
as these is the collection’s main strength but perhaps also its principal limitation. 

While the editor does his best to categorize disparate articles into five key themes in his 
well-written introduction, the collection ultimately lacks an overarching theoretical theme. To 
be sure, the editor notes the four dimensions he identifies “may be useful in developing an 
emerging theory about how host societies affect immigrant integration” (p. 15), but he fails to 
show how this might be accomplished given the array of approaches present or, indeed, why 
such a theory is requisite. The editor could have acknowledged there are incommensurable 
theories informing the various contributions that render such synthesis impossible and 
instead discussed the rich diversity of this realm of study. In this way, the chapter by S. 
Castles discussing transnational communities that draws on social theorists such as Z. 
Bauman and M. Castells, would seem less out of place in a section that features the testing 
of the DeVoretz-Ma economic model (p. 470). 
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Nevertheless, if the approachable presentation of diverse research and critical discussion of 
key concepts in contemporary immigration studies is its goal then this collection surely 
delivers. As a consequence, this book will appeal to a wide audience that includes immigration 
policy makers, scholars amd graduate and undergraduate students of immigration studies. 


Reviewer: Randy Lippert, University of Windsor 





Islamic Modernism, Nationalism, and Fundamentalism: Episode and Discourse 
By Mansoor Moaddel 
University of Chicago Press, 2005. 448 pages. $65 (cloth); $24 (paper) 


This ambitious book is an Islamic version of Robert Wuthnow’s influential analysis, Communities 
of Discourse (1989). While Wuthnow traversed a series of European cases over four centuries, 
examining the relationship of social structure to ideological discourses, Mansoor Moaddel 
addresses similar issues for a more tightly-knit series of Islamic societies. These cases range 
from Islamic modernism in the late 19th and early 20th century, to various forms of Middle 
Eastern nationalism in the early and mid-20th century, to Islamic fundamentalism in the mid- to 
late 20th century. Moaddel asks why these ideologies emerged when they did, and why they 
varied from one country to another. For the Islamic modernist period, Moaddel compares the 
Muslim regions of northern India with Egypt and Iran; for nationalism, he compares Egypt, Syria 
and Iran; for fundamentalism, he compares Egypt, Syria, Iran, Algeria and Jordan. 

The unit of analysis is an episode of ideological production, in keeping with the 
development of the concept of episodic discourse in Moaddel’s earlier work on the Iranian 
revolution of 1979. Moaddel defines episode as a period of continuity in meaning-making that 
is bounded geographically and temporally, cut off from earlier and later episodes by dramatic 
events such as coups, revolts, wars, sudden political or economic shifts, or innovations of 
various sorts (p. 20). The book’s main argument is that episodes are discontinuous — but that 
there is a hidden logic to the discontinuity, which is characterized in large part by each 
episode’s opposition to the previous one. Each episode “targets” the ideology of its 
predecessor, and in so doing it is shaped by its predecessor. Like a dialectic without hope of 
synthesis, each episode succumbs to the antithesis that it helped to form. 

Moaddel distinguishes two dimensions of this dialectic: the location-of the target in civil 
society or the state, and the degree of pluralism within the target's discursive field. The first 
dimension differentiates between Islamic modernism, which Moaddel argues was more 
targeted toward the Islamic establishments of civil society, and nationalism and 
fundamentalism, which Moaddel argues were more targeted toward the institutions of the 
state. The second dimension differentiates between modernism and nationalism, which 
generally adopted the pluralism of their targets, and fundamentalism, which adopted the 
monolithic discourse of its target, the nation-state. To give one example from among the 
many that Moaddel offers: in the 1930s, Sayyid Qutb, the Egyptian literary critic who later 
became one of the foremost ideologists of Islamic revolution, was an advocate of modern 
Arabic poetry and a supporter of liberal nationalism politics, whose pro-democracy ideology 
reflected the pluralism of the Egyptian constitutional monarchy. In the following decades, 
Qutb abandoned liberal nationalism and joined the Muslim Brotherhood, ultimately favoring 
violent overthrow of the Egyptian state in the name of an Islamic ideology that reflected the 
military-led state's “totalitarian” ambitions (pp. 217-220). 

Moaddel rejects any direct correspondence between ideology and social structure. For each 
case, he describes the ways in which colonialism, modern education, statist development 
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projects and global economic trends battered some classes and created new ones. However, 
the ideology of each class may vary from one period to the next or from one class faction to the 
next. For example, Moaddel notes that Islamic modernism was stronger in Egypt than in Iran in 
the 19th century, despite similarities in the two countries’ social structures. He attributes the 
difference in part to the smaller cadre of modern-educated Iranians, but primarily to the lack of 
“discursive pluralism’ in Iran. If Christian missionaries had been permitted greater voice in Iran, 
Moaddel argues, Islamic modernism might have developed to a greater extent (p. 109). For 
Islamic fundamentalism, Moaddel goes even farther in undermining the relationship between 
social structure and ideology. In Iran, Moaddel locates fundamentalism in older social classes, 
the bazaar and the Islamic religious establishment; in Jordan, building on his earlier work on 
Jordanian exceptionalism, Moaddel associates fundamentalism with the new middle class; in 
Algeria, he associates it with peasants and the urban poor. 

All comparative-historical studies of this grand scale, including Moaddel’s and Wuthnow’s, 
are vulnerable to questions about representativeness: to what extent are the conclusions a 
product of the selection of cases? For Moaddel’s book, how well does the argument apply 
to other Islamic societies? Ottoman Turkey, for example, experienced Islamic modernism 
similar to Egypt and North India, but it seems difficult to apply Moaddel’s logic and attribute 
the pluralism of the Islamic modernist ideology to the pluralism of the Ottoman religious 
establishment. Or perhaps, unlike Moaddel’s argument for other Islamic modernists, the 
Ottoman movement targeted the Ottoman state more than the institutions of civil society, in 
which case the Ottoman state reforms of the mid-19th century might indeed have provided 
an inspiration for the opposition’s pluralism. The challenge in such an approach is to 
characterize the essence of an episode of discourse while doing justice to the variety of 
ideological currents within it, as well as variation over the course of the episode. Moaddel’s 
book is a bold attempt to do both. 


Reviewer: Charles Kurzman, University of North Carolina at Chapel Hill 





Middle Eastern Lives in America 
By Amir Marvasti and Karyn D. McKinney 
Rowman & Littlefield Publishers, Inc., 2004. 171 pages. $60 (cloth); $22.95 (paper) 


This is a perplexing book, as large claims are made for it in the foreword and by the authors, 
who claim it breaks new ground in laying out Middle Eastern contributions to the 
development of Western civilization and in portraying the experiences of Middle Eastern 
Americans. There is no bibliography, but the footnotes show that the authors have failed to 
consult the substantial literature that exists in both these areas. (Publications on Middle 
Eastern Americans that have been out for decades are reviewed in my own 2003 publication, 
Muslims in the United States: the State of Research). 

Instead, this book's genesis relies on personal connections (xv-xvi) and the sense of 
urgency produced by the events of Sept. 11, 2001. The authors’ position (p. xxi) is that Middle 
Eastern Americans “are considered a despised and dangerous minority group,” and the 
research goal was to “bring to light the history and current experiences of this group with 
discrimination.” Marvasti and McKinney, sociologists, set out to conduct formal interviews with 
20 respondents (14 men and six women). We are not given the interview schedule or any 
systematic analysis. Nine of the 20 respondents were from Iran, six were from Pakistan, and 
one from Turkey; only two each from Egypt and Lebanon were Arabs and would fit most 
definitions of “Middle Eastern.” But these authors define Middle Eastern as “having ancestral 
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ties to the predominantly Islamic region of the world in southwest Asia and North Africa (p. xx). 
The interviews were termed “in-depth” and “lasted an hour.” The authors also drew on informal 
sources of empirical data, defined on p. xix and ranging from their own life experiences to visits 
and gatherings. 

The value of the book lies in the anecdotal material presented by authors-and informants, 
but the anecdotes are presented without substantive background material or systematic 
analysis. It is a popularly-oriented political appeal and may have a place in provoking dialogues 
or providing topics for speeches and workshops on race and ethnicity in American life. Given 
the other academic literature out there on the topics touched on in this book, the claims made 
for it are considerably overstated. 


Reviewer: Karen Leonard, University of California, Irvine 





Ethnic Routes to Becoming American: Indian Immigrants and the Cultures of 
Citizenship 

By Sharmila Rudruppa 

Rutgers University Press, 2004. 239 pages. $62 (cloth); $22.95 (paper) 


In an age of multiculturalism, how do new immigrant groups construct an American identity 
and assert their cultural claims to U.S. citizenship? What impact do racial and class 
backgrounds have on these processes? In this provocative study, Sharmila Rudruppa’s 
research focuses on two non-profit organizations, Apna Ghar, a battered women’s shelter, and 
the Indo American Center, a cultural association, in Chicago. She utilizes participant 
observation research methods to gain access as a paid staff member at the former and as a 
volunteer at the latter. 

Rudruppa deliberately spends the first part of the book describing her observations and 
interactions at the two centers, with separate chapters discussing the workers and users of 
the sites. The organizations provide an interesting contrast. One is a social service agency 
with staff, volunteers and clients of varying racial backgrounds; the other is a cultural center, . 
which has primarily Asian Indian volunteers and participants. Her skill as an ethnographer is 
admirable and her research notes are meticulous, allowing the reader to become immersed 
in the two sites. : 

lronically, Rudrappa finds that both these ethnic organizations, contrary to the assumption 
that they are creating a separatist rhetoric, are inadvertently promoting assimilationist 
paradigms. Neither of these organizations is fulfilling its declared role of politicizing the 
community, rather it imposes assimilationist agendas, even to the extent of “de-ethnicizing” 
or “re-ethnicizing” community members. She explains how these ethnic organizations, 
originally created with interventionist goals, utilize more integrationist strategies as they have 
become more institutionalized and professionalized in order to compete for funding sources. 
Rudrappa acknowledges the vital service provided by Apna Ghar, but critiques this social 
service agency's approaches to assisting women who seek shelter, particularly their directive 
of disciplining the women to disconnect with their ethnic communities or networks, which 
she argues is counterproductive to rebuilding their lives. Rather than raising the political 
consciousness of their clients by providing them with alternative gender roles, they reinforce 
patriarchal ones. 

The second part of the book focuses on the theoretical implications of her case studies 
of immigrant activism, interrogating notions of ethnic authenticity and citizenship. She argues 
that the specter of whiteness shapes how racialized immigrants are selectively incorporated 
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into U.S. society. At the Indo Community Center, their emphasis is on the validation of the 
“good immigrant” model, which is confined to middle-class immigrants who are economically 
integrated into mainstream society. Through tours of their communities, ethnic curriculum 
development for school children, or performances to promote their ethnicity, they display 
what they consider to be authentic representations of ethnic identities and traditions. By 
revealing only what they believe is palatable to U.S. civil society, they hope to increase their 
stature as “good” moral Americans to be included in the U.S. polity, even if it means excluding 
segments of their community or aspects of their cultural practices. In this section, her 
discussion on the historical Americanization process for European immigrants and the rise 
of white ethnicity can be lessened, with more attention to how this model continues to shape 
the lives of contemporary racialized immigrant groups, other than just South Asians. 

As a participant observer, Rudrappa attentively highlights how middle-class staff members 
and volunteers provide services to working-class immigrants, indoctrinating them on how to 
become Americans. Class and ethnic interests become intertwined since the negative 
perception of new working-class or impovished immigrants can be detrimental to more 
economically and culturally established ones. This seems to be a common concern for 
established immigrants, particularly those who have been negatively stereotyped and who 
engage in staging ethnic acts to lessen the level of hostility directed at them. Yet, as Rudrappa 
appropriately points out, the problem arises when cultural indoctrination maintains narrow, 
and often static, visions of ethnic culture, which dismisses the hybridity of cultures in the 
homeland and in the United States. She can elaborate on whether these two case studies 
reflect cultural processes at other immigrant organizations vying for participatory democracy, 
especially ones that cater to groups not restricted to the geographic or cultural boundaries 
of a single nation-state. 

In the early chapters, more historical background and statistical information on the Indian 
American population and community are needed to situate this study within the social and 
political context of Chicago. In these case studies, Rudrappa also integrates self-reflective 
ethnographic practices, allowing the reader to assess her interpretations. This is informative; 
however, readers still have to rely on her representation and analysis of the events. It would 
be useful to understand more about language being spoken in these settings and if there are 
differences on how identity is interpreted in interactions when heritage language versus 
English is utilized. She hints at ethnic and racial tensions in her fieldwork observations, but 
could expand this discussion, to indicate how this might impact the processes of identity and 
community formation. Overall, this book is valuable because it addresses the complex 
debates around multiculturalism and shows how the forces of Americanization still affect 
immigrants in the contemporary period, even activist-based organizations with a decidedly 
ethnic and alternative agenda. 


Reviewer: Linda Trinh V6, University of California, Irvine 





Rebuilding Labor: Organizing and Organizers in the New Union Movement 
Edited by Ruth Milkman and Kim Voss 
Cornell University Press, 2004. 309 pages. $49.95 (cloth); $19.95 (paper) 


Ever since John Sweeney led a palace coup to become leader of the AFL-CIO in 1995, there 
has been renewed scholarly interest in the activities and the fate of the U.S. labor movement. 
Sweeney's "New Voice” slate ran on a platform calling for U.S. unions to dedicate themselves 
to massive organizing of the unorganized. This aggressive call for unions to transform 
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themselves from “business union” servicing bureaucracies into organizing machines 
crusading for workplace and broader social justice kindled hope that unions might be able to 
grow rapidly and once again become central to the struggle for justice for working people. 

A decade later, the reality is sobering. Few unions have wholeheartedly taken up the 
challenge to organize on a massive scale. Union density (percentage of the workforce in 
unions) continued to drop, albeit at a slower pace, in the past 10 years. Today less than 12° 
percent of U.S. workers are in unions, and only about 8 percent of private sector workers are 
union members. Much recent academic literature on organized labor concentrates on union 
efforts to transform themselves internally — to “change to organize” so that they can “organize 
for change.” Rebuilding Labor is an impressive addition to this literature. 

As the subtitle suggests, the volume is concerned with union organizing efforts and union 
organizers. What accounts for the overall failure to organize massively and the occasional 
successes? What happens to union organizers, and can this occupation be a fulfilling one that 
retains people? These are the types of questions addressed in the book. 

A well written overview of the state of union organizing since 1995 by editors Milkman 
and Voss is followed by a comprehensive examination of National Labor Relations Board 
(NLRB) data by Kate Bronfenbrenner and Robert Hickey that confirms Bronfenbrenner’s 
previous research findings that a “comprehensive union building strategy” is the most 
effective way to organize. Indeed, Bronfenbrenner and Hicky argue that union organizing 
behavior is the most important determinant of organizing success, more important even 
than employer anti-union tactics. 

Extremely interesting chapters by Steven Lopez and Robert Penney show that worker 
Opposition to unions is not simply due to employer manipulation. Many workers 
independently hold negative images of unions, and organizing unions must show workers 
that they are different from those stereotypes. Solidarity through the mechanism of a union is 
an achievement to be actively worked for, not something that is automatic once workers 
overcome fear of the boss. 

Other chapters focus on union organizer practices or their experiences or longevity. Teresa 
Sharpe studies organizers of Hotel Employees and Restaurant Employees (HERE) local to 
show the importance of democratic practices, yet the need to push workers to engage in 
activities they are not naturally inclined to undertake. The tension between these two is a 
creative and ongoing one. Marshall Ganz and four colleagues examine the career trajectories 
of a cohort of union leaders and organizers from 1984 to 2001, utilizing their self-understanding 
to determine how their goals changed or did not change as a result of their experiences, and 
which goals make for longevity in the union movement. Daisy Rooks interviews current and 
former organizers who went through the AFL-ClO’s Organizing Institute (Ol) to examine 
burnout, retention and turnover issues and to critique present union practice. Leslie Bunnage 
and Judith Stepan-Norris examine the “Union Summer” program where college students are 
placed in union organizing situations during a three week summer internship period. 

All of these chapters are genuine contributions to our knowledge of current union 
organizing, and a number of them are simply excellent. Perhaps the Organizing Institute 
chapter is least successful, and the two chapters on worker opposition to unions are 
excellent, as is the always-thorough work of Kate Bronfenbrenner. 

A final chapter by John DiNardo and David Lee, while a fine piece of work, is wildly out of 
place in this book. It is an examination of whether unionized establishments are more likely to 
close than are non-union ones. Using econometric analysis, the authors conclude that they are 
not. This is a fine piece of work, but it did not belong in a book on organizing and organizers. 

On final comment: the book has a large number of “typos” as well as citation and 
grammar errors that should have been caught by a copy editor. This is an unusual problem for 
Cornell University Press, which usually is impeccable in its editorial standards. 
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That problem aside, this is a terrific book. Milkman and Voss have gathered together an 
insightful collection of research that is truly original and important. | recommend the book very 
highly to all interested in the subject. 


Reviewer: Bruce Nissen, Florida International University 





Work and Life Integration: Organizational, Cultural, and Individual Perspectives 
Edited by Ellen Ernst Kossek and Susan J. Lambert 
LEA Publishers, 2005. 570 pages. $99.95 (cloth) 


In the 21° century men and women work. The Leave /t to Beaver era has ended. And, the 
world is flattening as global lines of communication and trade expand. Today's worker arrives 
at an office that demands longer hours and higher education, but offers little security. And 
today’s manager faces the challenge of creating an inclusive environment for a diverse 
workforce, while meeting employee demands for flexibility with fewer resources. But while 
work-life conflict may be an accepted reality, work-life integration remains an elusive goal. 

Work and Life Integration: Organizational, Cultural, and Individual Perspectives is an 
impressive volume that has garnered the wisdom of leading scholars and woven them 
together seamlessly creating a whole that is much greater than the sum of its parts. Ellen 
Ernst Kossek and Susan Lambert's book offers business professionals and scholars alike a 
refreshing analysis of what it will take to make true progress in the workplace. The chapters 
have been artfully crafted so that even the most seasoned readers learn a great deal. 

The book is divided in to three main sections, Organizational Perspectives, Individual 
Perspectives and Cultural and Social Perspectives, and includes a provocative conclusion that 
encourages the readers to reconsider long held assumptions and engrained ways of operating. 

Mac Dermid’s chapter, (Re) Considering Conflict between Work and Family establishes a 
baseline with a succinct critique of what is work-family contlict? And what do we know about 
it? She shows that conflict is often used as a catch-all term, and if stratified into types such as 
time-based, strain-based and behavior-based (Greenhaus and Beutell, 1985), employers and 
scholars may be better able to design appropriate interventions. 

Taking an organizational perspective, Milliken and Dunn-Jensen consider “how people 
think about and manage their work and non work time.” (p.44) They acknowledge that time 
pressures have increased and address the implications of new technologies but assert that 
time allocation is ultimately a choice. Valcour and Hunter expand on common assumptions 
about the impact of technology on work-life integration. Their model illustrates the 
relationships between technology and factors such as job type, organizational policies, and 
work distribution (pp. 63-64). 

A common weakness in the employer approach to addressing work-life conflict is the 
underestimation of personal accountability. Friede and Ryan's discussion of the effects of self 
evaluation on work family interaction reminds us that ultimately work family conflict is 
experienced at an individual level. Cleveland adds another dimension with a discussion on 
“What is Success?” Poelmans draws our attention away from the “consequences” of conflict 
and considers the “actions of deliberation, decision making, and learning that proceed and 
follow a work-family dilemma.” (p. 264) This is a particularly important chapter for the business 
reader, for how can an employer design policies for a workforce it may not understand? 

In the section on cultural and social perspectives, Pitt-Catsouphes and Googins assert that 
true change will be limited until work-family is approached both as a private and public 
concern. They encourage collaboration between human resources and corporate social 
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responsibility. Edmondson and Detert review the research on using workplace dialogue as a 
tool for change and apply it to work-life; an important comparison for the business reader 
hoping to create an open dialogue within their workplace. 

And finally Kossek and Lambert leave the readers to consider: What is not a work/life 
issue? They assert the importance of keeping work central to the new business case that 
includes: a focus on workers and their key role in determining profitability; an understanding 
of the implementation problems in current business and social policy, and an assessment of 
the merits and limitations of various interventions. 

This reviewer humbly offers a word of caution to the authors and to the business reader. 
While overall this book will be a valuable addition to any practitioner's or scholar’s library, the 
business reader may find the chapters long and the academic references abundant. In a 
climate where time is of the essence and words are used sparingly, business readers will 
have to be patient in order to digest the concepts presented. 

Work and Life Integration: Organizational, Cultural, and Individual Perspectives is an 
excellent book and a valuable contribution. In a field wrought with endless articles touting 
changing demographics and the needs of the “new” employee, Kossek and Lambert created 
a piece of work that has the potential to actually change the workplace. Scholars will test 
theories and design compelling projects, while professionals will take back fresh opinions and 
refined strategies. This book truly bridges the worlds of academia and the workplace, and as 
such will serve to move both forward. 


Reviewer: Kathy Lynch, Boston College Center for Work & Family 
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The Persistence of Structural Inequality? 
A Network Analysis of International Trade, 1965-2000 


Matthew C. Mahutga, University of California, Irvine 





Abstract 

This article reports results from a network analysis of international trade from 1965 through 
2000. It addresses the impact of changes associated with globalization and the “new 
international division of labor” (NIDL) on structural inequality in the world economy. To 
assess this impact, I ask three specific questions. (1) Do patterns of international trade 
conform to a core/periphery structure through time? (2) Does the structure exhibit inequality 
with regard to industrial sophistication? (3) Have globalization and the NIDL encouraged 
upward mobility for historically poor countries, or have they reproduced a stable set of 
structural positions? The findings support the view that the NIDL and globalization have 
benefited a few exceptional countries while at the same time producing structural inequality. 


Introduction 


How is “globalization” changing the fate of developing countries in the world economy? There is 
considerable agreement that high levels of inequality between nations was caused by an uneven 
spread of capitalism that segmented the international division of labor from the mid-1800s 
through the mid-1900s (Firebaugh 2000; Mittelman 2000; Wallerstein 1974). There is also broad 
agreement that the world economy experienced a “new international division of labor,” a shift in 
manufacturing from developed to developing countries and a concomitant increase in the volume 
of international trade, beginning in the 1960s (Frobel, Heinrichs and Kreye 1980; Dicken 2003). 
There is much less agreement on the prospects for poor countries that have begun to 
industrialize and open their economies to the world market. Some claim these global processes 
equalize the structure of the world economy. As Robert Wade (2004) paraphrases this view: 


... the world economy is [now] an open system in the sense that 
country mobility up the income/wealth hierarchy is [no longer] 
constrained by the structure. The hierarchy is in the process of being 
flattened, the North-South, core-periphery, rich country-poor country 
divide is being eroded away as globalization proceeds (p. 567). 


Others contend instead that globalization reproduces old forms of structural inequality, along 

with producing new ones (Arrighi et al. 2003; Hirst and Thompson 1996; Mittelman 1996). 
In many ways, current debates in sociology over “globalization” and trends in international 

inequality hinge on the interpretation of industrialization in historically poor countries. Those who 
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suggest that poor countries are closing the gap between themselves and developed countries 
implicitly equate industry in advanced industrialized countries with industry that is developing 
in poorer countries. Thus, poor countries should have faster growth as they exploit the returns 
to closing the productivity gap between low and high capital/labor ratios (Barro and Sala--Martin 
1995). On the other hand, scholars from more critical perspectives claim that technological and 
other historical forms of dependency delimit industry in developing countries to less 
sophisticated forms than those in core countries (Smith 1997; Chase-Dunn 1998). According to 
this view, macro-level comparative studies potentially obfuscate our understanding of global 
change by giving insufficient attention to the levels of industrial sophistication undergirding 
industrialization in historically poor countries. For example, many scholars point to the meteoric 
growth of “manufacturing” in developing countries to suggest that historically poor countries 
can close the gap between themselves and the developed world: 


The conception of an interdependent, interacting, global manufacturing 
system cuts across the old view of a world consisting of... First and 
Third, haves and have-nots, rich and poor, industrialized and non- 
industrialized. 

Harris 1986: 200 


Others point to increasing aggregate trade ties for historically poor countries to suggest that 
the world economy is becoming less hierarchically organized (Kim and Shin 2002). Among the 
problems with these measures of global shifts in manufacturing and international trade is that 
they equate everything from garments to semi-conductors. Such a level of aggregation runs 
the risk of misrepresenting the dynamics associated with globalization and the NIDL. 

The central questions that emerge from these debates, then, concern the qualitative 
nature of industrialization occurring in historically poor countries, as well as the effect of 
increasing trade ties for developing countries. Is there a difference between the type of 
industry that dominates advanced industrial countries vs. that in historically poor countries? 
Do expanding trade ties reflect a fundamental change in the underlying structure of the 
international division of labor? 

This article joins the debate on these global shifts by examining competing empirical claims 
about the effects of globalization on outcomes related to the structure of the world-economy 
and the performance of individual countries within that structure. | then adjudicate between 
these claims by employing a network analysis of international trade over five points in time 
between 1965 and 2000. Three main research questions guide this investigation. First, does 
the structure of international trade continually exhibit a hierarchical structure in which some 
countries occupy dominant positions and play dominant roles vis-a-vis others? Second, are 
dominant positions associated with higher levels of industrial sophistication than subordinate 
positions? Finally, do patterns of mobility suggest that historically underdeveloped countries 
have been able to improve their position in the world economy, or has the relative position of 
countries remained stable throughout these global shifts? | turn now to discussing the 
historical context of industrialization in the developing world, emphasizing the role that /evels 
of processing play in creating and maintaining structural inequality. 


Levels of Processing and Structural Inequality 


The world-system and dependency perspectives suggest that the structure of the world 
economy creates international inequality. Generalizing from a long-term macro historical 
comparison, Wallerstein claims that dominant (core) countries rose to ascendancy as a direct 
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result of their exploitation of non-western (peripheral) regions. Through colonization, core 
states enriched themselves and simultaneously retarded the growth of peripheral states by 
forcing them onto political and social paths that made technological advancement and 
competition with the core difficult if not impossible (Wallerstein 1974). More recent 
mechanisms of growth and stagnation have to do with the notion that the subsequent world 
system is hierarchically organized into zones (core, semi-periphery and periphery) that 
perform different functions in the international division of labor. 

The defining characteristic of world-system zones is the extent to which they are involved 
in core or periphery ‘production processes. According to Chase-Dunn, “Core production is 
relatively capital intensive and employs skilled, high wage labor; peripheral production is labor 
intensive and employs cheap, often politically coerced labor.” (Chase-Dunn: 1998, 77) Thus, 
core countries’ production regimes are primarily capital intensive, while peripheral countries’ 
regimes are primarily labor intensive or based on the export of raw materials. Conceptually, the 
core/periphery distinction is one of a continuum. Countries that have a relatively equal mix of 
core and peripheral production processes are labeled semi-peripheral and reside between core 
and periphery countries in the hierarchical structure. It is highly unlikely that countries with little 
to no advanced industry can move up into core production processes because they lack the 
necessary levels of capital, infrastructure, workforce skills and technical expertise to do so. 

Prior to 1960, the international division of labor was relatively simple: core countries 
specialized in manufactured goods that were exported abroad, and peripheral countries 
specialized in raw materials and food stuffs for core markets (Dicken 2003; Mittelman 2000). 
Countries that specialized in manufacturing remained on the cutting edge of technological 
innovation and industrial capacity, which advantaged them in generating wealth through 
product innovation. Countries that specialized in raw material and agricultural extraction were 
disadvantaged in terms of their ability to generate wealth and develop modern industry, and 
were vulnerable to highly fluctuating prices for raw materials on the world market (Galtung 
1971). As Dicken (2003) states, this structural inequality manifests itself in the form of unequal 
terms of trade’: 


“Over time, these inequalities tend to be reinforced through the 
operation of the cumulative processes of economic growth... the 
terms of trade for manufactured and primary products tend to diverge 
[and primary producers have] to export a larger quantity of goods in 
order to buy the same, or even a smaller, quantity of manufactured 
goods.” (p. 572) 


Thus, the picture that emerges prior to 1960 is one in which the international division of labor 
manifests structural inequality. 

This international division of labor also had political consequences for individual nations. 
The periphery’s export recipients are primarily located in the core. Consequently, peripheral 
countries remain highly sensitive to political and economic decisions of core countries and 
tend to maintain trade relations that favor them (Hirschman [1945] 1980). The other side to 
this trade relationship is that peripheral countries are equally dependent on the core for 
imports that are necessary to reproduce their own economy: 


“The rich country can with little effort supply a poor country with 
implements for agriculture or the chase which doubled the 
effectiveness of her labor, and which she could not make for herself; 
while the rich country could without great trouble make for herself 
most of the things which she purchased from the poor nation or at all 
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events could get fairly good substitutes for them.” Marshall, 1923 
cited in Hirschman, 1980: 2. 


Thus, core nations enjoy a structural advantage over peripheral nations by limiting their trading 
alternatives and maintaining trade relations that favor the core. Ultimately, peripheral states 
are unable to increase their standing vis-a-vis core states (Frank 1969; Portes 1976). 

In sum, these perspectives suggest that the social organization of production, by 
structuring access to the material goods necessary for development, is “the major source of 
the inequality of this world.” (Galtung 1971: 89) Hirschman’s theory of the relationship 
between National Power and the Structure of Foreign Trade ([1945] 1980) suggests further that 
the reproduction of the core/periphery hierarchy occurs through structural inequality, which 
is the interaction of both “purely economic competitive advantage and political interference 
by states.” (Chase-Dunn and Rubinson 1979: 455) Ultimately, these operate through 
reciprocal causation as high levels of processing facilitate strong states, and strong states, 
in turn, facilitate higher levels of processing. Empirical research has found that both strong 
and competent states (Evans 1979,1995; Haggard 1990) and high levels of processing 
(Stokes and Jaffee 1982) are associated with economic growth. 


The New International Division of Labor and Free Trade 


Two interrelated processes occurred in the world economy to make this simple dichotomy 
between manufacturing and raw material/agricultural extraction no longer an adequate 
description of the international division of labor. The first of these began in the 1960s and 
1970s. Due to increasing economic volatility, the world economy began to reorganize into a 
NIDL as industrialized countries relocated segments of their industrial sectors to developing 
countries (Frobel, Heinrichs and Kreye 1980). This global shift gave rise to claims that 
historically poor countries could close the developmental gap between themselves and the 
core. Though the above discussion of levels of processing and structural inequality 
juxtaposed manufacturing with raw materials to show that unequal levels of processing 
created inequality, “the general point here is the gap, which would also exist if one nation 
delivers semi-finished products and the other finished products.” (Galtung 1971: 88, 
emphasis added) In other words, potential qualitative distinctions between types of industrial 
processes that characterize national economies remain an important analytical component 
when assessing the developmental impact of globalization and the NIDL. 

The second of these processes is more recent and has to do with expanding neo-liberal 
trade policy that began in the 1980s. The multilateral trade agreements that further 
institutionalized the General Agreement on Tariffs and Trade (GATT) in the World Trade 
Organization? (WTO) sparked debate among economists and scholars of international 
relations. Institutionalizing free trade agreements could have multiplied the benefits of the 
NIDL by lowering the costs associated with international exchange and encouraging further 
industrialization in the periphery. Furthermore, trade liberalization could encourage countries 
to invest in products for which they have a comparative advantage (Krueger 1997). As Kim and 
Shin (2002) summarize, “according to neoclassical economic theory, [increased] trade would 
largely eliminate the handicaps of countries with limited natural resources or those in lower 
developmental stages. It is through trade that developmental opportunities would be more 
widely distributed across the world.” (449) On the other hand, the world-system/dependency 
perspective of free trade claims that lowering restrictions on exports and imports tends to 
favor firms and national economies that already dominate. As Frank argues, “free trade, like 
free enterprise, is protective monopoly under another name.” (Frank 1969: 54) 


The Persistence of Structural Inequality? * 1867 


Network Analysis and International Trade 


The preceding theoretical perspectives suggest several empirical implications for structures of 
international trade, the location of individual countries within that structure, as well as a country's 
level of industrial sophistication. First, the diversity of a country’s production regime should 
increase monotonically with the overall level of processing at which it is capable of producing. 
Second, a country’s position in structures of international trade is therefore highly correlated 
with its level of processing: high variation in the types of commodities a country produces 
means greater access to markets and trading partners. It follows that structures of international 
trade are hierarchically organized based on the marked contrast between the ro/es that core and 
peripheral countries play vis-a-vis each other. The role of core countries involves providing 
capital-intensive goods and technological ingenuities required by all regions of the world 
economy. Furthermore, core countries have the largest markets in the world. Thus, core 
countries will tend to export and import large volumes of commodities to and from many 
countries located throughout the entire world economy. The role of non-core countries involves 
specializing in exports of raw materials and, increasingly, intermediate processed goods to 
higher zones in the hierarchy. Consequently, these countries will tend to have fewer trading 
partners, most of which will be located at higher zones in the hierarchy (Galtung 1971). 

These ideas — that the world economy is hierarchically organized and countries play 
distinct roles in the world economy — make network analysis uniquely suited to the study 
of international trade because of its ability to detect socia/ structure by analyzing the 
patterns of relations between actors.’ Since the late ‘70s, we have learned much from the 
study of international trade using network analysis. Many studies confirmed the world- 
system hypothesis that the world economy is composed of distinct positions: core, 
periphery and semi-periphery (Blanton 1999; Kick and Davis 2001; Nemeth and Smith 
1985: Schott 1986; Smith and White 1992; Snyder and Kick 1979; Su and Clawson 1994; 
Van Rossem 1996). Furthermore, several studies found that zones lower than the core 
have slower economic growth (Kick and Davis 2001; Kick et. al. 2000; Nemeth and Smith 
1985; Snyder and Kick 1979). 

Another line of research demonstrated the notion that zones of the world economy display 
differential levels of processing in terms of the products they produce and export (Nemeth 
and Smith 1985: Smith and White 1992). Smith and White’s (1992) study illustrated an 
unequal division of labor from 1965 to 1980. According to Smith and White (1992), high 
processed goods flowed from core to non-core countries in ratios ranging from 3/1 to 13/1, 
and commodities of low processing levels flowed in the opposite direction, from “peripheral 
and other lower blocks toward consumption in the core.” (882) Recent network analyses 
claim to find evidence that the world economy is becoming more structurally equal (Kim and 
Shin 2002). However, their findings are hard to interpret because of their operationalization 
of structure and neglect of /evels of processing. 

Kim and Shin (2002) conduct a longitudinal network analysis of globalization and suggest 
that the network is becoming more equal. This claim is based on two related empirical 
observations. First, they observe an increase of trade ties by middle-range countries to 
suggest that “at the core of [globalization] has been the deve/opment of countries in the 
middle strata.” (Kim and Shin 2002: 445, emphasis added) Second, they observe that 
“over time the network looks less like a star configuration with one dominant node... , the 
relative dominance of a single central country has decreased, and the world trade structure 
has been decentralized since 1959.” (Kim and Shin 2002: 456) However, these empirical 
observations are difficult to interpret for two reasons. First, they make no attempt to assess 
the alleged benefit of the expanded ties they observe by looking at the type of commodities 
that make up the increased interaction or correlating these expanded ties with development 
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indicators. Second, they do not delineate subgroups in their data. Rather, they measure 
network centralization from the perspective of one individual actor. Therefore, their finding 
that the network is less centered around one dominant node fails to adequately assess 
whether or not the network conforms to a core/periphery hierarchy, and may simply reflect 
greater density.‘ Ultimately, when they purport to assess the explanatory value of world- 
system/dependency literature vis-a-vis classical and neo-classical economics, they fail to 
adequately operationalize these theoretical perspectives. This article resolves these 
problems of interpretation by assessing the extent to which the structure conforms to a 
core/periphery hierarchy, establishing whether or not structural positions remain unequal in 
levels of processing and measuring the temporal mobility of individual countries within the 
structure. 


Data 


The data for this study come from the Commodity Trade Statistics compiled annually by 
the UN (United Nations 1986). The data are classified according to the Standard 
International Trade Classification system (SITC Rev.1). SITC classifies commodities on a 
continuum from one digit (very general) to five digits (very specific).° Because this study 
is designed to follow Smith and White (1992) in drawing distinctions between levels of 
processing, | use the two-digit level. The two-digit level is superior to other levels of 
generality for a number of reasons. (1) One-digit categories are too aggregated to draw 
appropriate distinctions.® (2) Using the two-digit codes over less general categories limits 
the range of possible commodity matrices from thousands to 55. (3) Measurement error 
and misclassification become increasingly problematic as researchers move to the more 
specific SITC categories (Durand 1953). However, it is difficult to assign a level of 
processing to even the two-digit level commodity categories because they, too, are quite 
heterogeneous. This problem is addressed often in the literature (Smith and Nemeth 
1988; Firebaugh and Bullock 1986; Nemeth and Smith 1985; Stokes and Jaffee 1982: 
Steiber 1979; Delacroix 1977). 

The most systematic and replicable way to resolve this problem was carried out by 
Nemeth and Smith (1985; 1988), who found five significant factors of commodities 
spanning a continuous space between extraction based/labor intensive to production 
based/capital intensive. The commodities | use here match Smith and White's (1992), and 
represent the levels of processing developed by Smith and Nemeth (1988):’ 


High Technology/Heavy Manufacture 

71 Machinery, other than Electric 

58 Plastic Materials, Regenerated Cellulose and Artificial 
69 Manufactures of Metal 

Extractive 

64 Paper, Paperboard and Manufactures thereof 


25 Pulp and Waster Paper 
34 Gas, Natural and Manufactured 
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Low Wage/Light Manufacture 


84 Clothing 
85 Footwear 
83 Travel Goods, Handbags and Similar Articles 


Food Products and By-Products 


01 Meat and Meat Preparations 
02 Dairy Products and Bird's Eggs 
29 Crude Animal and Vegetable Material 


Finally, it is important that this study be restricted to countries that appear in every year. It 
would be impossible to interpret my results if each panel had a different set of countries. 
Smith and White had 63 countries at each time point between 1965 and 1980. Since 1980, 
however, new countries have emerged while others failed to report their data after 1980. 
Thus, in order to maintain a network with the same countries at each time point, this project 
analyzes matrices of order 53.° To conclude, the data are pair-wise trade flows between DS 
countries measured in current U.S. dollars in 1965, 1970, 1980, 1990 and 2000. 


Methods 


Comparative Network Analysis 


Network methodology is well suited to the study of international trade because it focuses on 
relationships among social entities and the implications of these relationships (Wasserman 
and Faust 1999). As | describe above, world-system theory explains national development by 
highlighting the relations between nation states and the corresponding roles these relations 
facilitate in the world economy. The method | use involves the network concept of regular 
equivalence. Regular equivalence locates actors who relate to the other actors in a network in 
the same way. Specifically, “the notion of regular equivalence formalizes the observation that 
actors who occupy the same social position relate in the same ways with other actors who 
are themselves in the same positions.” (Wasserman and Faust 1999: 473) More formally, “two 
points in a network are regularly equivalent if and only if for each tie one has with another 
point, the self-equivalent point has an identical tie with an other-equivalent point.” (White and 
Reitz 1985) In other words, for points A and B to be regularly equivalent, they must have the 
same ties to equally equivalent others. 

The first step in this analysis calculates the degree of regular equivalence for each pair of 
countries with the following algorithm. The regular equivalence (M‘*";) between countries / 
and / at iteration t + 1 Is: 


day : 
‘ De max., De (ix Men + ii M tome) 
where the denominator is the maximum possible value of the matches between the profiles of 


ik and jm that would occur if all of the ties between j and its alters (k) were perfectly matched 
to the ties between / and its alters (m), and all of k and m were regularly equivalent. The 
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numerator determines the best matching of the ties between / and m for 7’s ties to k weighted 
by the regular equivalence of k and m from the previous iteration (Wasserman and Faust 1994). 
Thus, the algorithm determines the best possible matching of ties between / and/ weighted by 
the equivalence of their alters, and divides that value by the maximum possible value of the 
numerator. It is important to remember that the equivalence of each pair of actors is revised after 
each iteration (t +1). | have specified three iterations, with the third serving as the measure of 
regular equivalence for each pair of countries.’ It is highly unlikely that any two nations would be 
exactly equivalent, so applying a regular equivalence algorithm to five matrices including the 
sum of all the values of the 12 commodity categories produces an equivalence measure for 
each pair of countries between maximally dissimilar (0) and regularly equivalent (1).'° 

In order to- scale the resulting matrix of regular equivalences and quantify the 
similarity/difference between countries, the second step uses the equivalence matrix as input 
for correspondence analysis. Correspondence analysis is one of a family of techniques that 
draw on a common computational foundation: the Singular Value Decomposition (SVD). 
Included in this family of techniques are Principle Components Analysis, Factor Analysis and 
Optimal Scaling, among others. While the SVD is common to all these approaches, the pre- 
processing that is done to the data differentiates between them. In the context of the present 
analysis, correspondence analysis is simply a singular value decomposition preformed on a 
matrix H in which the cells of the original matrix have been transformed so that the row/ 
column marginals are approximately 1. Conceptually, correspondence analysis represents 
actors in a multi-dimensional Euclidian space by assigning coordinates to actors that place 
them close to those with whom they are similar and far from those with whom they are 
dissimilar (Weller and Romney 1990). Interpreting the results from correspondence analysis 
depends on the amount of variation explained by each Singular Value/dimension and the 
observed spatial pattern of objects in the Euclidian space. Thus, one can have a relatively 
simple structure (few significant dimensions) or a complex one (many significant dimensions). 
After the significant dimensions are delineated in this analysis, | can use these dimensions 
to make general observations about the latent structure contained in the data. 

The third and fourth steps in this research process seek to delineate which world-system 
zones countries occupy and examine the relationship between zones with block models. A 
block model is simply a means to group actors who have similar patterns of relations 
together, and analyze the relationships between relatively equivalent groups (Borgatti and 
Everett 1992; Sailer 1978). | establish zonal location by clustering the matrix of regular 
equivalencies into groups of relatively equivalent countries. | apply two complementary 
hierarchical clustering algorithms to the matrix of regular equivalencies — complete link and 
average link — that are available in UCINET."' The resulting groups are the basis for fitting block 
models in each time point. | use these block models to analyze the patterns of trade between 
world-system zones by separately fitting them to each matrix of the 12 commodity 
categories. The result is 12 separate block models for each of the five time slices that 
represent the four levels of processing. By examining the pattern of commodity exchange 
separately for different levels of processing, | can begin to answer questions about continuity 
in a qualitative gap between the types of industries that dominate in advanced industrial 
countries vs. those that dominate in historically poor countries. 

The final step in this study assesses patterns of mobility in the 35-year period from 1965 
to 2000. After correspondence analysis assigns coordinates to each country, these 
coordinates can be used as a measure of position in the world economy. Upwardly mobility 
can be measured as the change in rank-ordered position between years for each country. In 
order to measure mobility, | first correlate the positions between 1965 and 2000 to suggest an 
overall level of change. | then regress the rank order measure for 2000 on the measure for 
1965 to locate countries that demonstrate significant mobility. 
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Hierarchy from 1965 to 2000 


Table 1 contains the Singular Values and percentage of explained variance for the first through fifth 
dimensions of the correspondence analysis of regular equivalence for the years 1965 to 2000. 


Table 1: Correspondence Analysis of Regular Equivalence by Dimension and Year 





Dimensions 1965 1970 1980 1990 2000 

Dimension 1 

Singular Value .035 .038 025 .032 029 
Explained 

Variance* 97.00 97.00 97.40 97.00 95.20 

Dimension 2 

Singular Value .004 .004 .003 .004 .005 
Explained 

Variance* 1.20 1.10 1.40 1.50 2.80 

Dimension 3 

Singular Value .003 .003 .002 .003 .003 
Explained 

Variance* 10 .60 .60 .90 10 

Dimension 4 

Singular Value .002 .003 .001 .002 .002 
Explained 

Variance* .30 .60 .20 40 50 

Dimension 5 

Singular Value .001 .002 .001 .001 .001 
Explained 

Variance* .08 .30 .20 10 10 


EE 
* Explained Variance for each Dimension is calculated as a percentage: 


2 
Aw 
M 


2, 


100x 





‘m 
m=1 


where M indexes Singular Value 1, 2, 3, ...M 


The first dimension of this analysis explains nearly all the variance in regular equivalence 
between countries. This is apparent by comparing the amount of variation explained by the 
first dimension with that explained by subsequent dimensions. The pattern is such that the 
first dimension accounts for between 95.2 and 97.4 percent of the variance in each time 
period, while each subsequent dimension accounts for less and less of the variance. Tables 
2a and b contain the first dimensional coordinate for each country in each of the periods. The 
country columns list the countries ordered from highest to lowest correspondence analysis 
coordinate in each year, with rich countries such as the United States on the top and relatively 
poorer countries including Togo and Senegal on the bottom. Tentatively, this configuration 
suggests a hierarchical interpretation (Borgatti and Everett 1999; Mahutga 2005)."* 
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Table 3a: Block Membership by Country, Position and Year (N = 27) 


EEE EEE 


Country Blocks 


USA USA 
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Germany 
UK 

France 
Italy 
Sweden 
Japan 
Netherlands 
Canada 
Switzerland 
Denmark 
Belgium 
Argentina 
Australia 
Finland 
Austria 
Norway 
Hong Kong 
New Zealand 
Thailand 
Mexico 
Spain 
Philippines 
Brazil 
Ireland 
Malaysia 


1965 
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1970 


1980 
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1990 
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2000 


DSUINS LIND ISD Ge IND NUG8 IND G8 INDOOR IND C5 ee INI IND IND oe I te ie 


U Hungary 3 Zz 2 3 


Notes: *1 = Core; 2 = Strong Semi-Periphery; 3 = Weak Semi-Periphery; 4 = Strong 


Periphery (1965-1980); 4 = Periphery (1990 and 2000); 5 = Weak Periphery (1965-1980). 


Letters denote countries in subsequent scattergrams. 


Network analysis provides an intuitive notion of a core/periphery structure. Specifically, “the 
intuitive conception entails a dense, cohesive core and a sparse, unconnected periphery.” 
(Borgatti and Everett 1999) A continuous core/periphery model implies that countries are either 
“more or less” core-like. This notion presumes that interaction decreases monotonically for 
countries that are less core-like. The continuous notion of a core/periphery structure is closest 
to the empirical expectations of the world-system perspective that trade patterns reflect an 
underlying continuum in the international division of labor between core (capital intensive) to 
peripheral (labor intensive) production processes. Thus, if the data conforms to a 
core/periphery structure, interaction should decrease as one moves away from the core block 
in a block model. 
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Table 3b: Block Membership by Country, Position and Year (N = 26) 





Country Blocks 
1965 1970 1980 1990 2000 


00 Portugal 3 2 3 2 3 
m Peru 

X Israel 

f Morocco 

R Guatemala 
qq Singapore 

b South Korea 
Q Greece 

kk Pakistan 

vy Turkey 
Nicaragua 

H Chile 

pp Senegal 


aaa Venezuela 
El Salvador 
Honduras 
Costa Rica 
Iceland 
Paraguay 
Columbia 
Tunisia 
Ecuador 
Malta 
Bolivia 
Togo 
Jordan 


aaa EEE ——==E== 
*1 = Core; 2 = Strong Semi-Periphery; 3 = Weak Semi-Periphery; 4 = Strong Periphery (1965- 
1980); 4 = Periphery (1990 and 2000); 5 = Weak Periphery (1965-1980), Letters denote 
countries in subsequent scattergrams. 
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In order to assess whether or not the pattern of relations conforms to the above 
conception of a core/periphery structure, | collapse individual countries into relatively 
equivalent blocks. From 1965 to 1980, there were five major blocks among countries for both 
the average and complete link routines. The years 1990 and 2000 produced four instead of 
five block solutions.’ Tables 3a and 3b detail which block contains each country, by year. In 
the years 1965 through 1980 | have five blocks labeled core, strong semi-periphery, weak 
semi-periphery, strong periphery and weak periphery. In 1990 and 2000, | have only the 
periphery in place of the strong and weak peripheries. These are the most useful and 
interpretable blocks that correspond to the complete and average link hierarchical clustering 
routines." 
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Table 4 shows the block model for the aggregate trade in 1965 and demonstrates the 
continuous core/periphery pattern of interaction for 1965, which holds throughout each year 
analyzed. The heaviest interaction between blocks is that between each block and the core, 
and levels of interaction decrease as you move away from the core. 


Table 4: Block Model of Aggregate Trade for 1965 





Sending Block Receiving Block 
Strong Weak 
Semi- Semi- Strong Weak 
Core Periphery Periphery Periphery Periphery 

Core 585,106 296,705 72,803 42,776 10,174 
Strong Semi- 213,205 34,506 7,656 3,284 1,308 
Periphery 
Weak Semi- 19,241 1511 486 339 44 
Periphery 
Strong Periphery eos 638 220 poe 65 
Weak Periphery 797 89 8 14 0 


*Values are constant U.S. dollars in units of 1,000. 


Table 4, along with tables 1, 2a and 2b yield a clear and robust interpretation concerning the overall 
structure. The latent structure of the data conforms well to the conception of core/periphery from 
network theory. However, it is important to assess the consistency between this measure of the 
core/periphery hierarchy and previous results that included other theoretical considerations. Some 
argue that international trade is not the only kind of relationship that matters in delineating where 
countries reside in the core/periphery hierarchy (Chase-Dunn and Grimes 1995; Kentor 2000; Snyder 
and Kick 1979). These authors suggest that military, diplomatic and legal relations are as important 
to the reproduction of the core/periphery hierarchy as trade. Kentor (2000) produced the most 
systematic and longitudinal attempt to determine the location of countries in the core/periphery 
hierarchy using this logic. Kentor (2000) estimated an index of location in the core/periphery hierarchy 
between 1800 and 1990 by combining measures of a nation’s capital intensiveness, gross domestic 
product (GDP), military power, labor productivity, total exports, internal foreign capital penetration, etc. 
Two of Kentor’s periods overlap with the years in the current analysis: in 1970 there are 34 matching 
countries, and in 1990 there are 49. Both years show significant positive correlations between the 
current network measure and Kentor’s. For 1970, the two measures are only 13.4 percentage points 
(.866) from a perfect one-to-one matching of rank orders. In 1990, when the data are more complete, 
the measures are only 5.7 percentage points (.943) from a perfect one-to-one matching of rank order. 
Thus, this network measure of position in the world economy is robust when considering variables 
associated with other conceptualizations from the world-systems perspective. 

In sum, the first dimension of the analysis explains nearly all the variation of relational 
difference between countries. Second, the patterns of trade in the block models conform to 
the notion of a continuous core/periphery structure. Finally, there is a nearly one-to-one 
correlation between this measure and Kentor’s operationalization of the core/periphery 
hierarchy. Together these findings provide robust evidence that the world economy continues 
to operate within the confines of a hierarchical, core/periphery structure. Therefore, these 
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results suggest that the period from 1965 to 2000 should not be characterized as eradicating 
the hierarchical structure of the world economy. The next section assesses the extent to 
which levels of processing differentiate the core from the periphery in this hierarchy. 


Segmented International Division of Labor 


Industrialization in one world economic zone may not be equal to industrialization in another. 
Drawing distinctions between levels of processing remains important for their tendency to either 
encourage or constrain the whole of a nation’s productive capacity. Consequently, when authors 
point to a rise in “industry” and fail to differentiate between certain kinds of industry, this 
potentially misrepresents the type of industrialization underway in the developing world." If there 
is a systematic pattern in which commodities produced at higher levels of processing originate at 
higher zones of the structure and commodities produced at lower levels of processing originate 
at lower zones of the structure, this would suggest that the international division of labor remains 
bifurcated between countries with advanced industry and countries with less advanced industry. 
Such a bifurcation would suggest that unequal terms of trade still benefit the advanced industrial 
group over the less advanced group (Delacroix 1977; Stokes and Jaffee 1982). 


Table 5: Index of Asymmetrical Levels of Processing as Log of Exports to Imports from Higher 
to Lower Blocks 


Log of Import/Export from 


Commodity Type Lower/Higher Blocks* 
1965 1970 1980 1990 2000 


High Technology/Heavy Manufacture 


71 Machinery, other than Electric S De samen? iy bc - Owe unoe 1 a1 
58 Plastic Materials, Regenerated Cellulose -62 -105 -80 -26 -A41 
and Artificial Resins 
69 Manufactures of Metals pAS ee = OO iGO, a 2 Gea 
Extractive 
64 Paper, Paperboard and manufactures 44 -.23 05 27. 16 
thereof 
25 Pulp and Waste Paper 62 ‘11 34 .60 56 
34 Gas, natural and manufactured SU = 00 04 63 14 
Low Wagellight Manufacturing 
84 Clothing 43 fal 58 50 60 
85 Footwear We .28 .38 50 .29 
83 Travel goods, Handbags, similar Articles 24 nz Fol .28 19 
Food Products 
01 Meat and meat preparations 1.01 1.04 97 103: .30 
02 Dairy products and birds egg 50, 220. 2-05 ©.04 2 514 
29 Crude animal and vegetable materials 48 .26 25 HS M5 


Notes: * Key to conversion from logs to ratios 

Fractional part of log: .9, .8, .7, .6, .5, .4, .3, .2, .1, .0 

Integer Part of Log: 0., 7.9, 6.3, 5.0, 4.0, 3.2, 2.5, 2.0, 1.6, 1.3, 1.0, 
1., 79, 63, 50, 40, 32, 25, 20, 16, 13, 10 
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Table 5 reports a summary index of asymmetrical levels of processing for the years 1965 to 
2000. This index assesses whether or not the core/periphery structure conforms to a 
segmented division of labor by dividing the sum of the cells below the diagonal (flows directed 
up the hierarchy) by the sum of the cells above the diagonal (flows directed down the hierarchy) 
for each of the 60 block models estimated in the previous section. | log the ratio so that it is 0 
at parity, positive when net commodity flows move up the hierarchy and negative when they 
move down. The key to interpreting the logged ratio is on the bottom of the table. 

Overall, the pattern of trade asymmetry in which goods with high levels of processing are 
exchanged between the core and higher zones of the hierarchy for goods with low levels of 
processing from lower zones has not changed. The ratios for “High Technology/Heavy 
Manufacture” indicate that these commodities flow down the hierarchy at a ratio ranging 
from 10/1 to 1.3/1. Commodity categories “Plastic Materials, Regenerated Cellulose and 
Artificial Resins” and “Manufactures of Metals” decrease in the ratio of asymmetry between 
1980 and 1990, but reverse this trend in 2000. The Extractive category demonstrates the 
tendency for commodities at low levels of processing to originate at lower zones in the world 
economy. “Paper, Paperboard, and Manufactures thereof” are variable, while “Pulp and Waste 
Paper” is stable and “Gas, Natural and Manufactured” increases the positive trade balance 
between lower blocks and higher blocks. Likewise, the category “Low Wage/Light 
Manufacturing” reveals a consistent pattern of commodity flows that move up the hierarchy, 
though “Footwear” is highly variable throughout the 35-year period. 

The contrast between the categories “High Technology/Heavy Manufacture” and “Low 
Wage/Light Manufacturing” suggests that industrial expansion in developing nations is highly 
uneven. While industrialization has occurred in non-core countries, not all countries have 
industrialized to the same degree in levels of processing. As Amsden notes, “there are also 
different degrees of convergence between North and South for different industries. The 
South accounts for 43.8 percent of world shoe production, for example, but for only 28.3 
percent of world steel production.” (Amsden 2003: 35) 

It is important to note that the magnitude of flow asymmetry within the class “High 
Technology/Heavy Manufacturing” decreases between 1965 and 2000. Overall, the magnitude 
decreases by 52 percent for this class. However, the categories “Plastic Materials, Regenerated 
Cellulose and Artificial Resins” and “Manufactures of Metals” reverse this trend by 2000. 
Furthermore, the very dramatic decrease of trade asymmetry for “Machinery, other than 
Electric” accounts for 54 percent of the decrease in the entire class. Further analysis shows that 
the strong semi-periphery is responsible for this decrease. The ratio in trade asymmetry for 
“Machinery, other than Electric” between the core and strong semi-periphery moves from a ratio 
of 3.2/1 in 1965 to parity in 2000, while the ratio between the core and both peripheries remains 
stable at 25/1 in 1965 and 20/1 in 2000. This shows that countries that were already relatively 
advantaged vis-a-vis lower peripheral countries benefited from globalization in ways that the 
poorest countries have not.” Finally, the upward flow for Extractive and Low Wage/Light 
Manufacturing categories decreases by only 8 and 13 percent respectively. Such a finding is 
consistent with the possibility that a few stereotypical semi-peripheral countries were able to 
move up the hierarchy by “upgrading” to advanced industries (Gereffi et. al. 2001).'® 


Mobility 


The previous sections found that the world economy is organized into a core/periphery hierarchy 
and the upper zones of the hierarchy tend to specialize in advanced industry. Therefore, patterns 
of mobility are another dimension by which to gauge the alleged change via globalization and the 
NIDL. If countries tend to move from lower to higher zones of the core/periphery hierarchy, then 
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this would suggest that structural inequality is decreasing. Alternatively, if an unequal structure is 
characterized by little long-term upward mobility, it reflects an underlying stability in the 
distribution of industrial technology and structural inequality. 

Tables 6 displays upward and downward mobility measured as the change in rank order 
position from 1965 to 2000, as well as the positions of each country for the same years. 
Considering the whole system, there is a high level of continuity. Again, the correlation of .856 
indicates that the structural position of countries was highly stable between 1965 and 2000. 
Furthermore, there is a tendency for higher stability at the top of the structure. This is obvious 
when looking at the top five positions of the hierarchy. The United States and Germany 
remain the top two economies from 1965 to 2000, while the United Kingdom, France and Italy 
only drop one spot to accommodate the upward mobility of Japan. There is no stronger 
pattern of stability among any five consecutive positions than that displayed by the top five. 
The second most stable set of positions is the bottom five. Of these bottom five positions, 
Ecuador displays the most upward mobility at four positions. The other four positions would 
have remained in the bottom five if it were not for the moderate downward mobility of Iceland 
and the extreme downward mobility of Senegal. Overall, the high level of stability that is 
especially pronounced at the upper and lower ends of the continuum suggests further that 
the equalizing effect of globalization and the NIDL may be overstated. 

It is important to assess temporal variability in the overall amount of structural stability 
detailed above. Importantly, the expansion of neo-liberal trade policy since the 1980s could 
account for much of the change between 1965 and 2000. If this were the case, the 
curtailment of state level protectionist policies could enhance the NIDL, which could suggest 
that significant change began more recently. In other words, the structure of the world 
economy may be on a trajectory of change that will produce greater amounts of mobility in 
the future than occurred in the years leading up to 2000. 


Figure 1: Temporal Variability in Structural Change 
ic 


Percent Change 


2 
1965-1970 1970-1980 1980-1990 1990-2000 


Time 
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To assess this possibility, | examine the 10-year increments in terms of their share of the overall 
14.4 percent change. Figure 1 shows the amount of the 14.4 percent change accounted for by 
each time period. Rather than increasing the amount of mobility in the system, increasing trade 
liberalization correlates with decreasing mobility. Of the overall 14.4 percent change, the highest 
period of change (6.1 percent) is between 1970 and 1980, with a substantial return to stability 
during the two subsequent 10-year periods (3.1 and 2.9 percent respectively). Therefore, the 
evidence does not support the contention that “opening up to trade and exporting should 
accelerate [mobility].” (Kim and Shin 2001: 449) 

Within the high level of stability that characterizes the whole world system, it is 
important to identify countries that experienced significant mobility. As Table 6 
demonstrates, it is difficult to differentiate between significant mobility and that which 
would occur given the upward/downward mobility of a few countries.” In order to identify 
countries with significant levels of mobility, | regress the 2000 measure of position on the 
1965 measure. Because the constant is insignificant,’ | exclude it and include a 90 percent 
confidence interval for the individual observations.” Countries above the upper interval 
demonstrate significant upward mobility, and countries below the lower interval 
demonstrate significant downward mobility. Figure 2 shows the scatter plot, along with the 
confidence interval for the years 1965 and 2000. 


Figure 2: Scattergram Showing Significant Mobility between 1965 and 2000 
60 


50 
40 


30 


1965 


20 





2000 


Notes: See tables 3a and 3b to match labels with countries 


Only South Korea (b), Singapore (qq) and Turkey (yy) demonstrate significant upward mobility. 
The exceptional performance of these countries should come as no surprise. South Korea and 
Singapore are among the manufacturing miracles of East Asia, and are often studied as exemplars 
of third world development (Gereffi and Wyman 1990; Haggard 1990). Though Turkey began to 
develop later than some of the prototypical East Asian “NICs,” its upward mobility is directly related 
to its shift from lower to higher levels of processing (Amsden 2001: 139 and 159). The 
performance of these three countries suggests that development is a viable goal for some poor 
countries, and claims that development is impossible are incorrect. At the same time, however, the 
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small number of upwardly mobile countries suggests that mobility is the exception rather than 
the rule. Upward mobility within the core/periphery hierarchy is exceedingly difficult and rare. 
Future research would benefit by comparing these three countries to those demonstrating less 
success in terms of their efforts to engage in “industrial upgrading.” (Gereffi et al. 2001) 


Conclusion 


This study produced several important findings. The first dimension of my analysis of the 
variance of regular equivalence between countries provides a robust structural measure of 
the world economy. This measure captures the continuous hierarchical nature of the world 
economy and is highly correlated with the core/periphery concept from the world-system 
perspective. Second, the analysis of commodity exchange suggests that unequal levels of 
processing continue to create structural inequality through the reproduction of a segmented 
international division of labor. Third, there is a high level of structural stability between 1965 
and 2000, with the more recent decades demonstrating the most stability. Within this high 
level of stability, three countries demonstrated significant upward mobility. 

Together, these findings yield some important generalizations. First, the hierarchical nature of 
the world system remained stable from 1965 to 2000, both in terms of core/periphery patterns 
of interaction and production processes. The most important change within this general 
tendency was the rise of labor-intensive manufacturing in non-core zones of the core/periphery 
hierarchy. Thus, the old world of a manufactured goods/raw materials dichotomy co-exists with 
a low value added/high value added dichotomy. Second, upward mobility in the world economy 
is rare through a period that many claim represents a massive restructuring.” However, the 
exceptional performance of South Korea, Singapore and Turkey show upward mobility is 
possible. Third, the expansion of neo-liberal trade regulations is associated with less upward 
mobility than the period before this policy expansion. In general, the processes associated with 
globalization and the NIDL have not reversed structural inequality. 

The global restructuring that began in the 1960s had the potential to undercut the 
historical dominance of core countries. By providing more sophisticated production 
technology amenable to spin-offs and the development of domestic capital, the NIDL could 
have spelled the end to centuries of world inequality. However, the changes brought about by 
these global shifts instead produced “winners” and “losers.” The winners are the core 
countries that maintained their dominant positions throughout these global shifts. Another 
group of winners are the few semi-peripheral countries that were able to move up the value- 
added hierarchy and bring their standard of living with them (Amsden 2001; Haggard 1990). 
However, there is a rather large group of losers from this process. As Peter Dicken notes, 


“the already affluent developed countries have sustained — even 
increased — their affluence, some developing countries have made 
very significant progress, but there is a hard core of exceptionally poor 
countries that remains stranded... ” (Dicken 2003: 514). 


Thus, future macro-level research on the consequences of globalization can benefit from attempts 
to generalize as to the contingent impediments/conduits to technological upgrading and upward 
mobility in spite of generalized structural inequality. Finally, while this study was limited to assessing 
whether or not there were significant alterations to the structure of economic relations between 
individual nations in the whole world economy, it may also be interesting to explore variations in 
regional performance, as well as competition between hegemonic core states (i.e., the United 
States) and potential rivals, including the European Union and the East Asian region headed by 
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Japan, China and South Korea. Asking these kinds of questions could further clarify our 
understanding of the causes of inequality (structural or otherwise) in the world economy today. 


Notes 


i 


The dependency and world-systems perspectives often speak of “unequal exchange” rather 
than unequal terms of trade. However, the exact meaning of “unequal exchange” is itself 
open to debate. O'Hearn (2001) differentiates between an “exploitative” and “oppressive” 
understanding. The exploitative form occurs through the appropriation of surplus labor 
across zones of the world economy when low wage earners in poor countries exchange 
their wages for high value goods on the world market. The oppressive understanding occurs 
when poorer regions are excluded from the most innovative and profitable activities, which 
is closer to the meaning described here. O'Hearn suggests that the predominance of one 
form over the other occurs cyclically, and thus they are intimately connected. | would 
suggest that because low levels of processing are associated with low wages, these are 
perhaps even more intimately connected than the literature suggests. 


The Uruguay rounds began in 1986 and culminated in 1993. The institutionalization of the 
WTO was finalized in 1994. Furthermore, the US-Canada free trade agreement was signed 
in 1988 and the North American Free Trade Agreement (NAFTA) was signed in 1993. Many 
other regional free trade agreements came into being throughout the 1980s and 1990s. 


Indeed, there has been an almost universal recognition in studies of global political 
economy that the best ways to conceive changes in the world economy use the concept 
of networks, whether they be global city networks (Sassen 2002; Taylor 2001); commodity 
chains linking nodes within production networks (Gereffi and Korzeniewicz 1994); or entire 
orienting perspectives like the “Network Society.” (Castells 1996) Recent empirical 
research has also shown that the structure of world-cities and that of nation-states are 
strongly associated (Alderson and Beckfield 2004), suggesting that the use of nation 
states as nodes in network analysis remains a relevant methodological strategy. 


Network Centralization scales linearly with mean degree, such that graphs with high mean 
degree cannot have high centralization. This is because degree centrality is equal to: 


5 (A-d) 


i=l 


1) Ci(G) = Ty yN=2) 


where A is the maximum actor degree centrality. One can re-express the numerator in 
terms of mean degree with 


N, = 
2) > (A-d) = N(A-d), 
i=l 
where d is the mean degree. Substituting equation 2 for the numerator of equation 1 gives 


N(A-d) 


3) C,(G) = (N=1)(N=2): 


Clearly, degree centralization falls linearly with mean degree. (See Butts 2005 for a complete 
derivation and discussion of the determinants of exact bounds for centrality measures.) 
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Obviously, rising mean degree (or density) reflects rising connectivity, which in many ways 
is the definition of globalization. In other words, because decreasing network centrality 
values only reflect greater connectivity (globalization), this finding only verifies that 
globalization has indeed occurred and leaves open the question as to the benefit of 
globalization for developing countries. 


These data were purchased from the United Nations Statistical Office in electronic form, 
and therefore represent the most up-to-date information for both new countries, as well 
as upgraded information for older nations. 


One-digit categories such as “Manufactured Goods Classified Chiefly by Material” include 
commodity types as diverse as “Textile Yarn, Fabrics, Made-up Articles” and “Iron and Steel.” 


These commodity categories consistently loaded high on the first factor of Smith and 
Nemeth’s (1988) factor analysis, and remained stable through the 15 years they analyzed 
(1965-1980). Using these 12 instead of the 39 possible (that consistently loaded high on 
the first factor) reduces the number of trade matrices to analyze from 195 (39 categories 
X five years) to 60 (12 categories X five years). My own unreported analysis corroborates 
their findings for these 12 categories. See Amsden 2001; and Gereffi and Wyman 1990; 
for discussions of the importance of these kinds of industries. 


Two anonymous reviewers were concerned over the absence of China from this analysis. 
The reason for China's exclusion is straightforward: it only appears in 1990 and 2000. This 
is because China did not report this information to the United Nations before engaging in 
economic reforms in the 1980s. 


Faust (1988) finds that three iterations are sufficient in most cases. 


. |used the program, UCINET (Borgatti, Everett and Freeman 2002). | used the base 10 log 


of the dollar amounts to re-scale the variance in commodity exchange between countries. 
This maintains the relative differences between countries while aiding the algorithm in 
convergence. 


Hierarchical clustering starts by putting each actor in an NXN matrix into its own cluster so 
that the similarity between clusters equals the similarity between each actor. The 
procedure then finds the most similar pair of actors and merges them into one cluster. The 
third step computes similarities between the new cluster and each of the other actors. 
The second and third steps are carried out until all actors have been merged into a single 
cluster of size N (Borgatti 1994). 


Graphic displays of the relative position of countries delineated with correspondence 
analysis are available from the author upon request. 


In general, complete and average link clustering agree on the partitioning of actors at each 
level of regular equivalence. Correlations are well above .900 in every case. Therefore, the 
block models reported here are the results of only the average link algorithm. Substantive 
conclusions were the same in both cases. 


The years 1990 and 2000 are different in that most of the successive block solutions after 
a four-block solution would split individual peripheral countries into their own blocks. 


15 


16. 


ie 


19: 


20. 


21. 


oe; 


23; 
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Therefore, in the year 2000 | collapsed several individual country blocks into the peripheral 
block. This decision does not bias the analysis, and in fact provides for rather conservative 
estimates of trade asymmetries because it creates one periphery rather than breaking it 
down into tiny one- and two-member blocks. 


.While choosing the appropriate number of blocks can sometimes seem arbitrary, 


Wasserman and Faust (1999) note that “the ‘trick’ is to choose the point along the series 
that gives a useful and interpretable partition of actors into equivalence classes [blocks].” 
(383) The reader should also note the likelihood that the following analysis will yield lower 
ratios of trade asymmetry because the clustering routines are breaking apart countries 
that would likely be kept together if more variability was introduced by the numerous poor 
countries precluded because of missing data. 


For example, Amsden (2003) provides a table (Table 3) that lists the industry shares in 
terms of the gains and losses by developing countries, but does not draw attention to the 
fact that the industries for which the developing world gains 10 percent or more are much 
more likely to be of the labor intensive or extractive type than are those industries for 
which the developing world gains less than 10 percent. 


Since semi-peripheral countries have, by definition, a much more equal mix of “core” and 
“periphery” production processes than do peripheral countries, some or all of them are 
much more likely to have the necessary basic infrastructure to develop advanced industry 
(Chase-Dunn 1998). 


_|f the spread of advanced industry was even throughout the network, then all categories 


would experience a similar decrease in magnitude. However, if only a few countries 
developed, then that would explain the divergent decreases: most historically poor 
countries continue to produce and export less advanced industry while a few of them 
“upgraded” to advanced industrial processes. 


West Germany in 1965, 1970 and 1980. 


Ordinal rank orders are sensitive to mobility because one upwardly mobile country would 
cause every country passed in rank order to lose one position. 


This would be expected given the nature of rank order measures discussed above. 


The 95 percent confidence interval decreases the number of upwardly mobile countries to 
two (South Korea and Singapore), and the 99 percent interval delineates one upwardly 
mobile country (South Korea). 


Two anonymous reviewers pointed out correctly that the inclusion of China could change 
the substantive interpretations. Data on China were only available for the two periods, 
1990 and 2000. Taking the reviewer's comments into consideration, | reanalyzed these 
periods and included China. Indeed, China did display upward mobility, but it was not 
significant. Furthermore, China appears in the strong semi-periphery in 1990, and remains 
there in 2000. While this lends tentative support to my findings, future studies might 
narrow the temporal range to periods in which data is available to examine the 
performance of China, especially in comparison to countries that perform well. See note 
nine for a discussion of China’s absence. 


1886 * Social Forces Volume 84, Number 4 * June 2006 


References 


Alderson, Arthur, and Jason Beckfield. 2004. “Power and Position in the World City System.” American Journal 
of Sociology 109(4):811-51. 


Amsden, Alice. 2003. “Comment: Good-bye Dependency Theory, hello Dependency Theory.” Stuales in 
Comparative International Development 38(1):32-38. 


. 2001 The Rise of “The Rest:” Challenges to the West from Late-Industrializing Economies. Oxford 


University Press. 


Arrighi, Giovanni, Beverly Silver and Benjamin D. Brewer. 2003. “Industrial Convergence, Globalization, and the 
Persistence of the North-South Divide.” Studies in Comparative International Development 38(1):3-31. 


Barro, Robert J., and Xavier Sala-I-Martin. 1995. Economic Growth. McGraw Hill 


Blanton, Robert G. 1999. “Structure and Behavior in the World Trading System.” /nternational Interactions 
25(2):119-44. 


Borgatti, Stephen. 1994. “How to Explain Hierarchical Clustering.” Connections 17(2):81-84. 


Borgatti, Stephen, and Martin Everett. 1999. “Models of Core/Periphery Structures.” Socia/ Networks 
21:375-95. 


. 1992. “Regular Block Models of Multiway, Multimode Matrices.” Socia/ Networks 14:91-120. 


Borgatti, Stephen P, Martin G. Everett and Linton C. Freeman. 2002. UC/NET for Windows: Software for Social 
Network Analysis. Harvard: Analytic Technologies. 


Butts, Carter. 2006 “Exact Bounds for Degree Centralization.” Social Networks (Forthcoming). 


Castells, Manuel. 1996. 7he /nformation Age: Economy, Society and Culture, voll. The Rise of Network Society. 
Blackwell Publishers. 


Chase-Dunn, Christopher. 1998. G/oba/ Formation: Structures of the World Economy. Rowman & Littlefield. 


Chase-Dunn, Christopher, and Peter Grimes. 1995. “World Systems-Analysis.” Annual Review of Sociology 
21:387-417. 


Chase-Dunn, Christopher, and Richard Rubinson. 1979. “Toward a Structural Theory of the World-System.” 
Politics and Society 7(4):453-76. 


Delacroix, Jacques. 1977. “The Export of Raw Materials and Economic Growth: A Cross-National Study.” 
American Sociological Review 42:795-808. 


Dicken, Peter. 2008. Global Shift. Guilford Press. 


Durand, Edward D. 1953. “Country Classification.” /nternational Trade Statistics. Pp. 117-29. Roy George Douglas 
Allen and J.E. Ely, editors. John Wiley. ~ 


Edwards, Sebastian. 1997. “Openness, Productivity and Growth: What Do We Really Know?" Nationa/ Bureau 
of Economic Research Working Paper no. 5978 http://ideas.repec.org/p/nbr/nberwo/5978.html. 


Evans, Peter. 1979. Dependent Development. Princeton University Press. 
. 1995. Embedded Autonomy: States and Industrial Transformation. Princeton University Press 


Faust, Katherine. 1988. “Comparison of methods for positional analysis: Structural and general equivalences.” 
Social Networks 10:313-41. 


Firebaugh, Glenn. 2000. “The Trend in Between-Nation Income Inequality.” Annual Review of Sociology 
26:323-39. 


The Persistence of Structural Inequality? * 1887 
Firebaugh, Glenn, and Brad Bullock. 1986. “Levels of Processing of Exports: New Estimates for 73 Less- 
Developed Countries in 1970 and 1980. /nternational Studies Quarterly 30:33-50. 


Frank, Andre Gunder.1969. “The Sociology of Development and the Underdevelopment of Sociology.” 
Catalyst 3:20-73. 


Frobel, Folker, Jurgen Heinrichs and Otto Kreye. 1980. The New /nternational Division of Labor. Cambridge 
University Press. 


Galtung, Johan. 1971. “A Structural Theory of Imperialism.” Journal of Peace Research 2:81-117. 


Gereffi, Gary, John Humphrey, Raphael Kaplinsky and Timothy J. Sturgeon. 2001. “Introduction: 
Globalisation, Value Chains and Development.” /DS Bulletin 32(3):1-8. 


Gereffi, Gary, and Miguel Korzeniewicz, editors. 1994. Commodity Chains and Global Capitalism. Praeger Press. 


Gereffi, Gary, and Donald L. Wyman, editors. 1990. Manufacturing Miracles: Paths of Industrialization in Latin 
America and East Asia. Princeton University Press. 


Harris, Nigel. 1986. The End of the Third World: Newly Industrializing Countries and the Decline of an 
/deology. Penguin Books. 


Hirschman, Albert O. [1945] 1980. National Power and the Structure of Foreign Trade. University of California 
Press. 


Hirst, Paul, and Grahame Thompson. 1996. G/obalization in Question: The International Economy and the 
Possibilities of Governance. Blackwell. 


Haggard, Stephen. 1990. Pathways from the Periphery: The Politics of Growth in the Newly Industralizing 
Countries. Cornell University Press. 


Kentor, Jeffrey. 2000. Capital and Coercion: The Economic and Military Processes That Have Shaped the 
World Economy 1800-1990. Garland Publishing, Inc. 


Kick, Edward L., and Byron L. Davis. 2001. “World-System Structure and Change: An Analysis of Global 
Networks and Economic Growth Across Two Time Periods.” American Behavioral Scientist 
44(10):1567-78. 

Kick, Edward L., Byron L. Davis, M. Lehtinen and Thomas J. Burnes. 2000. “World-System Position, National 
Political Characteristics and Economic Development Outcomes.” Journal of Political and Military 
Sociology 28(1):131-55. 


Kim, Sangmoon, and Eui-Hang Shin. 2002. “A Longitudinal Analysis of Globalization and Regionalization in 
International Trade: A Social Network Approach.” Socia/ Forces 81(2):445-71. 


Krueger, Anne O. 1997. “Trade Policy and Economic Development: How We Learn.” American Economic 
Review 87:1-22. 


Mahutga, Matthew C. 2005. “Measuring Global and Actor Level Change in Core/Periphery Structures with 
Correspondence Analysis: An Application to International Trade.” Paper presented at the 25" annual 
Sunbelt Social Network Conference, Redondo Beach California. Feb. 19. 

Marshall, Alfred. 1923. Money, Credit and Commerce. London. 

Mittelman, James H. 2000 7he Globalization Syndrome. Princeton University Press. 


1996 Globalization: Critical Reflection. Lynne Rienner. 


Nemeth, Roger J., and David A. Smith. 1985. “International Trade and World-System Structure: A Multiple 
Network Analysis.” Review 8(4):517-60. 


1888 © Social Forces Volume 84, Number 4 * June 2006 


O'Hearn, Denis. 2001. The Atlantic Economy: Britain, the U.S. and Ireland. Manchester University Press. 
Portes, Alejandro. 1976. “On the Sociology of National Development." American Journal of Sociology 82:3-38. 


Ross, Robert J.S., and Kent C. Trachte. 1990. Global Capitalism The New Leviathan. State University of New 
York Press. 


Sailer, Lee Douglas. 1978. “Structural Equivalence: Meaning and Definition, Computation and Application.” 
Social Networks 1:73-90. 


Salisbury, Joseph G.T., and George A Barnett. 1999. “The World System of International Monetary Flows: A 
Network Analysis.” /nformation Society 15(1):31-49. 


Sassen, Saskia, editor. 2002. Global Networks, Linked Cities. Rutledge. 


Schott, Thomas. 1986. “Models of Dyadic and Individual Components of a Social Relation: Applications to 
International Trade.” Journal of Mathematical Sociology 12:225-49. 


Smith, David A. 1997. “Technology, Commodity Chains and Global Inequality: South Korea in the 1990s. 
Review of International Political Economy 4(4):734-62. 


Smith, David A., and Douglas R. White. 1992. “Structure and Dynamics of the Global Economy: Network 
Analysis of International Trade 1965-1980. Social Forces 70(4):857-93. 


Smith, David A., and Roger J. Nemeth. 1988. “Research Note: An Empirical Analysis of Commodity 
Exchange in the International Economy: 1965-1980.” /nternational Studies Quarterly 32:227-A0. 


Snyder, David, and Edward L. Kick. 1979. “Structural Position in the World System and Economic Growth, 
1955-1970: A Multiple-Network Analysis of Transnational Interactions.” American Journal of 
Sociology 84(5): 1096-126. 


Steiber, Steven. 1979. “The World System and World Trade: An Empirical Explanation of Conceptual 
Conflicts.” The Sociological Quarterly 20:23-36. 


Stokes, Randall, and David Jaffee. 1982. “The Export of Raw Materials and Export Growth.” American 
Sociological Review 8(4):517-60. 


Su, Tie Ting, and Dan Clawson. 1994. “Trade Networks, Trade Blocks, and Hegemonic Conflict.” Socio/ogical 
Inquiry 64(4):415-37. 


Taylor, Peter. 2001. “Recasting World-Systems Analysis: City Networks for Nation-States.” Globalization and 
World City Study Group and Network Research Bulletin No. 44. 


United Nations. 1986. Commodity Indexes for Standard International Trade Classification, Revision 3. 


Van Rossem, Ronan. 1996. “The World System Paradigm as General Theory of Development: A Cross- 
National Test.” American Sociological Review 61(3):508-27. 


Wade, Robert. 2004. “Is Globalization Reducing Poverty and Inequality?” World Development 32(4):567-89. 


Wallerstein, Immanuel. 1974. The Modern World System |: Capitalist Agriculture and the Origins of the 
European World-Economy in the Sixteenth Century. Academic Press. 


Wasserman, Stanley, and Katherine Faust. 1999. Social Network Analysis: Methods and Applications. 
Cambridge University Press. 


Weller, Susan, and Kimball Romney. 1990. “Metric Scaling: Correspondence Analysis.” A Sage University 
Paper Series: Quantitative Applications in the Social Sciences. Sage. 


White, Douglas R., and Karl P Reitz. 1985. “Measuring Role Distance: Structural, Regular and Relational 
Equivalence.” Unpublished manuscript. University of California at Irvine. 


The Persistence of Structural Inequality? + 1889 


Appendix: Inter-year Correlations of Rank Order Position and Correlations with Kentor (2000) 
for Available Years 


ES 


Rank Order Positions 

1965 1970 1980 1990 2000 
Rank Order 
Positions 
1965" 1.00" _ — = — 
1970 .969*** 1.00*** — — _ 
1980 .910*** oz, 1O0re. — — 
1990 .882*** oe .962*** 100s: _ 
2000 .856*** .889*** .928*** OT ila TOUS 
Kentor (2000) 
1970 — .866*** — _— — 
1990 = — — .943*** = 


wD < .001, two-tailed test 
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Transnationalism: Diaspora~-Homeland Development 


Rubin Patterson, University of Toledo 





Abstract 

Based on detected correlations between the strategic collaboration of U.S.-based diasporas 
and their respective ancestral homelands on the one hand and the socioeconomic and 
technological development of those homelands on the other, this paper, which provides a 
conceptual foundation of the correlation, attempts to ignite a new area of research on 
transnationalism and development in the global South. The conceptual foundation 
suggesting such an important correlation is ensconced in the theoretical contexts of world 
systems and racial formation theories. The hierarchically ranked status of a nation in some 
ways reflects the hierarchically ranked status of its diaspora in the United States. Strategic 
collaboration and brain circulation between the diaspora and the homeland can favorably 
affect the status of transnational communities, both within the United States and within 
the wider global system. 


Introduction 


Transnationalism' is an emergent field of study with a focus on citizens who, though migrating 
from poor to rich countries, manage to construct and nurture social fields that intimately link 
their respective homelands and their new diasporic locations. When citizens of the global 
South, with advanced human capital, migrate to a rich Western society such as the United 
States and they maintain strategic dialectical interplay between the old and new locations, 
brain circulation occurs. Conversely, when such talented citizens migrate to a rich society and 
are permanently uprooted from the homeland, without future involvement in its affairs, the 
latter suffers a brain drain. Those Southern nations and regions that support the emigration of 
large numbers of their nationals to the United States — and those diasporans who engage in 
the building of institutions, conducting transactions and generally influencing local and 
national events in their respective homelands — have a huge comparative advantage over 
those Southern nations and regions that fail to send their nationals to the United States or 
those diasporans who fail to nurture transnational social fields with the homeland. 

Transnationalism and brain circulation occur with several Asian nations, increasingly with 
some Latin nations, and not much at all with Sub-Saharan African nations. In addition to 
developing a conceptual foundation of strategically-oriented transnationalism and suggesting 
that it can accelerate and extend socioeconomic and technological development in the South, 
this paper explores reasons that partially account for the relative lack of such strategic 
transnationalism in Sub-Saharan Africa. Potential advantages stemming from African emigration 
to developed countries are so great that African governments should aid and abet such 
emigration. This proposition should not be considered against an ideal world wherein all African 
nationals can find productive work in their country of birth as they labor collectively with fellow 
citizens to build a prosperous society.’ Rather, the proposition should be considered against the 
real world in which African citizens are inhibited from freely contributing to national and 
professional development due to economic and political constraints. 
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Comparative analyses presented in this paper suggest that transnationalism is an efficient 
means of transferring knowledge, skills and wealth from core nations to those in the 
semiperiphery and the periphery. Various types of evidence exist to support the chief claim to 
knowledge presented in this paper. On an abstract level, a combination of diffusion and 
indigenously-generated expansion of technical knowledge and skills has characterized the 
history of advancement in human societies. Jared Diamond, in Guns, Germs, and Steel: A 
Short History of Everybody for the Last 1,300 Years, notes that agriculture, wheels, writing, 
metallurgy and other earlier watershed technological developments rarely occurred de novo. 
One way or another the technologies are diffused or transferred to other non-inventing 
societies. Knowledge and skills have always been imported from the outside, sometimes 
grudgingly and other times aggressively, sometimes with broadly favorable outcomes and at 
other times far less so. While the movement certainly is not linear, overall advancements in 
complexity and efficacy of technical knowledge and skills have been manifest. In addition to 
this sublime worldwide panoramic view of how knowledge and skills have progressed along 
with their impact on social groups across millennia, this paper reflects on more concrete 
historical records of specific countries over the past few decades. Many countries, from 
China to South Korea to India and others, have demonstrated that development of the 
homeland can be accelerated by making strategic use of their diasporas in North America and 
Western Europe. Where simply the United States is used here as a surrogate, both societies 
are major engines of technical knowledge and skills of the 20" and early 21* centuries. Sheffer 
(1996) notes that members of a prosperous diasporic community “can contribute more 
financial resources and that their access to powerful segments in the host country becomes 
easier... and the talent, acumen, and dedication that are needed for [homeland development] 
become more available.” (p. 61) 

Though this paper concludes with the qualified validation of brain circulation as part of a 
viable development strategy for Third World nations, it is not Pollyannaish about assured 
immediate disruptions or unassured automatic benefits for emigrant-sending countries. Such 
views are checked by observations such as the one by Randall Tobias, the current global AIDS 
coordinator in the Bush administration: “[T]here were more Ethiopian-trained doctors 
practicing in Chicago [alone] than in [all of] Ethiopia.” (New York Times 2004:A22) The 
immediate disruptions to Ethiopia's national development as a consequence of such 
emigration of technical talent are palpable. This paper argues that, as a discrete isolate, such 
emigration would be tantamount to brain draining. However, if it were part of a national, if not 
regional, strategic campaign in Africa for a “diaspora-homeland collaborative development” 
agenda of brain circulation, as documented with other transnational groups in this paper, the 
situation would likely be less unnerving. 

There are several ways to make this case for African transnationalism. One is the 
avoidance-of-the-negative approach while another is the pursuit-of-the-positive approach. The 
latter approach will be used in this paper. The former argues that emigration is inevitable in 
light of difficult and stultifying living conditions found in many African countries. Apraku (1991) 
makes this case by asking a set of poignant and obvious questions: “[D]oes Africa necessarily 
benefit from keeping its skilled manpower home, whether or not they engage in productive 
economic activity? Are the corrupt, unmotivated, and frustrated, yet highly trained segments 
of the African intelligentsia who are living off the government (mainly in the public sector) 
more beneficial to Africa than their emigrant colleagues who chose to migrate overseas for 
additional training and experience, who support their families and friends with remittances, 
and have become exposed to new technology, ideas, and perspectives?” (pp.57-58) A more 
instructive approach is the pursuit-of-the positive approach, which involves examining ways 
that countries experience socioeconomic development through diaspora-homeland 
collaboration. 
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Africa, of course, is a continent, not a country as in the other cases explored here to 
explain this study's proposition. Africa has 54 states and Sub-Saharan Africa has 48. This 
study focuses on the Sub-Saharan region since, by far, most African descendants in America 
regard that region as their ancestral homeland. Nevertheless, comparing Sub-Saharan Africa 
to the singular states of China, South Korea, India, Mexico and Israel is inconsistent on the 
surface. However, upon deeper inspection of the objectives and techniques of the study, the 
validity and utility of making such a comparison become evident. In evaluating the role 
diasporas have played in the development of China, South Korea, India, Mexico and Israel, 
we are analyzing the activities of a diasporic community on behalf of a given individual 
homeland state. Because of the unique make-up of the Sub-Saharan African diaspora in the 
United States, the vast majority of it is unable to point to a single state as its homeland. With 
most African Americans unable to identify a specific homeland state, their proclivity is to 
“adopt” all of the continent, or at least the Sub-Saharan area, as their homeland. In this sense, 
the African diaspora-homeland project is a sociological construct, unlike in the singular case 
state situations. Another analytical peculiarity with the African case is that each of the 48 Sub- 
Saharan states pursues what is perceived, or at least is argued, as its individual national 
interests at any given moment. These states would have to work together to experience the 
type of strategic impact wrought by the other transnational groups covered in this study. The 
individual states of the study are not burdened with this level of complexity. 

This paper will develop the conceptual model that countries that have more successfully 
developed socioeconomically and technologically over the past four decades have done more 
U.S.-based transnational lobbying, wealth repatriation and technology transfer on behalf of the 
homeland by U.S.-based diasporas. The corollary is that: those Southern nations that have 
done less U.S.-based transnational lobbying, wealth repatriation and technology transfer have 
developed less successfully socioeconomically and technologically. 


Theory and Method 


Theory 


Transnational communities that advance their ancestral homelands through strategic brain 
circulation are best examined in the context of the historical development of the modern 
world system and racialization. With regard to class and status, nations in the world and racial- 
ethnic groups in the world — a microcosm of which exists in the United States — are 
hierarchically ordered. World systems theory schematizes the hierarchy among nations as 
core, semiperiphery and periphery. Predominantly white, rich and militarily powerful Western 
nations are generally regarded as the core; newly industrialized Asian and Latin American 
nations as well as non-core Western nations constitute the semiperiphery; and most of Africa 
along with the poorest Latin American and South and Southeast Asian nations make up the 
periphery. The racial-ethnic hierarchy in America generally reflects this global hierarchy among 
nations (Henry 1999; Hewitt 2000). As Miri Song (2004) notes, “In the USA, there appears to 
be a fairly widespread view, both among many academics and the wider public, that white 
Americans are at the top of a racial hierarchy, African Americans at the bottom (with sporadic 
reference to Native Americans as an equally oppressed group), and groups such as Asian 
Americans and Latinos somewhere in between.” (p. 861) 

Bonilla-Silva (2004) further spells out this racial hierarchy in America in his prediction of a 
general three-category racial scheme: whites, honorary whites and collective blacks. He 
hypothesizes that “the white group will include ‘traditional whites, new ‘white’ immigrants and, 
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in the near future, totally assimilated white Latinos..., lighter-skinned multiracials...., and other 
sub-groups; the intermediate racial group or honorary whites will comprise most light-skinned 
Latinos..., Japanese Americans, Korean Americans, Asian Indians, Chinese Americans, 
Filippinos, and most Middle Eastern Americans; and finally, that the collective black group will 
include blacks, dark-skinned Latinos, Vietnamese, Cambodians and Laotians.” (pp. 932-3) 

World systems theory contends that nations possess changing levels of upward and 
downward mobility in the world economy. According to Wallerstein (1979); nations can 
advance from the periphery to the semiperiphery by (1) seizing opportunities, (2) accepting 
invitations from core nations, or (3) devising self-reliance. Periphery and semiperiphery 
nations, individually and collectively, face a cacophany of constraining dynamics operating to 
keep them in check and to serve the interest of core nations. As Oliver C. Cox (1964) argued 
in the classic Capitalism As A System, “although the social processes of the capitalist system 
may encourage imitation of the practices of the leading [core] societies, the system itself 
cannot accommodate indiscriminate advancement of peoples.” (p. 171) Notwithstanding the 
constraining forces against peripheral peoples in the narrow interests of core nations, the 
status of a given periphery or semiperiphery nation is not immutable. Thus, all nations have 
agency, though it is more developed and mobilized in some relative to others. 
Transnationalism is one agentic strategy of advancing a nation’s status from the periphery 
(Patterson 2005b). 

Development through strategic transnationalism is bidirectional. First, Southern-based 
racial-ethnic groups in the United States that are of the honorary white status tend to possess 
greater human, social and economic capital, which means they may have a greater means 
of assisting the homeland. Second, approaching the transnational community from Southern 
nations that are in the semiperiphery, they are in a better position to assist members of their 
ethnic kin in the United States, either through capital and connections or simply through 
“status capital.” In other words, American immigrants are both assimilated into their racial- 
ethnic group and are accorded the general status of the group. 

States are central to the processes of racial formation and hierarchy (Winant 2000), both 
intranationally and internationally. However, in the postmodern, Knowledge-Age of the 21* 
century, states and regimes use race-neutral language and policies that recreate racial and 
national hierarchies throughout the global politicoeconomic system. Doane (2003) concludes 
that “’color-blind’ ideology plays an important role in the maintenance of white hegemony. As 
an organized set of claims about race, color blindness rests on the seemingly unassailable 
moral foundations of equality, which is the basis for its political strength. What is overlooked 
— or deliberately masked — is the persistence of racial stratification and the ongoing role of 
social institutions in reproducing social inequality...” (p. 13) With color-blind policies in America 
today, school and residential segregation and white-black wealth gap are as great as they 
were during the days when raw Jim Crow laws and policies were conspicuously formulated 
and expressly implemented. A parallel exists at the international level. Wealth and technology 
inequality between the West and the South are greater today than in the early 1960s, the 
period when oppressive, racist colonialism was coming to an end. While being “race-neutral,” 
international financial and trade regimes that determine the global economy — such as the 
International Monetary Fund, the World Bank and the World Trade Organization — have served 
to reproduce a national hierarchy for the global system. Due primarily to this parallel, the 
hierarchical status of racial-ethnic groups in America tends to correspond.to the hierarchical 
status of nations in the global system. One may note the elevated status in past decades of 
South Korea-Korean Americans and in more recent years the status elevation of India-Asian 
Indian Americans. As Wallerstein (1991) argues, peoplehood is not “a primordial stable reality, 
but a complex, clay-like historical product of the capitalist world-economy through which the 
antagonistic forces struggle with each other.” (p. 85) 
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History shows that the Western countries’ effective political strategy of avoiding race in 
the principal international regimes they dominate has resulted in gains for these nations. Left 
to their own devices, transnational race-neutral economic agendas recreate Western 
dominance. As critical race theory may suggest, non-white (non-Western at the international 
level) peoples would have to mobilize their agency in part by avoiding race-neutrality and 
pursuing agendas that allow for more equitable disbursement of socially valued rewards. 
Strategic transnationalism of brain circulation on the part of Chinese, Koreans, Indians and 
more recently Mexicans is a form of politicoeconomic agency to stem the predominance of 
the West. Their individual transnational-level agency has been sufficiently powerful enough to 
overcome the impetus of global economic regimes against them. 

African peoples are arguably at the bottom of the global hierarchy — notwithstanding some 
historical and ongoing agentic efforts — due to greater constraints and insufficient agency thus 
far. Having blacks on the bottom of the hierarchy is not a product of historical accident. 
Instead, as Joe Feagin (2000) argues, it is the product of a long-running historical endeavor by 
whites to maintain racism most intensely trained against blacks. Horace Campbell (1994) is 
one of many who have studied efforts by the West to defeat pan-Africanism, from U.S. efforts 
in breaking the back of the Garvey Movement to France and England's relentless work in 
derailing mass-centered postcolonial struggles in Africa. After the African diaspora in the 
United States establishes a level of strategic transnational brain circulation commensurate 
with the constraining forces of reigning Western-dominated race-neutral regimes, African 
peoples too will likely have upward mobility in the world system (Patterson 2005a). 


Method 


A mixed system of most similar and dissimilar strategy is utilized in the study, which is 
regarded as a much more powerful analytic tool than either of the two independently 
(DeFelice 1986; Frendreis 1983). Countries and transnational groups with similar outcomes of 
homeland socioeconomic and technological development, and with comparable values of 
operative factors helping to generate those outcomes (i.e., diasporic influence on U.S. public 
policy, both foreign and domestic; technology transfer; and financial contributions), are 
compared conceptually to cases with categorically different outcomes of homeland 
socioeconomic and technological development, and with equally categorically different 
values of the same operative factors. The idea was to study a set of countries and associated 
transnationals with one level of outcome on a given variable (i.e., homeland development) — 
those with mere gradational differences — and compare them with another set of countries 
and associated transnationals with categorically different levels of outcomes on the same 
dependent variable, given varying inputs to the same ey variables (i.e., diaspora- 
homeland collaborative development). 

This paper posits, based on the study of similar and dissimilar trends over four decades, 
that African nations will experience more rapid and consequential development if there were 
more such strategic transnational brain circulation. Nominal comparisons are made to inform 
us of what explanatory variables roughly match outcomes (Mahoney 2000). Additional studies 
can move beyond the mere detection of operative variables in relation to outcomes by 
observing and measuring quantitative variations of the operative variables in relation to 
varying outcomes (Lijphart 1971). While this study has conducted some “pattern matching 
and process tracing,” clearly detailed case studies of each of the transnational groups and 
countries, and sub-regions (in the case of Sub-Saharan Africa) covered will provide additional 
insights into the specific mechanisms between the operative factors and outcomes. And 
when enough of these detailed case studies and small-V comparative analytic studies have 
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been conducted with concentrated attention on these variables, inferential statistical analysis 
will provide additional clarification. 

The first measure of strategic diaspora-homeland collaboration is the level of influence the 
diasporic community has with the host state on behalf of the homeland. Leaders of some 
diasporic communities work collaboratively with the homeland state in attempts to affect the 
host state’s domestic public policy (e.g., immigration) and its foreign public policy (e.g., 
foreign aid) in a manner perceived to be advantageous to the homeland. Another measure of 
this strategic collaboration is technology transfer. The type of technology transfer here 
involves diasporic technologists in rich, technologically-advanced countries investing in 
technology enterprises, contributing to research and development, and teaching science and 
engineering courses in universities, all in the homeland. Some homeland states work 
strategically with their technologically-talented nationals in rich counties to facilitate this 
technology transfer while other states do not. The final strategic collaboration measure is 
financial contribution. Aggregate financial contributions can be of two types, namely, 
remittances and investment. Remittances are financial contributions from workers in foreign 
countries to relatives and friends; such funds are mostly always from rich to poor countries. 
Seventy percent of the world’s $216 billion official international remittances in 2004 were from 
rich to poor countries (Ratha 2005). Investments by diasporans into their homelands can be 
critical. Such investments help to accelerate growth and upgrade the homeland economy. 


Diasporas 


Diaspora in this study — building on work by Safran (1991) — refers to a people dispersed from their 
original homeland, a people possessing a collective memory and myth about and sentimental 
and/or material links to that homeland, which fosters a sense of sympathy and solidarity with co- 
ethnic diasporans and with putative brethren in the ancestral homeland. As this definition implies, 
diasporic communities can be as concrete as individuals dispersed from, yet with tangible, on- 
going connections to, a given clan in a given village in a homeland or, conversely, as ideological as 
a construct or myth about a homeland in which specific individuals or even their parents or their 
grandparents have never visited, much less resided. The point is that a member of a transnational 
community who has never lived in the homeland can have an even greater sense of obligation 
to support the homeland as another member who was born there. 

Diaspora-homeland collaborative development is founded on sentimental and material 
arguments for a bounded ethnic group, both within the diaspora and between it and its 
homeland. The sentimental bonds among co-ethnics are supposed to be “a given” and 
emotionally coercive. Ineffable feelings of affinity toward fellow ethnic group members are 
frequently strong because personal and collective identities are often conflated in 
complicated ways. Personal identity is also undoubtedly linked with other identities, such as 
class, business associations and professional standing, but such linkages are less organically 
unified relative to the personal/collective identity-conflation. Personal identity is, after all, 
socially constructed and socially reinforced. Socialization around the indelible quality of 
“racially-based” ethnicity affects the core of one’s identity more than transitory identities. 
Ethnic group members are socialized around emotionally evocative issues such as their 
common ancestral heritage, fictive extended family and mutual mythological experiences. 
Conspicuous differences exist between ethnic groups in terms of maintaining boundaries, 
burnishing mutual mythological experiences and reinforcing solidarity; those who do more of 
these tend to be more unified than those who do less. 

With each ethnic group, personal and collective identity-conflation is wrapped up in a 
shared sense of vulnerabilities and anxieties concerning the political, social and economic 
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landscape. This matters because individuals are judged in part on their ethnicity. Sentimental 
pride and embarrassment in conjunction with material opportunities and constraints are all 
affected by a person's group membership as well as by the ranking of that group in society 
and indeed in the world. In a world where ethnicity matters, there is a material interest in 
having one’s ethnic group elevated in the world’s hierarchy. 


Development In, Through and By the Diaspora 


While global forces are influencing the size and political dynamism of multiethnic groups in 
America, these groups are simultaneously influencing American power, wealth and 
technology storehouse on behalf of their ancestral homelands. Esman (2000) succinctly 
explains the connections between diasporas and international relations below: 


The continuing links between diasporas and homelands can be 
politicized, and this is their major significance in the study of 
international relations. Diaspora solidarities can be mobilized and 
focused to influence political outcomes in the home country, to 
provide economic, diplomatic and even military assistance to the 
home country or to seek protection and help from its government. 
Likewise, the government of the home country may call on the 
diaspora community for economic or political support, and the host 
country’s government may attempt to use the diaspora community to 
promote its interests vis-a-vis the home country. With their variable 
capacities, opportunities and propensities to exert influence on behalf 
of their domestic or external interests, diaspora communities can be 
regarded as interest groups and as political actors. (p. 318) 


A threefold classification scheme is applicable when the linkages of diaspora and 
development are examined (Mohann 2000): 


° Development in the diaspora 
* Development through the diaspora 
¢ Development by the diaspora 


Development in the diaspora relates to the pooling of financial capital, intellectual capital and 
political capital by members of an ethnic community for the purpose of growing wealth and 
providing for a measure of security and independence in America. In the ancestral homeland 
in question, the state may lend support to members of the diaspora in the United States for 
two reasons: first, to help its nationals or ethnic kin succeed for their own good, and second, 
and probably more important to the state, to help its nationals become financially and 
politically powerful enough to contribute effectively and significantly to the development and 
security of the homeland. An example would be how the South Korean state aided its ethnic 
kin, primarily in California in the 1960s and 1970s. Today, South Korea is the world’s 11" largest 
exporter, exporting everything from automobiles to seafaring tankers and container ships (i.e., 
it is the world’s largest shipbuilding nation) to computer chips and countless other 
sophisticated industrial and telecommunications products. However, just a few short decades 
ago, a principal South Korean export product was wigs. The Korean Export Bank (KEB) helped 
wig manufactures in South Korea and South Korean immigrant wig wholesales and retailers 
in the United States with subsidized loans. The almost immediate result of this strategic 
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diaspora-homeland collaboration was a jump in the South Korean market share of wigs in the 
United States, from a mere 5 percent in 1965 to 89 percent in 1972. KEB even opened a BS 
branch in Los Angeles in 1967 to further assist with business lending to Korean immigrants 
(Chin, Yoon and Smith 1996). The immigrants gained in experience and confidence in the wig 
and other hair care and beauty supply businesses that made them more adept at penetrating 
numerous other markets, particularly in poor minority communities and beyond. 

As Choi (2003) points out, “ethnic Koreans have contributed to the development of the South 
Korean economy by transferring their knowledge and skills — which they obtained and 
strengthened in the more advanced countries of their residence — to their homeland.” (p.25) The 
South Korean state has long maintained a network of direct and indirect means of influencing 
the diaspora. Clearly, “the South Korean government has sponsored programs for professional 
emigrants seeking to build businesses abroad, with the expectation of direct financial benefits 
to Korea.” (Shain 1999: 170) The government also provided free language education for 
professionals, such as computer scientists, who were preparing to immigrate to the United 
States, Japan and other highly industrialized countries. For those planning to start a business 
abroad, the government provided loans up to $200,000 (Weiner 1995). 

Studies show that, wherever Koreans reside in the diaspora, trade between that country and 
South Korea is positively affected. In other words, the larger the Korean diaspora of a given 
country, in general, the greater the trade between the “host” country and South Korea tends to 
be (Choi). 

Development through the diaspora refers to development as a result of networking within 
and between diasporas of the same ethnic group in different parts of the world. A classic 
example of development through the diaspora involves overseas Chinese. Mainland and 
overseas Chinese as an economic bloc constitutes the third largest economy in the world. The 
Chinese economic bloc of PRC, Taiwan, Singapore and Hong Kong alone is the world’s largest 
exporter. Nearly 80 percent of overseas Chinese live in Southeast Asia. Their proportion of the 
total population in many of these Southeast Asian nations, however, is low. For example, the 
Chinese share of the populations of Indonesia, the Philippines and Thailand ranges from 3 
percent to 10 percent; however, overseas Chinese control more than 80 percent of the business 
equity in these countries (Cheong 2003). As for Singapore and Taiwan, some 80 percent of the 
populations are ethnic Chinese, and they also control the lion's share of the business equity in 
these two countries. As for overseas Chinese in the United States, they have particularly 
distinguished themselves in high-technology fields, and they have helped to transfer such 
technology to PRC and Taiwan. By 1998, 20 percent of startup high-tech companies in Silicon 
Valley were by ethnic Chinese. “The region's Chinese engineers constructed a vibrant two-way 
bridge connecting the technology communities in Silicon Valley and Taiwan...” (Saxenian 1999: 
53) The dialectical interplay in Chinese transnational communities is conspicuously successful. 

Since Deng Xioping opened up the country in 1978, overseas Chinese have been the 
principal investors in the PRC. Presently, China ranks second behind the United States in 
recipients of foreign direct investment. In 2003, total FDI into China was $57 billion (UNCTAD 
2004). Thus, in 1991, Lee Kuan Yew, the founding president of Singapore, and Deng collectively 
presided over the first World Chinese Entrepreneurs’ Convention (WCEC) (Cheong). By the sixth 
WCEC, ethnic Chinese were descending on Nanjing, the location of the gathering, from some 
77 diasporic Chinese communities around the world. At that meeting, where Beijing invested 
more than $1 billion to strengthen diaspora-homeland collaboration, it was resolved that the 
mainland will lead the global economic bloc of overseas Chinese (Cheong). 

There are important lessons for Africans and others to glean from these diaspora-homeland 
collaborative development experiences. Whereas the mainland is encouraging successful 
overseas Chinese to invest, if not repatriate, for purposes of ancestral homeland development 
and the pursuit of personal fortune,’ many African governments often look suspiciously at their 


-Transnationalism: Diaspora-Homeland Development * 1899 


fellow nationals who have gone away and returned successful. Take, for example, the 
Zimbabwean national, Strive Masiyiwa, a hugely successful telecommunications pioneer. 
Masiyiwa is the founder of Econet Wireless Communications out of Zimbabwe. Econet was 
Zimbabwe's first privately run telecom concern in the country, which became an overnight 
success (Patterson 2001). Despite the fact that Econet was on course to provide reliable and 
affordable telecom service to millions of citizens — something the government was unable to do 
— as well as attract new investment and technology into the country and provide additional 
employment, the government kept the company from persuing its business and servicing the 
population for more than five years. Masiyiwa, whose wealth is estimated to exceed $100 
million, runs a telecommunications empire that includes six cellular businesses and two satellite 
companies that span a number of countries including Great Britain, Morocco, Nigeria, Botswana 
and New Zealand. Whereas overseas Chinese who amass fortunes in the diaspora and are 
encouraged and assisted in investing in the future of the homeland, Masiyiwa, who fears for 
his life, lives in self-imposed exile in the Johannesburg metro area (Itano 2004). 

Development by the diaspora refers to the diaspora working chiefly if not exclusively in 
helping the homeland develop. Israel provides an instructive example of this class of diaspora 
and development. For many years now, Israel has been the annual recipient of more than $3 
billion of foreign economic and military aid from the US government's Agency for International 
Development (USAID). Ostensibly, USAID's mission is to work with and through Third World 
governments and nongovernmental organizations to help raise the living standards of some of 
the poorest people on the planet. Israel, however, ranks as the world’s 26th richest nation out of 
approximately 200. The $3 billion annual disbursement to Israel makes the country the number 
1 recipient of U.S. foreign aid. The population of Israel today is approximately six million. Since 
1962, Israel has received more foreign aid from the United States than all of Latin America, the 
Caribbean and Sub-Saharan Africa combined (USAID 2002); the combined population of these 
three areas is well over one billion. The complex organizational structure of private nonprofit and 
advocacy institutions working on behalf of Israel and Jewish Americans is very sophisticated 
and extremely well-run, not to mention exceedingly effective. There are numerous noteworthy 
institutions, but only two of the most effective will be referenced here. One is known as the 
Presidents’ Conference, comprised of presidents of many large Jewish organizations such as 
the World Jewish Congress, B'nai Brith and the World Jewish Committee. The Presidents’ 
Conference tends to work more closely with the executive branch of the U.S. government, while 
the second organization, the America-lsrael Public Affairs Committee (AIPAC), works more 
closely with Congress (Smith 2000). The Presidents’ Conference and AIPAC have been 
exceptionally deft at working with the Israelis to articulate the needs of Israel to Washington, 
raising and contributing funds to elect Congressional members who support Israel, and helping 
defeat those who do not. While African diasporans will never influence U.S. public policy, 
transfer cutting-edge technology or make financial contributions to their homelands on the scale 
of Jewish Americans, the point is that more progress can occur in these areas. 


Latecomers of Development In, Through and By the Diaspora 


India 


Indians, by far, have been the largest recipient of H-1B visas, which are a device used by the 
federal government in the interest of “assuring the maintenance of a workforce at the global 
competitive forefront.” The H-1B visa program allows foreigners to move to the United States 
for employment in designated critical areas of the economy. The cap was set at 65,000 by the 
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1990 Immigration Act; it was raised to 115,000 in fiscal year 1999 and to 195,000 between 
fiscal years 2000 and 2002, whereupon the cap would revert back to 65,000, which it did. 
Unions, the U.S. branch of the Institute of Electrical and Electronic Engineers (IEEE-U.S.) and 
others fought the increase while many industry executives and trade associations, such as 
the Business Software Alliance (BSA), Information Technology Association of America (ITAA) 
and the Electronic Association of America (EAA) — which represent companies including Cisco 
Systems, Intel, Motorola, Texas Instruments and Oracle, firms that employ H-1Bs — wanted 
still higher numbers of H-1Bs. Here we have a domestic public policy fight that is fraught with 
serious foreign implications, some that loom particularly large for the homelands of engaged 
U.S.-based diasporas. Countries that send large numbers of their nationals to the United 
States on H-1B visas have their development agendas bound up in this debate. Organized 
diasporas in collaboration with their respective homeland states enter into the political fray in 
support of EAA, ITAA, BSA and their member firms. Key to the diaspora-homeland 
collaborative successes of these ethnonationals is to align their project with other domestic 
forces (e.g., voting blocs and/or lobbying campaign of icons such as the blue chip tech firms 
cited above) and/or commonly expressed American interests and ideals. 

Once more, the Indian government - like that of the South Korean, Chinese, Taiwanese, 
Singaporean and other countries — has taken a proactive, comprehensive and strategic 
approach to its diaspora-homeland collaborative development effort, hence brain circulation. 
Note the observations of Myron Weiner (1990): 


The Indian government has made a major effort to reach out to the 
Indian community in the United States for support. It has sought to 
induce nonresident Indians (NRIs...) to deposit their savings in Indian 
banks, invest in Indian companies, and start their own businesses in 
India. The Indian finance minister has met with the New York-based 
NA! Club of North America to discuss creating an “India Fund” to help 
members invest in India, and there have been discussions of pressing 
the IRS to permit tax-sheltered IRA accounts to be used for 
investments in India. The Indian Embassy has also encouraged the 
Indian Community in the United States to actively support Indian 
foreign policy objectives in Washington. (p. 202) 


The Ministry of External Affairs of the Indian government has established a Special High 
Level Committee on the Indian Diaspora. The committee was created to study the problems, 
aspirations and attitudes of the Indian Diaspora, and to study the roles that Nonresident 
Indians (NRIs) and people of Indian origin could play in the economic, social and technological 
development of India (7he Hindu 2001). To this end, the Indian government has recently taken 
two concrete, proactive steps: (1) ruled to allow dual citizenship for NRIs and (2) established 
a new ministry for the singular purpose of serving the needs and eliciting the support of NRIs. 
These two demonstrable acts show how India has fully embraced the strategic homeland- 
diaspora development model. 


Mexico 


Sharp contrasts can be drawn between the above Asian nations and other nations seeking 
to develop. Mexico is one of those nations that, until recently, failed to aggressively pursue a _ 
strategic diaspora-homeland collaborative development agenda. State officials, non- 
government establishment figures and the Mexican diasporic elite in the United States failed 
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to collaborate until the late 1970s. Prior to that, Mexicans in the homeland, from government 
officials on down to the common person in the street, regularly viewed recent members of 
the diaspora in the United States essentially as “sellouts.” Matching that castigating view, 
many diaspora members looked despairingly and embarrassingly at Mexico for its continued 
impoverishment (Jones-Correa 1995-96). Mutually disdainful views on both sides had the 
effect of preventing a diaspora-homeland collaborative development agenda. 

The rapprochement finally occurred in the late 1970s. The government began to rethink 
Mexico's relationship with its diaspora, over 98 percent of whom reside in the United States 
(Gutierrez 1999). The emigration of Mexicans to join the ranks of the U.S. diaspora was being 
viewed by strategic thinkers as a resource gain rather than a resource drain. In 2002, at $9.9 
billion, Mexico was the second largest recipient of remittances in the developing world 
(Solimano 2003). By 2004, Mexican remittances, standing at $13.2 billion, placed Mexico in 
the number one spot ahead of India (Thouez 2005). Turning the brain drain into a brain 
circulation reflected both the growing political clout of Chicanos and the successful effort of 
Mexico City to breathe life into the transnational community. 

President Carlos Salinas de Gortari, a Harvard Ph.D. in political economy and government, 
was predisposed well before his election in 1988 to advance the collaborative development 
agenda much further and faster. President Salinas had the built-in cultural affinity for such an 
agenda, but even more important was his focus on the political utility of the diaspora to the 
north as well as the indispensable economic contributions for the southern part of the country's 
efforts to develop. By the time the first Bush presidency was winding down, the Mexican state 
and leading Mexican establishment thinkers were as eager for the swift passage of NAFTA as 
U.S. corporate advocates: “During the NAFTA debate, the Mexican government flew entire 
delegations of Mexican-American leaders to Mexico City for briefings, preparing them for their 
role as advocates for NAFTA.” (Jones-Correa: 59) As stated earlier in the paper, U.S. diasporas 
seeking to successfully influence foreign policy to benefit their respective homelands favorably 
attempt to unite in common cause through, at the very least, temporary marriages of 
convenience with domestic politically well-established groups, and/or align the homeland- 
interested public policy agenda with manifestly powerful American interests and ideals. 

The Salinas government was as strategic as it was determined in eliciting the support of 
the Mexican diaspora for the successful passage of fast-track authority for NAFTA. Leading 
diasporic figures were continually stroked. For example, “In 1991, Antonia Hernandez of the 
human rights protection group MALDEF (Mexican-American Legal Defense and Educational 
Fund) was awarded the Order of the Aztec Eagle, Mexico's highest honor given to a foreign 
citizen for his or her contribution to the advancement of Mexico or to the better understanding 
of Mexico abroad. In 1993, an Aztec Eagle medal went to Raul Yaguirre, president of the 
National Council of La Raza.” (Shain 1999:189) While these recognitions do not necessarily 
imply a quid-pro-quo for supporting the fast-track deal, they do illustrate how a homeland 
state can reach out to its diaspora in the United States as part of a strategic collaborative 
development agenda. Moreover, Mexico City’s influence, combined with leading Mexican- 
American organizations in conjunction with major U.S. manufacturing, trucking, textile and 
other industries, pressured the Congressional Hispanic Caucus so effectively that all but one 
of the Caucus members voted for NAFTA (Shain 1999-2000). 


The African Diaspora 
The schism between the two groups of the African diaspora in the United States - 


descendants of enslaved Africans and newer, voluntary immigrants — is knotty and hitherto 
largely politically debilitating, a relationship experienced by no other U.S.-based diaspora. 
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Stereotypes often prevent African immigrants and descendants from pursuing common 
ancestral homeland development agendas. Many African immigrants often see African 
Americans as too fixated on race, unwilling to accept their shortcomings regarding sacrifice 
and hard work, and having squandered tremendous opportunities for success in the United 
States. Conversely, many African descendents frequently see African immigrants as socially 
and culturally backwards, haughty, and unappreciative of the benefits they enjoy as a result of 
their ancestors’ sacrifices in this country (Authur 2002). 

Three black institutions purported to have some political clout are the Congressional Black 
Caucus, the NAACP and the Urban League.® Because the lives of large sections of the black 
community are encumbered by issues of institutional racism, poorly performing schools, 
inadequate affordable housing, urban violence and other domestic social ills, the lion's share 
of the attention and these institutions’ concerted campaigns have focused on issues within 
the United States. Conversely, new African immigrants tend to be organized in mutual aid 
societies, which span from facilitating acculturation into the American society to contributing 
to capital formation for entrepreneurial activities in the United States and/or back in specific 
African countries. Political organizations run by and chiefly concerned with descendants’ 
domestic issues regularly engage the levers of state power whereas new immigrant Africans, 
while significantly concerned about problems and prospects back in their respective 
homelands, are far less likely to engage the state or be party to elite mobilization in America 
about matters concerning them. 

Episodically, there have been issues around which the two black communities of the 
African diaspora in the United States unite. The Congressional Black Caucus emerged in 1971 
to provide a platform for blacks to engage the state on the issue of putting its might behind 
ending white minority dictatorship in Southern Africa. Representatives Charles Diggs, 
cofounder of the Congressional Black Caucus, and Randall Robinson, founder of TransAfrica, 
used the force of their institutions to help support revolutions in Southern Africa. Ron Walters 
(1987) trenchantly critiqued the politically-loaded rhetorical question posed by Martin Weil 
back in 1974: “Can the Blacks Do for Africa What the Jews Did for Israel?” However, Weil was 
correct by noting that, “To aid the revolution abroad, blacks must first join the establishment 
at home.” (p.86) This paper argues that some other U.S.-based diasporic groups have 
successfully applied Weil's dictum to provide their homelands with a comparative advantage 
in development agendas. 

Beyond the descendant-immigrant schism, a major impediment for an African-American 
diaspora-homeland collaborative development project, unlike successful diaspora-homeland 
projects, is that the African diaspora has lacked strategic backing of a major state or states in 
Africa to generate enthusiasm, focus and resource bases for some specific homeland 
development — which had been the case with South Korea and China for four decades and for 
two decades with India and Mexico. These may simply be manifestations of the newest 
historical forms of more intense efforts to keep blacks at the bottom of the hierarchy 
discussed by Golberg (2002), Feagin (2000) and Campbell (1994), among others. History, of 
course, is not fate; therefore, it is indeed possible for African peoples to mobilize their agency 
in the form of strategic transnational brain circulation like other transnational groups, though 
in their own syncretistic manner and at the commensurate level to overcome the “race- 
neutral” politicoeconomic forces presently constraining them. 


Sub-Saharan Africa 


In no way does this paper imply that Africa is completely devoid of progress on the fronts of 
strategic vision, specific programs and concrete practices to enhance country-specific or 
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continent-general diasporic-homeland collaborative development agendas. There are clear 
examples in Africa to which one can point for illustration. In his study, Apraku (1991), for one 
example, has found evidence of African governments’ efforts to lure their respective technical 
nationals into repatriation with subsidies for homes, cars, and professional equipment in the 
homeland country. Nevertheless, there is a chasm between state officials and strategic 
thinking establishment figures in African societies in comparison with other countries profiled 
in this study when it comes to mobilizing its U.S.-based diasporas. 

One of the most significant recent U.S. foreign policy debates toward Africa was the 
proposed African Growth and Opportunity Act (AGOA). AGOA effectively split members of the 
Congressional Black Caucus, 24 ultimately voted in support and 12 against (Walters 2004). The 
assenting side saw it as a feeble yet important start of new investment into Africa and new 
market opportunities in America for African exporters beyond primary products. Thus, they saw 
AGOA as a belated emulation of U.S. investment and market openings decades earlier that 
contributed to East Asia's starting out in lowly apparel and textile production and subsequently 
going through rapid industrialization. Conversely, the dissenters saw it as destructive for a 
continent already broken economically. In short, the dissenters largely agreed with 
Representative Jessie Jackson, Jr.'s characterization of it when he said that the initiative was 
so inimical to African interests until it may as well be labeled as the “Africa Recolonization Act.” 

Some have thought of AGOA as the “African NAFTA.” However, there was inadequate 
coordination between African capitals or with the former Organization for African Unity (OAU) 
and black political and Africa-focused leaders in the United States. Walters (2004) notes that, 
although groups like the Constituency for Africa and the African Diplomatic Corps in 
Washington did work some with African American leaders, their collective efforts at issue 
framing and collaborative agenda setting appear not to have been on the level of the Mexican 
NAFTA effort. The Mexican government regularly brought large delegations of Mexican 
American leaders to Mexico City for consultation and coordination to both help shape the 
drafting and passage of NAFTA. Such issue framing and collaborative development agenda 
setting were not emulated by diasporic leaders and leaders of African states, commercial 
enterprises and civil society organizations. 


Conclusion 


Using the approach of comparative analysis, this paper has investigated the importance of 
transnationalism as a means of advancing peoples, which comes in the form of homeland 
states working collaboratively with their respective diasporas in the United States to benefit 
the homeland. The significant finding is that when science and technology talented nationals 
from a Southern country emigrate to the United States, such emigration can be identified as 
either a brain drain — an adverse impact on the homeland - or a case of brain circulation — a 
socioeconomic and technology gain for the homeland. The factor that most determines the 
outcome is the nature of collaboration between the homeland and the U.S.-based diaspora. 
At one end of the spectrum there is no substantive coordination between sanctioned 
government offices or quasi-government institutions and leaders of organizations comprised 
chiefly of their diasporans. For that matter, there may even be a hostile relationship between 
the two. At the other end of the spectrum is institutionalized strategic collaboration between 
the two sides, resulting from a combination of a heightened sense of sympathy, solidarity, 
obligation and profit to be made. 

This study looked at homeland development as the outcome with strategic collaborative 
development between diasporic leaders and the homeland as the operative factor. Homeland 
development was conceptualized as technological and socioeconomic development (including 
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life expectancy, educational attainment and average income). Strategic collaboration between 
the diaspora and the homeland was taken as a measure of a diasporic community's favorable 
affect on the U.S. state; technology transfer; and financial contributions, including both 
remittances and investments. 

Unlike Israel, China, India and South Korea, Mexico had to overcome the mutually 
disparaging sentiments between the two sides. Since the rapprochement in the late 1970s, 
progress started slowly, then picked up steam during the period of the NAFTA debate, and is 
now gaining more ground around issues of permanent residency, amnesty and guest-worker 
programs. African nations are further behind in institutionalizing diaspora-homeland 
collaborative development agendas. In the past few decades, African diasporas and 
homelands have worked only episodically to promote development in the latter, but, again, 
such collaborative development agendas in Sub-Saharan Africa have never been 
institutionalized as in other nations. The challenge to institutionalizing diaspora-homeland 
collaborative development is exponentially more difficult because of the multiple states 
involved and the fact that Sub-Saharan African countries would be attempting this 
institutionalization in the early 21* century rather than in the mid- to late-20" century when the 
single-case states launched their collaborative campaigns and a historical moment perhaps 
more amenable to such development strategies existed. 


Notes 


1. For discussions on transnationalism, see Schiller and Basch (1995); Schiller, Basch and 
Blanc-Szanton 1992; Portes 1999; Kivisto 2001. 


2. Homing in on the nature of African coordination of its diasporized and domestic 
homeland-based peoples around technological development is not to ignore the AIDS 
pandemic, crippling illiteracy, grinding poverty, rampant corruption and other social ills 
across the continent. At the same time, solving those ills without promoting and 
strengthening the collaboration of technically-oriented diasporized and homeland-based 
Africans will also likely confound social and economic development. In other words, this 
paper in no way privileges the diaspora-homeland collaboration over other challenges. 
Instead, this paper merely seeks to draw attention to and conceptually clarify one huge 
challenge as well as opportunity that is typically ignored or glossed over in development 
scholarship. China, Israel, South Korea and India have witnessed considerable 
development in the past few decades; eliminate the U.S.-based diaspora-homeland 
collaborative development agenda and their socioeconomic and_ technological 
development would have been stymied or worse. 


3. Diaspora-homeland development in the case of Israel has been and continues to be 
uniquely crucial to its security and sheer survival, not just to socioeconomic and 
technological development. Without U.S. direct and indirect support of economic, military 
and intelligence assistance to Israel over its 54-year history, the nation would have long 
ceased to exist. Without the diasporic community's mobilization efforts and economic 
successes, U.S. state support and American wealth would not have been forthcoming to 
Israel. Moreover, no other diasporic community in the United States is likely to emulate 
this success due in part to Jewish Americans’ unrivaled breadth and depth of influence 
in positions of power in American society, relative to Third World diasporas. Nonetheless, 
the non-emulative Jewish model provides insight regarding possible collaborative projects 
for other transnational groups. 
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4. China, of course, also regularly surveils and in some cases detains some ethnic Chinese, 
especially those from the United States, for fear of seditious activity. Nevertheless, such 
fear may be more prevalent with African governments. 


5. There are real limits to this so-called political clout by black political organizations in the United 
States. One evident illustration of this limitation is President Bush's refusal to meet with the 
Congressional Black Caucus and to address the annual meeting of the NAACP in the past five 
years. A president's failure to provide such an audience and address Jewish political and 
advocacy leaders is almost unthinkable. While meetings can be construed as merely 
symbolic, these meetings are nonetheless reflections of the gravity of the seriousness with 
which issues and concerns of ethno-national groups are taken by the government. 
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When All Else Fails: 


International Adjudication of Human Rights Abuse Claims, 
1976-1999 


Wade M. Cole, Stanford University 





Abstract 

Although interest in the consolidation and expansion of the international human rights 
regime has grown in recent years, little attention is accorded to the formal procedures that 
allow individuals aggrieved by states to appeal directly to an international audience. Using 
data for 82 countries between 1976 and 1999, this article examines the political and cultural 
factors that produce cross-national variation in the propensity of individuals to file allegations 
of human rights abuse with the Human Rights Committee. Negative binomial and event 
history analyses indicate that the number and rate of human rights abuse claims (1) increase 
as a country’s human rights practices worsen; (2) decline as domestic “opportunity structures” 
become available; and (3) increase with the cultural empowerment of individuals. 


Between the idea 
And the reality 
Between the motion 
And the act 
Falls the Shadow 
—T.S. Eliot, 7he Hollow Men 


The idea that all individuals possess rights simply because they are human has gained 
widespread currency since World War Il, but the reality is that many states continue to neglect 
or abuse those rights. Given that rights increasingly originate outside the state, what 
measures are available to individuals aggrieved by state actions? This article focuses on one 
such means of recourse: the claims-making procedure established by the First Optional 
Protocol of the International Covenant on Civil and Political Rights, hereafter referred to as 
“Optional Protocol” or “Protocol.” By enabling individuals to issue formal complaints alleging 
state abuse, the Optional Protocol illuminates the long shadow that sometimes falls between 
the idea and the reality of human rights. 

Observers routinely acknowledge that the international human monte regime” (Krasner 
1982; Donnelly 1986), of which the Optional Protocol plays an integral part, infringes upon 
national sovereignty (Donnelly 1986; Sikkink 1993; Moravcsik 2000; Hathaway 2003).' Not all 
of the regime's elements, however, are equally invasive. Indeed, some countries’ 
endorsement of human rights is purely symbolic, confined only to the ratification of toothless 
declarations that allow governments to “decouple” rhetorical commitment from actual 
practice (Meyer, Boli, Thomas and Ramirez 1997). Other countries commit themselves more 
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extensively by joining well-monitored and strongly enforced treaties (Cole 2005). The Optional 
Protocol, as one of the most effectively monitored human rights treaties yet promulgated, 
"provides a genuine, if limited, instance of international monitoring, which in at least a few 
cases has altered state practice.” (Donnelly 1986:611) 

In recent years, a number of studies have investigated the emergence, expansion and 
penetration of the international human rights regime, with multilateral treaties garnering much 
of the attention. Researchers have explored both the “cause” and “effect” of treaty ratification: 
the factors that elevate or depress rates of treaty ratification among states (Cole 2005; Wotipka 
and Tsutsui 2001: Wotipka and Ramirez 2003), and the impact of treaty ratification on human 
rights practices (Hafner-Burton and Tsutsui 2005; Hathaway 2002). Much less attention has 
focused on the ways in which human rights treaties are implemented, although Boyle and 
Thompson's (2001) analysis of abuse cases adjudicated under the European Human Rights 
Convention is a notable exception. Boyle and Thompson (2001 :339) find that “domestic political 
organization and the manner in which domestic systems link to the international system” both 
account for cross-national variation in the number of claims brought before the European 
Commission on Human Rights. Countries with constitutionally “open” polities and numerous 
linkages to non-governmental organizations experience the greatest number of abuse claims, 
whereas state participation in intergovernmental organizations significantly reduces the number 
of claims. Boyle and Thompson (2001:341) surmise that “thelir] findings have implications 
outside of Europe,” and my analysis of communications filed under Optional Protocol provides 
a more comprehensive — that is, global rather than regional — study of human rights abuse 
claims. This permits us to evaluate the broader implications of Boyle and Thompson's findings.” 

In what follows, | describe the complaint procedure established by the Optional Protocol 
and discuss the political and cultural features of nation-states that motivate its use. | then 
advance hypotheses informed by social movement perspectives (McAdam, McCarthy and 
Zald 1996), sociological neo-institutionalism (Thomas, Meyer, Boli and Ramirez 1987; Meyer, 
Boli et al. 1997), and the clash of civilizations thesis (Huntington 1993a, 1993b, 1996), that 
seek to explain the number of communications filed and the rate at which they are filed. Using 
longitudinal data on 82 countries between 1976 and 1999, | evaluate these hypotheses with 
corresponding negative binomial and event history analyses. 


Enforcing Human Rights Under the Optional Protocol 


In 1948, the United Nations General Assembly unanimously approved the Universal 
Declaration of Human Rights, a definitive but legally ineffectual proclamation extolling the 
“inherent dignity” and “inalienable rights of all members of the human family.” The 
International Covenant on Economic, Social and Cultural Rights (ICESCR) and the International 
Covenant on Civil and Political Rights (ICCPR), adopted concurrently by the U.N. in 1966, give 
effect to the Universal Declaration. The ICESCR protects a variety of socioeconomic rights 
(e.g., the rights to unionize, enjoy a minimum standard of living and receive an education), and 
the ICCPR safeguards basic political freedoms and civil liberties (e.g., equal protection under 
the law, presumption of innocence, freedom of conscious and the right to assemble). Both 
covenants include routine procedures for monitoring compliance: the Committee on 
Economic, Social and Cultural Rights is charged with implementing the ICESCR, and the 
Human Rights Committee (HRC) monitors the ICCPR. 

Any state that ratifies the ICCPR may also join its First Optional Protocol, an auxiliary treaty 
that empowers the “[Human Rights] Committee to receive and consider communications from 
individuals subject to its jurisdiction who claim to be victims of a violation by that State Party 
of any of the rights set forth in the [ICCPR].” (Optional Protocol, Article 1) Figure 1 diagrams 
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the process. Article 2 of the Protocol stipulates that individuals aggrieved by a state party must 
first exhaust all possible domestic remedies (path 1) before petitioning the HRC (path 2), Once 
a communication is filed, the HRC determines its admissibility, assesses its merits and renders 
a decision (path 3). In addition to the domestic recourse provision, the HRC also considers the 
personal, territorial, temporal and procedural jurisdiction of a case when evaluating its 
admissibility (Joseph, Schultz and Castan 2004:22). First, claims must be filed on behalf of an 
actual and identifiable individual victim; “abstract” violations, claims filed on behalf of 
organizations, and anonymous petitions are inadmissible. Second, the claimant must have 
been subject to the offending state's jurisdiction when the suspected violation took place, 
although he or she need not be a citizen of that state. Third, the alleged offense must have 
been perpetrated after the state's ratification of the Protocol. Finally, the petition must not be 
adjudicated simultaneously by another international tribunal. 


Figure 1. The Optional Protocol “Boomerang” 
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After the HRC rules a claim admissible, it evaluates the claim's merits. At this stage, the HRC 
determines if a state’s actions against a petitioner violates its obligations under the ICCPR. When 
a decision is reached, the HRC issues its opinion, called a “view,” and delivers it to both the state 
party and the individual claimant. Although views are not legally binding, the ICCPR is. Because 
“the HRC is the pre-eminent interpreter of the ICCPR,” its “decisions are therefore strong indicators 
of legal obligations, so rejection of those decisions is good evidence of a State's bad faith attitude 
towards its ICCPR obligations.” (Joseph, Schultz and Castan 2004:4). Consequently, HRC decisions 
in favor of individual claimants bring direct moral pressure to bear on states that continue to violate 
tights protected by the ICCPR (path 4). Failure to implement the HRC’s recommendations also 
becomes public record, which can damage a state's reputation in international relations (path 5). 
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The Political (Endogenous) and Cultural (Exogenous) Determinants of Abuse 
Claims 


As depicted in Figure 1, the Optional Protocol institutionalizes what Keck and Sikkink (1998) 
call the “boomerang pattern,” the process whereby domestic actors bypass a repressive or 
unresponsive government by addressing their grievances to an international audience. My 
analysis directs attention to path 2 of this “boomerang” process. Under what conditions do 
individuals activate the complaint procedure of the Optional Protocol? 

In thinking about the endogenous political context from which individual communications 
to the HRC emerge, | draw on concepts developed by social movement theorists (McAdam, 
McCarthy and Zald 1996). To set the complaint process into motion, an individual first 
experiences a real or perceived grievance: in particular, a state party to the Optional Protocol is 
alleged to have violated its commitment under the ICCPR by abusing the individual's civil or 
political rights. As mandated by the Optional Protocol, the individual then directs his or her 
complaint to the offending state using all appropriate domestic channels. Whether the 
grievance is resolved at this stage depends largely on the structure of political opportunities 
(Kriesi et al. 1995; Tarrow 1998; Meyer 2004) within the state. States that are responsive to 
challengers — those with “open” (Kitschelt 1986) or “inclusive” (Kriesi 1995) opportunity 
structures — are more likely to settle allegations of human rights abuse internally than are states 
with “closed” or “exclusive” structures. Alternatively, individuals living under especially 
repressive regimes may be dissuaded from petitioning the HRC for fear of retribution. Should 
domestic channels of recourse fail, individuals may then file a communication with the HRC. 
Exhausting all domestic remedies that makes use of the Optional Protocol possible can be 
costly, however, as can the prosecution of claims internationally. Consequently, the amount of 
resources (McCarthy and Zald 1977) available to individual claimants also becomes important.’ 

The specific configuration of grievances, opportunities and resources within states can be 
thought of as political “push” factors that impel individuals to file human rights abuse claims. 
States are also embedded in wider cultural environments, both global and civilizational, that 
under some conditions “pull” individuals into submitting communications. Sociological neo- 
institutionalists, for example, see “modern individuals, organizations, and nation-states by their 
construction [as] deeply embedded in a wider world society, polity, and rationalistic culture.” 
(Meyer, Frank et al. 1997:628) This culture, which is now global in scope but evolved historically 
out of the “Western cultural account” (Meyer, Boli and Thomas 1987), is constitutive: it defines 
“actors” and specifies their identities, purposes and capacities (Jepperson and Swidler 1994; 
Meyer, Boli et al. 1997). Postwar world culture endows individuals with tremendous amounts of 
moral worth and agency (Meyer and Jepperson 2000). It also invests them with universal human 
rights, admonishes states to respect and protect those rights, and, when necessary, authorizes 
individuals to defend their rights against state infringement. 

World culture produces a great deal of ritualized structural isomorphism across national 
societies despite extensive political, cultural and economic disparities among them (Meyer, 
Boli et al. 1997). Civilizational cultures, on the other hand, differentiate communities of states 
based on persistent and frequently opposing beliefs and values (Huntington 1993a, 1993b, 
1996; Inglehart and Baker 2000). Although individualism and human rights masquerade as 
universal principles, they are in fact quintessentially Western constructs (Donnelly 1982; 
Howard and Donnelly 1986). Most non-Western societies (and, for that matter, pre-modern 
Western societies) subsume individuals within larger collectivities, allocating different rights 
to members of particular castes, religious communities, kinship groups or economic classes 
(Donnelly 2003). Human rights, however, are not contingent on group affiliations or even 
national citizenship; instead, they inhere equally in a// individuals. When non-Western societies 
adopt these values, they do so grudgingly and perfunctorily, and then only after a great deal 
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of prodding. “What is universalism to the West,” argues Huntington (1996:184), “is 
imperialism to the rest.” 

Contrary to Huntington's belief about the incommensurability of civilizations, Donnelly (2003) 
contends that an “overlapping consensus” exists worldwide with respect to human rights. In 
support of his claim, Donnelly points out that the Universal Declaration of Human Rights was 
adopted unanimously. Declarations, however, are not legally binding, which allows for a great 
deal of slippage between rhetorical and actual commitment. Donnelly (2003:97) concedes that 
“[dlifferences in implementations” exist, and the analyses reported below seek to account for 
those differences empirically. 


Research Design 


Dependent Variables 


My research analyzes the number of petitions filed under the Optional Protocol annually 
between 1976 and 1999 for 82 countries, producing a matrix of 1,041 country-year observations. 
Two dependent variables examine respectively how many petitions are filed and the rate at 
which they are filed. First, a count variable records the annual number of cases brought against 
each country under the Optional Protocol. Included are communications ultimately ruled 
inadmissible, as well as cases decided in favor of either the individual petitioner or the state 
party, as reported in Joseph, Schultz and Castan (2004:878-903). Second, | derive a truncated 
discrete indicator from the count variable set equal to “1” if a country has any communication 
brought against it during a given year, and “0” if no cases are filed. 

Filing a petition with the HRC is a relatively rare event, as evidenced in Table 1. Fully 33 
countries did not have any communications lodged against them during the observation period. 
Of those countries that were formally charged with violating their commitment under the ICCPR, 
comparatively few — 10.2 percent of country-year observations — had multiple claims filed 
against them in a given year. 

Figure 2 displays the annual number of communications filed with the HRC, aggregated 
across countries, from the time the Optional Protocol entered into force in 1976 through 2000. 
The bars, charted along the primary y axis, represent the number of communications submitted 
under the Optional Protocol each year; the line, plotted against the secondary y axis, gives the 
cumulative number of Optional Protocol ratifications. The number of state parties grew steadily 
until the early 1990s, when the fall of communism produced a flurry of newly independent 
states that subsequently ratified the Protocol. (Appendix A provides a list of all ratifiers.) Between 
1977 and 1986, the HRC received an average of 14 communications each year. The number 
increased precipitously in 1987, reaching an apex of 63 in 1988. Thereafter, the number of 
communications averaged about 40 per year, with another large spike in 1997.° 

Figure 3 charts the number of communications filed against each country through 1999, 
with the total number of years for which the Protocol was in force during the observation 
period given next to the country’s name. Communications may only be brought against a 
country while its ratification of the Protocol is in force, so countries that ratify the Protocol at 
the same time should, all else being equal, experience the same number of claims. This 
clearly is not the case (all else, after all, is rarely equal), and the trends are somewhat startling. 
The largest number of complaints (n = 150) have emanated from Jamaica, prompting it to 
denounce the Optional Protocol in 1997. Uruguay is also responsible for a sizable number of 
communications, due in large part to the military government's use of torture, arbitrary 
imprisonment and “disappearances” during the 1970s and early 1980s. Several well- 
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Table 1: Frequency Distribution of Petitions Filed under the Optional Protocol, 1976-1999 


SS 





Petitions Cumulative 
per Year Frequency Percent Percent 
0 925 77.34 77.34 
| 149 12.46 . 89.80 
2 56 4.68 94.48 
3 23 1.92 96.40 
4 11 92 97.32 
5 11 .92 98.24 
6 7 109 98.83 
7 2 Hy 99.00 
8 1 08 99.08 
9 1 08 99.16 
10 1 .08 99.25 
11 1 .08 99.33 
1 o eo 99.58 
14 1 .08 99.67 
16 1 .08 99.75 
19 1 .08 99.83 
25 1 .08 99.92 
30 1 .08 100.00 





Note: “Frequency” gives the number of country-year observations. 


established democratic countries, including the Netherlands, Canada, France, Australia and 
Finland, also had above-average numbers of communications filed against them during the 
period. Based on this evidence alone, it is difficult to decipher patterns that might suggest 
when a country is most susceptible to claims of abuse. Distinct patterns begin to emerge only 
with multivariate analyses. | turn next to a description of the variables used in these analyses, 
and consider their expected effects on the number and rate of petitions filed under the 
Optional Protocol. 


Hypotheses and Explanatory Variables 


| am interested in the properties of states that prompt individuals to file-claims of human 
rights abuse. Explanatory variables are measured at yearly intervals for all state parties to the 
Optional Protocol for which data are available.® | first consider the effect of a state’s internal 
political climate on the number and rate of human rights abuse claims filed against it, focusing 
especially on a state’s human rights practices, its commitment to democracy and regime 
changes. 
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Human Rights Practices 

| offer two competing hypotheses for the effect of a state’s human rights practices on 
claims-making. One might expect, ceteris paribus, that more abuse produces more claims. 
Alternatively, severe human rights violations may reduce the number and rate of claims, if 
individuals fear state reprisal for petitioning the HRC. 


Hypothesis 1a (and 1b): The greater the level of human rights 
abuses in a country, the greater (lower) are the number and rate of 
claims brought against it under the Optional Protocol. 


A measure of human rights practices is gleaned from annual reports issued by the Bureau of 
Democracy, Human Rights, and Labor at the U.S. Department of State (see Poe and Tate 
1994; Boyle and Thompson 2001; Hafner-Burton and Tsutsui 2005). The variable ranges from 
1 to 5 onan ordinal scale, with higher scores indicating better human rights practices.’ 


Democracy 

The HRC is an arbiter of last resort. For a communication to be admissible, the petitioner must 
first have exhausted all available means of recourse domestically. In general, democratic 
countries have more — and more effective — mechanisms for redressing individual grievances 
than do non-democratic countries, thus obviating the need for international enforcement of 
the ICCPR. The availability of political opportunity structures within democratic states (Kriesi 
1995; Kitschelt 1986) renders the use of transnational opportunity structures (Smith 1995) 
unnecessary, which consequently reduces the number of claims filed. Democracy should also 
delay the submission of communications that eventually do make it to the HRC, as individuals 
must first prosecute their claims through the many channels made available to them by the 
state. 


Hypothesis 2a: Democratic countries produce fewer communications 
than autocratic countries do, and the rate at which claims are filed is 
lower for democratic than it is for autocratic countries. 


Another expectation, in line with theories predicting an inverse curvilinear relationship 
between mobilization and repression (e.g., Kriesi et al. 1995), is that claims-making increases 
as repression declines until reaching a threshold, at which point “more” democracy 
produces fewer claims. Hence, the greatest number and rate of claims would originate from 
moderately autocratic or weakly democratic countries in which questionable human rights 
practices exist, but where levels of repression are not so high as to discourage individuals 
from submitting petitions with the HRC. In highly autocratic countries, state repression may 
deter individuals from filing claims; in strongly democratic countries, rates of abuse, and 
hence abuse claims, are presumably lower. 


Hypothesis 2b: Moderately autocratic or weakly democratic regimes 
produce more communications than either strongly democratic or 
highly autocratic countries do. 


The extent of a country’s commitment to democracy is measured with the Polity /V 
composite institutionalized polity score (Marshall and Jaggers 2002), with values ranging 
from — 10 (extreme autocracy) to 10 (full democracy). To evaluate the hypothesized 
curvilinear relationship, | include a second-order term. 
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Regime Changes 

In addition to a country's level of democracy during a given year, annual changes in its polity 
score may affect the number and rate of petitions filed. Although regime liberalization — 
indicated by positive change scores — typically enfranchises citizens, civil and political rights 
often remain tenuous immediately following democratic transitions. The persistence of 
dubious human rights practices, coupled with the increased sense of individuals’ political 
empowerment that accompanies democratic transitions, produces an environment ripe for 
abuse claims. On the other hand, one could reasonably anticipate that regime changes in the 
opposite direction (i.e., negative values indicating “illiberal” changes) would also produce 
more claims. 


Hypotheses 3a (and 3b): The number and rate of petitions increases 
immediately following liberal (illiberal) regime changes. 


The direction and magnitude of regime changes are estimated by calculating annual 
deviations in a country's institutionalized polity score. 


International Organizations 

In addition to the political climate of states, the extent to which countries and their citizens 
participate in the global polity is also expected to influence the number and rate of claims 
filed. World-cultural frameworks are exogenous to the states and individuals they constitute, 
and are therefore “constant” at any one point in time. What varies is the strength of a 
country’s linkage to, or the degree of individuals’ participation in, the organizational carriers of 
world culture. International non-governmental (INGO) and governmental (IGO) organizations 
tap different dimensions of a state's embeddedness in the world polity. INGOs constitute a 
global “civil society” composed of individuals, experts, professionals and associations. They 
embody and disseminate world-cultural principles — including universalism, individualism and 
world citizenship (Boli and Thomas 1997) — that combine to produce highly purposive 
members who are not only informed about their human rights, but are also culturally 
authorized as “small gods” (Meyer 2000) to defend them. Participation in INGOs also makes 
individuals adept at negotiating world society more generally, which should expedite the 
complaint process once the decision is made to file a claim. 


Hypothesis 4: As the number of INGO. memberships in @ country 
increases, the number and rate of petitions also increases. 


Whereas INGOs organize global civil society, |GOs, which are composed chiefly of states, 
undergird a state-centric world polity. Not surprisingly, then, IGOs tend to privilege state 
interests. Because the Optional Protocol's complaint procedure directly challenges internal 
sovereignty by subjecting a state's actions to external evaluation, participation in |GOs might 
be expected to discourage the submission of claims to the HRC. In the European case, Boyle 
and Thompson (2001:330) also suggest that |GOs “provide formal alternatives to claims- 
making” and thereby “channel, and even redirect, dissent” in ways that reduce the number of 
claims filed under human rights treaty regimes. 


Hypothesis 5: As the number of IGOs to which a country belongs 
increases, the number and rate of petitions decreases. 


Data for INGOs and IGOs come from the Yearbook of International Organizations (UIA, various 
years), which reports memberships in 1976, 1978, and annually from 1982 onward. Whenever 
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possible, missing data for the years 1977, 1979, 1980 and 1981 were replaced with the 
number of memberships for 1978. The civil society measure sums all INGOs per country that 
count at least one national as a member, and is logged (after adding a constant of “1”) to 
reduce extreme skew. The world polity measure records the untransformed number of IGOs 
to which a state belongs each year. 


World Human Rights Conference 

Elaborately staged international conferences heighten awareness about, and mobilize people 
and resources around, highly legitimated world-cultural principles. In 1993, the World Human 
Rights Conference at Vienna, the only global human rights conference convened since the 
Optional Protocol entered into force, drew the world’s attention to human_ rights.® 
Approximately 7,000 individuals representing 171 countries and more than 800 non- 
governmental organizations attended. 


Hypothesis 6: The Vienna Conference is associated with an increase 
in the number and rate of petitions. 


This hypothesis is evaluated with a dummy variable set equal to "1" for 1993 and the period 
directly following it (1994 and 1995). 


Civilizational Membership 

Turning to the “expressive” rather than “constitutive” dimensions of culture, one might expect 
that because Western culture values individualism so highly, it would promote use of the 
Optional Protocol's complaint procedure more frequently than non-Western cultures do. 


Hypothesis 7: The number and rate of human rights abuse claims 
emanating from Western states is greater than from non-Western 
countries. 


Huntington (1996) carves the world into nine civilizations, but only seven are represented 
among parties to the Optional Protocol: African, Hindu, Islamic, Latin American, Sinic, Slavic- 
Orthodox and Western.’ Each state's primary civilizational affiliation is identified by Russett, 
Oneal and Cox (2000) and Henderson and Tucker (2001). Analyses compare the number and 
rate of claims filed against states belonging to each non-Western civilization relative to the 
West. Also taken into account is Huntington's “West vs. the rest” thesis, which suggests that 
the most salient civilizational cleavage is between Western and non-Western countries. 


Diffusion Effects 

One might also expect that claims-making simply feeds on itself over time — that claims filed 
at time #1 inspire individuals to file more claims at time ¢. To that end, analyses consider the 
impact of national, civilizational and global diffusion processes on the number and rate of 
abuse claims filed. Prior research has demonstrated that practices diffuse most readily among 
geographically proximate actors (e.g., Knoke 1982; Land, Deane and Blau 1991; Ramirez, 
Soysal and Shanahan 1997). Because the diffusion of claims-making seems contingent on 
an individual's awareness of and identification with previous claimants, | expect its influence 
to be strongest among nationals. Diffusion within states is further enhanced by the claimants’ 
shared political environment. The visibility of claims is reduced at the level of civilizations, but 
perceived cultural similarities may nevertheless foster imitation of those claims that do 
become publicized. Finally, global diffusion is facilitated by the isomorphic forces at work in 
the world polity. According to Strang and Meyer (1994: 103), cross-national “diffusion within 
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the world system... seems grounded in the way contemporary nation-states are culturally 
constructed as formally equivalent.” The same is true, mutatis mutandis, of individuals: like 
nation-states, world culture constitutes formally equal individuals and endows them with 
similar competencies and rights vis-a-vis states. This allows an individual's allegation of abuse 
in one part of the world to serve as a model for subsequent claims in another region. 


Hypothesis 8: The number and rate of human rights abuse claims 
increase as the number of claims filed during the previous year 
increases, but with diminishing effects for national, civilizational and 
global diffusion. 


To examine diffusion effects, three variables measure the number of communications filed 
under the Optional Protocol during the previous year, aggregated to the level of nation-states, 
civilizations and the world. 


Control Variables ; 

Finally, | control for the confounding effects of economic development and population size by 
including measures of gross domestic product per capita (World Bank 2002) and national 
population (Banks 1999). There are several reasons to suspect that more claims originate 
from wealthy than from poor countries. Although the HRC has determined that individuals 
need not exhaust domestic remedies that are prohibitively expensive before filing a petition 
under the Optional Protocol (Joseph, Schultz and Castan 2004:126-28), resources are 
nevertheless required to prosecute claims. Individuals in poor countries may also be more 
concerned with securing fundamental economic or social rights rather than the civil and 
political rights protected by the Optional Protocol (Smith 1995). The citizens of wealthy 
countries, whose basic needs for survival have been met, are presumably more likely to 
advance claims rooted in “post-materialist” values (Inglehart 1990). Furthermore, the 
empirically high correlation between democracy and economic development (Lipset 1959; 
Bollen and Jackman 1985; Burkhart and Lewis-Beck 1994) suggests an inverse relationship 
between per capita GDP and the number of abuse claims. By including a measure of 
population, | control for the simple fact that countries with more individuals are at increased 
risk of having claims filed against them. Both variables are logged to reduce skew. Appendix 
B reports descriptive statistics for, and correlations among, all variables used in analyses. 


Estimation Techniques: Modeling Counts and Rates 
Only those countries “at risk” of having a case brought against them under the Optional 
Protocol enter the analyses. A country joins the risk set the year its ratification of the Optional 
Protocol becomes effective, and exits only if it formally withdraws from the Protocol. To date, 
only three countries — Guyana, Trinidad and Tobago, and Jamaica — have renounced the 
Protocol.10 

Two sets of analyses are conducted, one corresponding to each dependent variable. The 
annual number of petitions filed against each country exhibits three properties that dictate 
choice of modeling strategy: it is an overdispersed, longitudinal count variable. Approaches 
based on the Poisson distribution are appropriate for analyzing count data, but because the 
variance of the number of petitions (S.D. = 1.88) exceeds the mean (.56), the negative binomial 
regression model is favored. The longitudinal format of the data further complicates the 
methodological design by violating the independence assumption of conventional negative 
binomial models (Long 1997). Problems with autocorrelation and heteroscedasticity result, 
producing spuriously low standard error estimates. Random-effects models for cross-sectional 
time-series data have been developed to account for the non-independence of events. | use a 
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random- rather than fixed-effects design because the parameters of time-invariant covariates, 
such as the civilizational indicators, are difficult to estimate with a fixed-effect approach, and 
because countries with all “0” outcomes — countries without any petitions filed against them 
during the observation period — are dropped from fixed-effects analyses, resulting in the 
attrition of 33 countries. Random-effects models retain these countries in analyses. Negative 
binomial models are estimated using the “xtnbreg” function in STATA 8.0, with standard error 
estimates adjusted for clustering within countries (StataCorp 2003:141-49). 

| model the discrete dependent variable with an event history framework (Allison 1984; 
Tuma and Hannan 1984) describing the rate at which a country has at least one claim filed 
against it during a given year. | use the exponential (“constant-rate”) specification, which 
assumes that the rate of filing petitions is conditional on changes in the values of covariates 
over time, but is not a function of historical time per se. Because filing a petition under the 
Optional Protocol is a repeatable event, countries remain in the analysis until they renounce 
the Protocol or until the end of the observation period in 1999. | use robust standard error 
estimates to account for the clustering of petitions within countries. 


Results 


Table 2 presents results from negative binomial regression analyses of petition counts 
(Models 1-5), and Table 3 reports event history analyses of the rate at which petitions are filed 
(Models 6-10). Models 1 and 6 adduce the effect of a country’s membership in each of seven 
non-Western civilizations on claims-making relative to the West, net of political, world-cultural 
and control variables. Models 2 and 7 evaluate the “West vs. the rest” thesis by repiacing the 
non-Western indicators with a single Western civilization dummy. To analyze the impact of 
diffusion at different levels of analysis, Models 3-5 and 8-10 iteratively enter lagged petition 
counts within states, civilizations and the world. Event history analyses also include an 
additional control variable set equal to “1” when multiple communications are filed against a 
country in a given year, and “0” when only one or no communications are filed. 

Raw coefficient estimates reported in Tables 2 and 3 give the direction and statistical 
significance of effects, but are otherwise difficult to interpret. The antilog of a parameter 
yields the net effect of a one-unit change in its corresponding covariate on the predicted 
number of petitions brought against a country (for negative binomial analyses reported in 
Table 2) or the rate at which petitions are filed (for event history analyses reported in Table 3). 

First, consider the political conditions giving rise to communications filed under the 
Optional Protocol. Democratic countries, and countries with favorable human rights records, 
have significantly fewer claims brought against them than countries with autocratic or 
repressive regimes do (see Models 1-5). For instance, results from Model 1 suggest that a 
one-unit increase in the autocracy/democracy scale reduces the annual number of claims filed 
against a country by an estimated 8.6 percent (= 100[exp(-.09) — 1]), while a one-unit 
improvement in a country’s human rights practices decreases the predicted number of claims 
by 24 percent (= 100[exp(-.27) - 1]). Likewise, political democracy and liberal human rights 
practices consistently reduce the rate at which claims are filed (Models 6-10). This evidence 
is consistent with Hypotheses 1a and 2a, which predict fewer claims for democratic countries 
and regimes with favorable human rights practices. Conversely, Hypothesis 2b, positing an 
inverse curvilinear effect of a country’s commitment to democracy on the number and rate of 
Claims, is clearly refuted. The statistically insignificant squared polity term in Models 1-5 
indicates that the effect of autocracy/democracy on the number of claims is linear. The effect 
of democracy on the rate of claims-making (Table 3) is curvilinear, but in the unanticipated 
direction: the negative linear and positive quadratic coefficients suggest a U-shaped effect in 
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Table 2: Cross-sectional Time-series Negative Binomial Analyses of the Volume of Communications 
- Filed under the Optional Protocol, 1976-1999 


ee A Te ek ee 
Variables Model 1 Model2 Model3  Model4 Model 5 


Control Variables 

GDP per capita (logged) 21 cto 23 ee et 
(.18) (.16) (.17) (.18) (.18) 

Population (logged) 16 14 19 18 16 

(.13) (.13) (.12) (.13) (.13) 

Political Context 

Human Rights Practices - 27" - 28° - 26 Brot, - 27" 
chi (.10) (.10) (.10) (.11) 

Autocracy/Democracy - 09 - 097 - 09" - 097 - 09" 
(.02) (.02) (.02) (.02) (.02) 

Autocracy/Democracy? .002 .003 .002 .002 .002 
(.004) (.004) (.004) (.004) (.004) 

Polity Change Score (t - 1) .04 .04 .04 .04 .04 
(.04) (.04) (.04) (.04) (.04) 

World-Cultural Context 

INGO Memberships (logged) 66" fie Al 58 68 
(.24) (.23) (.23) (.26) (.27) 

IGO Memberships - .008 - .008 - .003 - 006 - 008 
(.009) (.008) (.009) (.009) (.009) 

World Human Rights Conference  -.17 - 18 - 15 - 19 -.17 


(.16) (.16) (.16) (.16) Bee (15) 
Civilizational Context 


African —- 90" — - .76 - 84 - 89 
(.53) (.52) (.54) (.53) 
Hindus - .13 — - .03 - 05 - 13 
(1.09) | (1.03) (1.09) (1.09) 
Islamic —- 3.91 — - 3.72 - 3.84 3.90 
(1.17). (1.15) (1.17) (1.17) 
Latin American - 1.21 = - 1.00 - 1.14 1.21 
(.35) (.34) (.37) (.35) 
Sinic - 1.07 — - 88 - -99 - 1.07 
(1.14) (1.05) (1.14) (1.14) 
Slavic-Orthodox - 1.75 ae 1.48 - 1.65 Lo 
(.63) z (.60) (.65) (.63) 
Western 1.36 
(.33) 
Lagged Diffusion Effects e 
Cases per Country (t - 1) (oH) 
Cases per Civilization (t - 1) .004 
(.006) 
Total Cases, World (t - 1) - 001 
(004) 
Intercept AZ nn 710. - 6.30" - 6.28" ee 
Wald ? (vs. Null) 88.05" 85.05 123.86 89.36 87.96 
Df 15 10 16 16 16 


Notes: N of communications = 666; N of countries = 82; N of country-year spells = 1,041 
Standard error estimates, in parentheses, are adjusted for clustering within countries. 
tp<.10 “p<.05 “p<.01 ™ p< .001 (two-tailed tests) 
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Table 3: Exponential Event History Analyses of the Rate of Communications Filed, 1976-1999 


Se eee 
Variables —Sss————SSCSCSC*C‘Modde 6 Model 7__ Model 8 _Model9__ Model 10_ 
Control Variables 
GDP per capita (logged) - 01 - .001 - 002 - 02 - .03 
(.10) .09 10 (.10) (.10) 
Population (logged) 08 08 .09 07 07 
(o7), (07), (07), 07), 07), 
Multiple Claims Filed 4.51 1.52 1.44 1.51 1,51 
(.16) (.16) (.17) (.16) (.16) 
Political Context f 2 : ; ; 
Human Rights Practices - 22 - 24 -.21 - .22 - 22 
(.09) (.09) (.09) (09) (.09) 
Autocracy/Democracy - 04 - 04 - 04 - .04 - 04 
(.01) (01) (.01) (01) (01) 
Autocracy/Democracy” 006 007 006" 006 006" 
(.003) (.003) (.003) (.003) (.003) 
Polity Change Score (t - 1) 06" 06" 06" 06" 06" 
(.03) (03) (.03) (.03) (.03) 
World-Cultural Context a : " 4 “f 
INGO Memberships (logged) nO0 ot oo 40 At 
(.12) (.12) (.12) (.14) (.14) 
|GO Memberships - 003 - 005 - 003 - 004 .004 
(.005) (.005) (.005) (.005) (.005) 
World Human Rights Conference - .04 - 07 - 05 - 03 - 04 
(.12) (.12) (.12) (.12) (.12) 
Civilizational Context 
African - .50 — - 44 - 59 48 
(.37) (.37) (.38) (.37) 
Hindu -.43 mg - 36 - 54 - 43 
(91) (.92) (91) (.91) 
Islamic - 2.94 = 2.89 - 3.03" 2.90” 
(.93) (.92) (.92) (.91) 
Latin American - .54 = - 54 - 61° - 55° 
(.18) (.19) (.20) (.18) 
Sinic -.11 ss -.12 -.21 -.10 
(.19) (.18) (.20) (.19) 
Slavic-Orthodox - .98 a - 93 1.08" - .97 
(40) PD 2 (40 gpsggyfel40 yn earns 40} 
Western 61 
(.19) 
Lagged Diffusion Effects 
Cases per Country (t - 1) 02t 
01 
Cases per Civilization (t - 1) en - 004 
.004 
Total Cases, World (t - 1) - 003 
wee wet 003 
Intercept -4.14 - 4.94 CAAT Vy she sang, 
Wald x? (vs. Null) 287.59" 371.12" 418.94" 323.44" 279.44" 
df 16 11 Av 17 17 


Notes: N of “failures” = 257; N of countries = 82; N of country-year spells = 1,041. Robust 
standard error estimates in parentheses. 


'p<.10 "p<.05 “p<.01 ™ p <.001 (two-tailed tests) 
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which the estimated rate of claims is lowest for moderately democratic regimes." Regime 
changes, moreover, do not significantly affect the volume of communications submitted to 
the HRC, but weakly increase the rate at which they are filed. Hypothesis 3a, then, finds 
modest support; Hypothesis 3b receives none. It also bears mentioning that population size 
and national wealth do not influence the number or rate of claims. 

The impact of global cultural and associational processes on use of the Optional Protocol 
is mixed. Clearly, participation in global civil society is important. As the number of INGOs to 
which a country's nationals belong increases, the number and rate of petitions filed against 
it also increases, confirming Hypothesis 4. On the other hand, the extent of a state's 
participation in global governance regimes (as measured by |GO memberships) does not 
significantly influence either the volume or rate of petitions filed. State sovereignty norms 
embodied in and championed by |GOs apparently do not discourage individuals from filing 
claims with the HRC. Nor does it appear that IGOs offer individuals effective or widely 
deployed alternatives for resolving their disputes with states, as suggested by Boyle and 
Thompson (2001). Hypothesis 5 is not supported. 

Furthermore, the consciousness-raising activities of the Vienna Conference evidently did not 
promote use of the Optional Protocol, contradicting Hypothesis 6. Perhaps human rights 
conferences prompt states to improve their practices, thus reducing formal allegations of abuse. 
Although it is beyond the scope of this study to evaluate that claim, it invites further analysis. 

Compared with the varied impact of world culture on use of the Optional Protocol, a 
country’s civilizational context profoundly affects both the number and rate of abuse claims 
filed against it. This finding contrasts sharply with the null effect of civilizational membership 
on the ratification of human rights instruments, including the Optional Protocol itself (Cole 
2005). Countries belonging to non-Western civilizations are universally less likely than Western 
states to have claims filed against them, although not all of the coefficient estimates surpass 
conventional thresholds of statistical reliability. Individuals in Islamic, Latin American and 
Slavic-Orthodox countries file significantly fewer claims against their respective governments 
than do their counterparts in the West (Models 1 and 3-5). The rate of petitions emanating 
from these countries is also lower relative to Western states (Models 6 and 8-10). African, 
Hindu and Sinic countries register consistently negative but statistically insignificant results 
for both the number and rate of petitions filed. 

Models 2 and 7 indicate that the “West vs. the rest” thesis also holds: the number and rate 
of human rights abuse claims is greater in the West compared to non-Western civilizations 
taken together. In fact, Model 2 demonstrates that Western countries account for an 
estimated 3.9 times as many claims as all non-Western countries combined (= exp[1.36}), 
holding other factors constant. This is true even though human rights practices in the West 
(mean = 4.76) are significantly better, on average, than in non-Western countries (mean = 
3.45; t = -23.44, p < .001). Of course, Western countries also account for fully 40 percent of 
country-year cbservations (see Appendix B), which indicates that they are more willing than 
are non-Western states to subject their human rights practices to external evaluation and 
monitoring. Nevertheless, on balance, Hypothesis 7 withstands scrutiny. 

As for the effects of diffusion (Models 3-5 and 8-10), results demonstrate that only intra- 
state diffusion processes affect the subsequent number and rate of claims filed (Models 3 
and 8). The number of communications filed during the previous year at the level of 
civilizations and the world at large has no demonstrable impact on future claims. Individuals 
apparently look only to their compatriots for cues about filing abuse claims. Alternatively, the 
national-level effect could be interpreted to mean that complaints filed at time t are triggered 
by the same (or broadly similar) political conditions that obtained at time +1, and suggests 
that a state’s human rights practices are stable over short periods of time. Either way, 
Hypothesis 8 is only partially affirmed.” 
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Conclusion 


Four factors — human rights practices, domestic opportunity structures, the cultural 
empowerment of individuals and intra-state diffusion — account for the number and rate of 
Optional Protocol communications. These results corroborate Boyle and Thompson's (2001) 
analysis of claims submitted under the European Human Rights Convention at the global, 
rather than regional, level. They find for Europe, as | do for the world, that state openness (i.e., 
democracy), human rights abuses and country-level INGO memberships all contribute to the 
number of human rights abuse claims filed against countries. In addition, | uncover 
civilizational effects that Boyle and Thompson were unable to assess, given the regional 
scope of their analysis. 

In explicating the political, cultural and institutional conditions that promote use of the 
Optional Protocol's individual complaint procedure, | contribute to a human rights literature 
that focuses almost exclusively on the efforts of non-governmental organizations as third- 
party champions of human rights (e.g., Keck and Sikkink 1998; Risse, Thomas and Sikkink 
1999; Tsutsui and Wotipka 2004). Indispensable as NGOs are to the protection of human 
rights, focusing our attention too narrowly on them overlooks those features of the 
international human rights regime that encourage the direct participation of individuals. As my 
analyses make clear, the number and rate of human rights abuse claims submitted to the HRC 
clearly increase as countries become less democratic and as human rights practices worsen, 
but also vary with the extent to which global and civilizational cultural forces constitute and 
empower individuals. The “myth” of the purposive individual, which originates in the West and 
continues to be strongest there, is real in its effects. Individuals from Western countries allege 
state abuse much more frequently than do most of their non-Western counterparts, even 
though differences in the human rights practices between the West and “the rest” would 
seem to predict otherwise. Whether or not the presumed “clash of civilizations” is real 
(Russett, Oneal and Cox 2000; Henderson and Tucker 2001), broad cultural cleavages 
certainly matter so far as use of the Optional Protocol is concerned. 

This is despite the fact that rates of Optional Protocol ratification, as well as ratification of 
the International Human Rights Covenants proper, do not differ across civilizations (Cole 
2005). Donnelly (2003), then, is only partially correct when he argues for the existence of an 
“overlapping consensus” on international human rights. Symbolic endorsement of human 
rights by states in the form of treaty ratification transcends broad civilizational divides, but the 
effects of human rights norms on the actual practices of individua/s are much slower to 
penetrate cultural barriers. Before filing a claim under the Optional Protocol, an individual must 
first acknowledge that he or she holds certain rights against the state, that those rights are 
worth protecting, and that something can and should be done to redress state abuse. 

Submitting a communication to the HRC therefore presumes a sense of moral worth and 
efficacy vis-a-vis the state that most Westerners take for granted. Culture, then, clearly 
matters for the implementation of the Optional Protocol, although the precise reasons are 
less clear. Perhaps the liberal ideology suffusing human rights discourse simply fails to 
articulate with non-Western worldviews. Or, non-Western individuals may lack the sense of 
efficacy needed to defend their human rights against state abuse. Even after holding 
individual attitudes “constant,” non-Western individuals may simply lack access to the 
normative, associational and legal networks that support the prosecution of abuse claims — 
indeed, such networks may even actively a/scourage individual action. These questions 
remain unanswered and deserve further attention. 

Despite the continued salience of civilizations, a model of society that builds a cult around 
the individual has, since at least World War II, transcended its Western origins and diffused 
widely across the globe. Nevertheless, the degree to which countries are linked into this 
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global model influences the extent of its penetration. As carriers of world-cultural models, 
INGOs transmit notions of the empowered and enfranchised individual around the world, and 
countries whose citizens participate most extensively in these organizations not coincidentally 
experience the greatest number and rate of human rights abuse claims. 

Although world-cultural frameworks are increasingly universal in scope as well as in 
pretension, they remain anchored historically and culturally in Western moorings, and 
therefore resonate more fully in Western countries. In a world with some other — perhaps 
Islamic — cultural hegemon, the ontological standing of individuals would most certainly differ, 
as would notions about their rights. All the same, countries from each of the world’s major 
civilizations currently subscribe to human rights in some form, and future work might 
compare more systematically the use of individual complaint procedures established by 
regional human rights treaties in Africa, the Americas and Europe. Regional human rights 
commissions collectively receive many more communications each year than the HRC, but 
still unknown is whether subsidiarity is a principle that extends to the enforcement of human 
rights norms. Are individuals best served by national governments, regional institutions or 
global treaties? Until this question can be answered, | conclude that when all else fails, 
individuals whose rights have been violated by states can, and under certain conditions do, 
take advantage of “last resort” mechanisms when those options are available to them. 


Notes 


1. By “sovereignty” is meant a state's ability to exclude external actors from its internal 
affairs (Krasner 1999). 


2. The Optional Protocol refers to formal allegations of treaty violation as “communications.” 
| substitute “cases,” “claims” and “petitions” interchangeably. 


3. The HRC has determined that individuals need not exhaust prohibitively costly remedies 
before filing complaints, however. Nor must they resort to “futile” or “unreasonably” 
lengthy remedies (Joseph, Schultz and Castan 2004: 103-38). 


4. | place “actors” in quotes to underscore the neo-institutional idea that agency is a 
culturally constructed rather than primordial feature of states (Meyer 1999) and individuals 
(Meyer and Jepperson 2000). 


5. Communications filed against Jamaica are responsible for much of the dramatic increase 
in the late 1980s, with 25 claims issued in 1987 and 30 in 1988. Jamaica’s denunciation of 
the protocol in 1997, together with lagged data reporting procedures, likely accounts for 
much of the sudden decline thereafter. 

6. Of the 96 countries that were party to the Optional Protocol at any time between 1976 and 
1999, my sample includes 82. 


7. In addition to the State Department's measure of human rights practices, | tried a similar 
variable constructed from Amnesty International reports. Both variables produce 
substantively identical results in the ensuing analyses. 


8. Other conferences have addressed the domain-specific rights of, inter alia, minorities, 
women and indigenous peoples, but my interest is with venues that focus on human 
rights as a comprehensive issue. 
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9. The remaining two civilizations not accounted for here are Buddhist and Japanese. 


10. Jamaica renounced the Protocol in 1997. Trinidad and Tobago withdrew in 1998, 
immediately re-ratified with a new reservation prohibiting death row inmates from 
petitioning the HRC, and again withdrew in 2000. Guyana rescinded in 1999 and 
subsequently rejoined with its own death-row reservation. 


11. Using estimates from Model 6, the predicted rate of claims-making reaches a nadir when 
a country's polity score equals 2.0. This point of inflection is calculated as the (negative) 
linear coefficient divided by twice the quadratic coefficient (i.e., --.04]/2[.01] = 2.0). 


12. To determine if Jamaica, which accounts for nearly one-quarter (22.5 percent) of all 
communications filed with the HRC during the observation period, exerts a 
disproportionate influence on results, | conducted two sets of supplementary analyses. 
First, a dummy variable set equal to “1” for Jamaica was entered into analyses, but it failed 
to achieve statistical significance and did not alter the effects of other variables. Second, 
models were re-estimated after excluding Jamaica; this also did not change the 
substantive interpretation of findings. 
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Abstract 

This paper applies a Boolean approach to examine the social background of movements for 
linguistic human rights. Predictive determinants to explain the occurrence of LHRs 
movements in this study included linguistic diversity within a country, literacy rate, 
population size, national income as an index of affluence, and the existence of a constitution 
supporting those rights. Data for 159 countries were collected and analyzed using a Boolean 
analysis. The result of the analysis shows that there are four combinations of economic and 
linguistic conditions that cause LHRs movements in a country. A further analysis with varying 
cutoff values reveals that the combination GD (higher gross income AND linguistic diversity) 
is the “strongest” condition for LHRs movements in the four combinations. 


Introduction 


Linguistic Human Rights: A Historical Background 


Human beings naturally seek out opportunities to use languages they have learned first and 
identify with. However, some minority groups have been and are now forced to use majority 
languages in formal education and social settings. 

The concept of Linguistic Human Rights was first recognized in the Austro-Hungarian 
Empire at the time when movements for self-determination came to the fore among ethnic 
minorities. Consequently, a legal guarantee of LHRs was attained in the Empire. This legal 
measure in the Austro-Hungarian Empire affected other countries’ constitutions, such as the 
Weimar Constitution in 1919 and the constitution of the Union of Soviet Socialist Republics 
in 1936, both of which guaranteed an individual the right to use languages of his or her group. 

The Declaration of Human Rights, adopted by the United Nations General Assembly in 
1948, contains a statement referring to LHRs as one of the basic human rights. Countries 
which have ratified the Declaration are obviously expected to abide by its provisions and carry 
out policies in accordance with those provisions. However, the aim of the Declaration was the 
protection of human rights for individuals rather than the protection of rights for minority 
people including minority language speakers. 


A shortened version of this paper was read at the 2nd General Conference of the European 
Consortium for Political Research at Philipps-Universitat in Marburg, Germany, Sept. 18-21, 2003. 
We are grateful for participants’ comments, as well as the extremely constructive suggestions by one 
of the anonymous reviewers for Social Forces. We also thank Alan Brady for his help. Direct 
correspondence to Atsushi Ishida, School of Sociology and Social Work, Kwansei Gakuin University, 
1-1-155 Uegahara, Nishinomiya, Hyogo, JAPAN. E-mail: aishida@kwansei.ac.jp. 
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Tove Skutnabb-Kangas and Robert Phillipson (1994) point out that itis only after the 1970s 
that LHRs were clearly and concretely formulated. Skutnabb-Kangas defines the Linguistic 
Human Rights in 1983 as below. This definition is quoted in the “Resolution on Linguistic 
Rights,” which was adopted by the International Seminar on Human Rights and Cultural 
Rights held in Recife, Brazil in 1987. The conception is summarized as follows (Skutnabb- 
Kangas and Phillipson 1994: 98-99)": 


1. Every social group has the right to identify positively with one or 
more languages and to have such an identification accepted and 
respected by others. 

2. Every child has the right to learn fully the language(s) of his/her 
group. 

3. Every person has the right to use language(s) of his/her group in 
any official situation. 

4. Every person has the right to learn fully at least one of the official 
languages in the country where s/he is a resident, according to 
her/his own choice. 


This definition implies that the promotion and protection of LHRs is an attempt to apply the 
concept of human equality so as to cover the use of language and, hence, make any linguistic 
discrimination visible and problematic, and thus abolish such discrimination. 

The Convention of the Council of Ministers of the Council of Europe adopted the European 
Charter for Regional or Minority Languages in 1992, and this charter was ratified by 12 countries. 
In 1996, the World Conference of Linguistic Rights in Barcelona adopted the Universal 
Declaration of Linguistic Rights, the first universal declaration which specifically and clearly 
described LHRs. The Barcelona conference was organized by the Translation & Linguistic Rights 
Committee of the International PEN Center and CIEMEN (NGO) under the auspices of UNESCO. 
After the conference, the follow-up committee appealed for the adoption of the declaration by 
the United Nations General Assembly. However, because it was adopted by NGOs, it was not 
considered an official declaration. It has not been widely accepted. 


Research Questions 


Social movement is one of the ways for a linguistic group to demand LHRs. By the concept 
of movement, we mean collective (and sometime illegal) action for a certain political or 
social purpose. So, LHRs movements can be conceptualized as a collective action for 
acquiring partial or complete LHRs. Some movements for self-determination and/or for 
independence from colonial rule put forward a claim for use of their own languages, as was 
the case in East Timor. These movements, then, can also be classified as LHRs 
movements, although they are integrated into a wider encompassing cause. Whether a 
linguistic group does or does not make demands and/or claims in the form of an overt 
social movement depends upon the group's social situation. If a linguistic minority is 
powerless and small in size, it might not be able to claim rights amongst more dominant 
groups. Especially, in the post-World War II global society, the socio-economic condition of 
countries and their minority populations has played an important role in LHRs movements. 
However, very little empirical research has been conducted to clarify any of the specific 
determinants. 

A Boolean approach (Ragin 1987, 2000) is best suited to analyze our research questions 
such as: In what kind of societies do such movements occur and under what conditions? 
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LHRs Movements 


Cases of LHRs Movements 


We examined 7he Data Book of Resistance, Conflict and War 1900-1997, compiled by Tatsuo Urano 
(2000) to collect cases of movements directly related to LHRs after World War II. Urano collected 
data on 10,842 conflicts in the 20th century. Because Urano’s database is constructed for the 
purpose of statistical analysis and is a nearly exhaustive database, it is also useful for our study in 
that it refers to specific variables with quantitative statistics. We made use of the theme index to 
collect the cases. Each case in the database is described with a brief explanation, from which we 
judged whether or not it was related to LHRs in a relevant way. We picked up cases included in the 
themes labeled as “indigenous people,” “minority,” “immigrant workers,” “human rights,” “ethnic 
movement for self-determination” and “remote region” from the database (Table 1). 


Table 1: Number of Cases in Each Theme 





Theme Number of Cases 
Indigenous People 121 
Minorities 84 
Immigrant Workers 161 
Human Rights 225 
Ethnic Movement for Self-determination 330 
Remote Region(s) 996 


Out of the above, only those cases in which the issue was explicitly LHRs or an equivalent 
(use of or education in their own languages) were grouped as LHRs movements. Additionally, 
we checked other themes so as not to exclude other relevant cases from the database. 
Consequently, we managed to collect 38 cases observed in 21 countries. Table 2 shows the 
list of cases. 

Although each case is described briefly with an emphasis on the course of events, there is 
a difference in the amount of description between cases depending on the availability of 
information and its importance. So, we possibly excluded some cases despite their focus on 
LHRs. LHRs movements are meant to appeal to the public as well as to secure a right, thus 
visibility and overtness are primary features. We assumed that cases without descriptions of 
highly visible and overt actions were more covert movements. 

However, we did not want to overlook cases with bearing on our research. For this reason, 
we also collected cases categorized as “indigenous people's movement” or “minority 
movement” or “ethnic movement for self-determination” for final selection of relevant cases 


under concern. 


Theory of LHRs Movements 


There have been a number of references and theories in the literature, mostly about the 
normatively-oriented aspect of LHRs. However, there exist no specific theories about LHRs 
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Table 2: The List of the LHRs Movement after World War II 


i EEE 


Movement Country Years 


Ainu Movement for Autonomy 

Indigenous People’s Demand for Rights 

Demonstration by Chinese Junior High School Students 
Strike by Chinese Teachers 

Language Riot in Assam 

Language Riot in Assam 

Language Riot in Karnataka 

Bengali Language Movement 

Bengali Language Movement 

Language Riot 

Student Demonstration against Introduction of Arabic Language 
Tamil Resistance Movement 

Racial Riot 

Riot against the Introduction of the Arabic Language 
Student Demonstration 

Language Confrontation 

Demonstration in Soweto 

Movement for Autonomy in Occitan 

Northern Frisian Speakers Movement for Rights 
Movement for Making Sardinian the Official Language 
Language Confrontation 

Language Confrontation in Catholic University 
Language Confrontation 

Language Confrontation 

Language Confrontation in Fourons / Voeren 
Movement for Foundation of the National Assembly for Wales 
Welsh Language Movement 

Sami Movement for Rights 

Sami Movement for Rights 

Ethnic Riot in Georgia 

Demonstration for the Protection of Georgian 
Demonstration in Uzbek 

Gagauz Movement for Autonomy in Moldavia 
Assimilation Policy for Turks and Turks Riot 

Ethnic Confrontation 

Quebec Problem 

Demonstration by Italian People 

Aviation Strike against the Use of French in Airports of Quebec 


Japan 1992 
Taiwan 1984 
Malaysia 1973 
Malaysia 1987 
India 1960 
India 1972 
India 1984 
Pakistan 1948 
Pakistan 1952 
Pakistan 1972 
Bangladesh 1982 
Sri Lanka 1956-1957 
Sri Lanka 1961 
Mauritania 1966 
Cameroon 1983 
Cameroon 1993 
South Africa 1976 
France 1959 
Germany 1948-1949 
Italy 1977-1978 
Belgium 1960-1963 
Belgium 1967-1968 
Belgium 1968 
Belgium 1976 
Belgium 1986-1988 
UK 1950-1979 
UK 1982 
Norway 1968-1978 
Denmark (GL) 1975-1987 
USSR 1978 
USSR 1981 
USSR 1989 
USSR 1989-1994 
Bulgaria 1970-1989 
Romania 1990 
Canada 1968-1985 
Canada 1969 
Canada 1976 
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movements per se beyond mere descriptive “country studies” (Skutnabb-Kangas and 
Phillipson 1994). We could perhaps refer to theories of social movements and collective 
behavior in general. Recent theoretical developments in the area of study would be 
represented by a theory of resource mobilization (McCarthy and Zald 1973; Morris and 
Mueller 1991; Oberschall 1973; Useem 1981) and theory of new social movements (Melluci 
1989; Offe 1985; Touraine 1985). 

LHRs movements are “new social movements” in the sense that people are now seeking 
ethnic identity in the midst of ethnic diversity and heterogeneity, which are also closely tied to 
self-realization, life-style and values, etc. New social movements, however, implicitly covers 
the advanced societies as its field, while LHRs movements are not confined to the advanced 
societies but are widely observed among less economically advanced societies with various 
sizes of population. 

The theory of resource mobilization emphasizes availability of a variety of resources, the 
nature of existing organizations with varied preferences, and interactive processes between 
the authorities and those people involved. In general, this paradigm focuses on the 
mobilization process rather than deprivation and grievance which were traditional focal points 
of studies of social movements. Under the influence of theory of resource mobilization, many 
recent studies of social movements focus on the concept of “political opportunity structure, ” 
as well as “mobilizing structure” and “framing” (McAdam et al. 1996). Political opportunity 
structure is defined as “consistent dimensions of the political environment which either 
encourage or discourage people from using collective action.” (Tarrow 1994: 17) Although 
there have been many studies about political opportunity structure, it still remains uncertain 
how the structure effects the occurrence of social movements. Moreover, the literature in 
these theories does not take up LHRs movements at all perhaps because historical cases of 
such movements may be of only minor interest. 

We want to decide the specific determinants that cause LHRs movements as well as the 
general determinants that cause movements in general. To that end, our research begins with 
macroscopic factors such as socio-economic structural and institutional aspects. 

One of the possible determinants of the LHRs movement is the degree of the host 
country’s socio-economic development. The unique study by Coulmas (1993) emphasizes a 
strong relationship between domestic economic situation and domestic linguistic situation. 
Possibly, such determinants cause not only LHRs movements but also some ethnic or labor 
movements as well. It is also possible that poverty causes the ethnic movements with 
Marxist orientation, which may or may not be related to LHRs movements. On the other hand, 
as theories of new social movement suggest, there is a new kind of movement taking place 
in affluent societies, which seeks symbolic values and identity. 

Furthermore, it is likely that a LHRs movement results from social geographic conditions 
such as diversity of languages within a country and varying population size. These conditions 
seem to be specific to LHRs movements. 

Skutnabb-Kangas and Phillipson (1994) also stress the influence of education in LHRs, 
suggesting that at least some education is required for people to demand LHRs and 
participated in LHRs movements, which in turn promotes minority education. Therefore, 
education as outcome and media of movements is, we believe, to be examined in the 
following analysis. LHRs movements imply that a minority group and a majority group (or the 
authorities) are in conflict, where some linguistic diversity is assumed to be involved in 
occurrences of LHRs movements. 

Because there is no single encompassing theory to explain the occurrence of LHRs 
movements, we could at best distinguish relevant determinants that have some bearing on 
the causational linkage among them in mind at the moment. Cases of LHRs movements are 
not as large in number as seen shortly but cover a wide variety of countries at different points 
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in time. In such a situation, we cannot fully rely on a statistical approach. We need a better 
way to extract the determinants from a relatively small number of cases. A Boolean approach 
as the comparative method is best suited for the purpose.° 


A Boolean Approach 


An Introduction 


The Boolean approach, an application of Boolean algebra in the field of social science, was 
initially proposed by Charles C. Ragin (1987) as a method for qualitative comparative 
analysis. The approach aims to find a configuration of conditions which causes a social 
phenomenon, by means of a comparison of multiple causal conditions from each case. 
Hence, the approach has the advantage of finding not a tally of the relevant correlatives 
leading to a dependent variable as such, but a particular pattern or combination of variables 
to produce the outcome. Furthermore, a fairly limited number of cases or observations under 
study does not allow us to depend on conventional variable-oriented statistical analysis such 
as regression analysis. However, these cases do allow us to analyze more than mere 
idiosyncratic case-oriented studies because the Boolean approach empirically generalizes 
phenomena in causal terms. 


Table 3: An Example of a Truth Table 





Condition A Condition B Phenomenon C 
1 1 { 
1 0 { 
0 { 0 
0 0 0 





In Boolean algebra, variables (independent and dependent) must be binary, that is, they are 
true (or present) or false (or absent). In our paper we represent these values in numerical 
form: 1 indicates true (or present); O indicates false (or absent). Thus, if we want to use any 
non-binary variables for a Boolean analysis of data, we must transform these into binary 
variables. Because such a transformation sometimes loses some information, we should be 
careful in the choice and transformation of variables. 

A truth table is used for representing data in the Boolean approach. In a truth table, each 
logical combination of values on the independent variables is represented as one row. Table 3 
is an example of a truth table. Suppose there are four societies. Some societies have 
experienced an occurrence of social phenomenon C while others didn't. We want to study 
under what conditions C occurs. Also suppose there are two possible conditions A and B 
which might be related to an occurrence of C. In Society 1: A, B and C are observed. In 
Society 2: A and C are observed. But, in Society 3: only B is observed; while in Society 4: A, 
B and C are not observed. From these observations, we construct a truth table as follows. 

In this situation, there are four logical combinations of input values (presence or absence of 
each condition), and each combination has a distinct output value (value of C) respectively. 
However, in most studies, it is difficult to determine an output value for each combination because 
there may be cases with the same combination of conditions but with a different output value. 
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There are some possible solutions for the problem of such a contradiction (Ragin 1987: 113-8). In 
our study, we will apply frequency criteria as a cutoff value for determining an output value. 

Here we follow the routine notation where uppercase letters indicate the presence of a 
condition or phenomenon (that is, the value is 1), and lowercase letters indicate its absence 
(that is, the value is 0). For example, A indicates the presence of condition ‘A and b indicates 
the absence of condition B. Then, by looking at Table 3, we can describe multiple 
combinations of conditions which cause C using logical operators OR and AND: 


(A AND B) OR (A AND 6) 


In Boolean algebra, addition is equivalent to OR, while multiplication is equivalent to AND, so 
the above description can be reformulated to: 


AB +Ab=C (1) 


Boolean algebra allows us to simplify the complexity of expression for formulating a 
simple Boolean statement. The most fundamental of minimization rules is: 


/f two Boolean expressions differ in only one causal condition yet 
produce the same outcome, then the causal condition that distinguishes 
the two expressions can be considered irrelevant and can be removed 
to create a simpler, combined expression. (Ragin 1987: 93) 


Thus, in the above case AB and Ab both produce the outcome C, but differ only in the value 
of condition B. So we can minimize Equation (1) as: 


A=C (2) 


This minimized equation suggests that C will occur whatever the value of condition B is, thus 
the presence of condition A is the only cause which produces the outcome C, that is, 
necessary and sufficient cause. A cause whose presence is needed for a certain outcome is 
regarded as a necessary condition, while a cause which can produce a certain outcome by 
itself is regarded as a sufficient condition. 

The Quine-McCluskey minimization technique will be employed in the minimization of 
Boolean equations in this study. To process the data, we used the packaged program called 
fs/QCA version0.963 made by Kriss Drass and Charles C. Ragin and a program we wrote in 
Mathematica 4.2 which we believe made the program much easier to use. 


Correlatives and their Operational Definitions 


We operationally made conditions related to LHRs movements as binary variables so that data can 
be dealt by using Boolean addition and multiplication. The data set for each variable was created 
from all countries in the world at the present time. Note that because there are significant 
institutional differences between the USSR and countries independent from USSR at present, we 
also collected the data taken from the then USSR in 1990 to take into account movements that 
occurred under that regime. Aside from that exception, we collected the data of countries 
regardless of the existence of movements in the year 2000. It is reasonable to suppose that binary- 
valued conditions, especially social-geographic conditions such as population size and linguistic 
diversity, remain the same throughout time after the end of World War Il. It is also reasonable to 
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suppose that the current condition of a country reflects its own history. Hence, we safely infer the 
determinants of occurrence of movements from the analysis of contemporary conditions. 

We show the list of conditions and how we create binary variables from the data in the 
following. We denote variables in the analysis by initial italic letters. 


D: A Diversity Index 
This is one of the social geographic conditions relevant to languages. Greenberg's 
linguistic diversity index shows the probability that any two people in the country picked 
at random will have different languages (Greenberg 1956; Lieberson 1981). The diversity 
index is expressed as: 


Diversity Index = i-)>, nz; (P;F 


where P; is the proportion of total population in the / th language group in a country. If all 
people belong to one group, that is, P; = 1, then the diversity index becomes zero, 
meaning complete homogeneity. If all groups consist of the same size, the index 
approximates one as the number of groups increase, meaning highly diversity. We 
collected the data from the Ethnologue: Languages of the World (Grimes 2000), which is a 
database presented by SIL International on the web. 


L: A Literacy Rate 

To measure the degree of development in the field of education of countries, we adopted a literacy 
rate of the population. The data was taken from 7he World Almanac and Book of Facts (2001). Note 
that we assigned 100 percent to those countries (Japan, UK, France, Germany etc.) whose literacy 
rates are obviously close enough to 100 percent but are shown as “unknown” in the book. However, 
literacy rates of 31 countries remain unknown. We will exclude these countries from the following 
analysis. We thereby exclude cases of LHRs movements in Denmark, Norway and Taiwan. 


P: Population 
This is one of the social-geographic conditions relevant to the constitution of languages in 
use. The data was also taken from 7he World Almanac and Book of Facts (2001). 


G: Gross National Income 

To measure the degree of economic development of countries, we adopted GNI (Gross 
National Income) per capita. The data source is also The World Almanac and Book of Facts 
(2001). Coulmas (1993) suggests that linguistic conditions, such as the degree of spread of a 
common language (or the standardization of languages within a country) and linguistic 
diversity, have an effect on the economic development of a country. If this holds true, then we 
could safely predict that there will be more cases of LHRs movements in affluent countries 
than in non-affluent countries since affluent countries must have enforced a language policy 
towards the spread of a common language and suppressed minority languages, if any. 


C: Constitution 

To measure the legal institutional condition related to LHRs, we created a binary variable which 
stands for the presence (or the absence) of the constitutional provisions which guarantee the rights 
of minority language speakers. Although the constitutional provisions, if any, are generally expressed 
in such an abstract way that we cannot evaluate the policy, the presence of a constitutional 
guarantee seems to encourage LHRs and facilitate movements.‘ We used the database of linguistic 
rights (UNESCO 2002) created by UNESCO's MOST (Management of Social Transformations) 
Program which put special emphasis on policy-oriented research in multicultural societies. Figure 1 
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shows the process of assignment of binary value to each country. We assigned “-" to missing or 
uncertain cases. However, we shall re-assign “0” to “-" in the following analysis. The reasoning and 
process of assigning values to the variable C will be shown in the following steps (Figure 1). 


M: Movements 

This is a dependent variable in the following analysis, designating whether each country had 
LHRs movements or not. The way of assigning values to each combination of conditions will 
be shown later on. 

We treated countries whose literacy rates are unknown as missing cases. As a result, 159 
countries including the USSR remained as valid cases. We computed the median of D, L, P 
and G, respectively for those countries; thus for each variable we assigned “1” to countries 
whose value was larger than the median, and “0” to countries whose value was less than the 
median. Since the median divides the total population into two subgroups with the same size, 
it conveniently highlights the different types of countries by making every possible 
combination of conditions the same size. Consequently, we can explore various types of 
configuration of conditions which determine the occurrence of LHRs movements. 


Figure 1. The Process of Determination of Variable 


Step 1. The country name is in “Alphabetical List of Countries” 


Step 2. The country name is in the list of “without constitutional provisions 
related to language” 


Step 3. The country name is in the list of “with constitutional provisions 
related to language,” AND the constitution of the country contains the - 
provision which guarantees LHRs of minority language speakers. 


Uncertain 





Analysis 


Now, we turn to the analysis with V/ as a dependent variable and D, L, P, G and C as independent 
variables. We assign the binary value of / to each combination of independent variables as 
follows: the row which includes more than one country where the LHRs movement occurred 
was assigned “1,” the row which includes no country where the LHRs movement occurred was 
assigned “0.” The truth table appears in Table 4. 
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Table 4: Truth Table for the Analysis 


Countries with 
Minority or Indigenous 


Countries with 
Movements 


LHR movements 


D LP G C M_ Countries 


16 
13 
12 
12 
11 


34 


18 


159 
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Minimizing rows with M = 1, then we have: 


M = CDGP + CLdg + DPg/ + GLPd + cdg/ 
= GP(CD + Ld) + g {DP! + d (CL + cl)} (3) 


Although there is another way of factoring the minimized equation, we are tempted to adopt 
Equation (3) as the factored equation to elucidate the pattern of the occurrence condition of 
the LHRs movement. Note that GP as a combinatory factor, or equivalently, as an intersection 
of G and P composes the common factor out of two independent sets of factors leading to 
movements. When GP is accompanied by either CD or Ld, it may produce movements. A 
similar discussion holds for the second term beginning g. 


Interpretation of Results 
Conditions of Occurrence of the LHRs Movement 


The structure of Equation (3) says this: combinations of independent variables leading to 
occurrences of LHRs movements (//) are grouped into two, the one with G (AND P) as a 
necessary condition and the other with g as a necessary condition. The former group can be 
further sub-grouped into two by paying attention to the presence and absence of diversity, 
while the latter group also can be sub-grouped into two although the final specification is a bit 
more complex than the former group. A complete specification can be illustrated as in Figure 
2 for the convenience of interpretation. 


Figure 2. The Structure of Equation (3) 


(with CL + cl) 





A glance at Figure 2 will reveal that there are four types of conditions of occurrence of the 
LHRs movement, and each type consists mainly of the combination of variables G and D. This 
result clearly shows that the combination of economic and linguistic conditions cause LHRs 
movements in a country. 
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The median of GNI was $1,550. Hence, we regard a country with G as a high or middle 
income country, and a country with g as a low income country. We divided the countries 
which is included in the combinations with M/ = 1 into two subgroups according to the binary 
value of G, and then we counted the number of countries where the LHRs movements 
occurred, and the number of countries where the minority or indigenous people's movement 
occurred for each subgroup as well (Table 5). 


Table 5: Number of Cases for Each Subgroup Divided by the Value of G 


)) with LHR Movements or Indigenous Movements Countries 
G 10 (31.3) i (21.9) 32 (100) 
g 8 (16.3) 10 (20.4) 49 (100) 
Total 18 (22.2) 17 (21) 81 (100) 


Table 5 shows that the countries with G have a higher occurrence rate of the LHRs 
movements than the countries with g. This result suggests that the condition G is a more 
effective condition of occurrence of the LHRs movement than g. 

The median of the diversity index was .44. Next we turn to the characteristics of the diversity 
index. For simplicity, we postulate that P} = P7 =... = Py 2, nj; Pj =1. If we have P; = P) = 
.5, then the diversity index is .5. This represents the situation in which a country consists of two 
linguistic groups of the same size. As the number of subgroups in a country increases, the value 
approaches 1, then finally we have lim, ie nj=, (P;)*) =1. This represents the ultimate 
multi-lingual situation in which every person in a country has his/her own language. From these 
characteristics of the diversity index, it can be seen that there are roughly two types of countries 
with D: countries with two or three linguistic groups of nearly the same size, and multi-ethnic 
and multi-lingual countries. On the other hand, most countries with d consist of a majority 
language group of a large size and minority language groups of small sizes.country 

In the following subsections, we will examine each prime implicant with some examples. 
The list of countries within each prime implicant appears in Table 6. 


Table 6: The List of Countries within Each Prime Implicant 





G and D CDGP Malaysia, Italy, Canada, Belgium, South Africa 
G and d GLPd USSR, UK, France, Japan, Germany 
g and D DPgl Pakistan, India, Cameroon 
g and d CLdg Sri Lanka, Romania, Bulgaria 
Cdgl Bangladesh, Mauritania 





Countries with G (Affluence) and D (Diversity) 


LHRs movements in countries with G and D seem to originate in the language 
confrontation in a relatively rich country. A typical case is Belgium. Belgium has seven 
languages and a .65 diversity index. However, there are two large linguistic groups: Dutch 
(Flemish) speakers with 57 percent of the total population and French speakers with 42 
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percent of the total population (Minority Rights Group 1990). The language confrontation 
between those two linguistic groups has been a huge problem in political, economic, 
social and cultural domains since the 1950s. 

Malaysia is a multi-lingual and multi-ethnic country with a .75 diversity index and 140 
languages. There are mainly two ethnic groups: Malay with 50 percent of the population 
and Chinese with 35 percent of the population (Minority Rights Group 1990). Malaysia has 
the second highest percentage of Chinese within a country in Southeast Asia. Chinese 
immigrants entered the Malay Peninsula after the 19" century. A flood of Chinese 
economic refugees after 1979 caused in particular an emotional ethnic confrontation in 
Malaysia. 

The language confrontation between French and English speakers in Quebec, Canada 
is well known. French speakers are a comparatively large minority language group with 25 
percent of the national population and 80 percent of the population of Quebec, while 
English speakers are the majority language group. 

Countries with G and D are also facilitated towards movements by conditions P, 
namely large population and C, namely constitutional guarantee of the LHRs of minority 
language speakers. Note that the median of the population of each country was 8.34 
million. This means that those categorized as countries with P=1 are not necessarily “big 
countries” as we ordinarily take them to be. 


Countries with G (Affluence) and d (Lack of Diversity) 


It can be said that LHRs movements in the countries with G and d originate in the assertion of 
LHRs by ethnic and linguistic minorities in a relatively rich country. The most typical case is 
“the ethno-regional movement” (Esman 1977) in the Occitan region in southern France.® 
Economic stagnation in the region in the 1950s encouraged the awareness of regional 
identity, and movements against the centralization of politics in France occurred (Ashworth 
1977, 1978 and 1980). The interesting feature of these movements is that the young 
generation who lost the ancient culture and language, that is, “the generation of the May 
Revolution” in France was actively involved in the movement (Kajita 1985). L is a facilitating 
condition of the countries with G and D. Hence, LHRs movements in the countries with G and 
D can be interpreted as a kind of “new social movement” in which young generations with 
high educational achievement and new values rediscover ethnic or regional identity as one 
of the alternative identities. 

In Japan, the activities for preservation and the revival of the Ainu language are 
developing among the young generation of the Ainu people, the indigenous people living 
in northern Japan (Gengoken Kenkyukai 1999: 30-7). 


Countries with g (Lack of Affluence) and D (Diversity) 


LHRs movement in the countries with g and D originates in the language confrontation in 
relatively poor countries. In addition, the countries with g and D have comparatively large 
population (P) and lower educational level (/). 

Pakistan was founded as a country for Indian Muslims after the partition of the Indian 
sub-continent. However, there has been a keen confrontation between West Pakistan 
(Pakistan today) where almost all people speaks Urdu, and East Pakistan (Bangladesh 
today) where the majority language group was Bengali. The movement occurred in East 
Pakistan resulting from the confrontation between the two language groups. 
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India is one of the most multi-lingual and multi-ethnic countries in the world, with a .93 
diversity index and 398 languages. Two cases of LHRs movements occurred in Assam, 
which is characterized by the most complex ethnic composition. In Assam, there has been 
a language confrontation between Assamese and Bengali. 

Similarly, Cameroon is a multi-lingual country with a .97 diversity index and 286 
languages. Since the French and English colonies were united into one country, there has 
been serious language confrontation between French and English speakers. 


Countries with g (Lack of Affluence) and d (Lack of Diversity) 


LHRs movements in countries with g and d originate in the assertion of LHRs by ethnic 
and linguistic minorities in a relatively poor country. Furthermore, we can divide this group 
into two subgroups according to the difference of the conditions CL and c/. 

The countries with gdCL include two East European countries: Romania and Bulgaria. 
Within Romania there is the problem of Transylvania. The land of Transylvania was 
alternately occupied by Romania and Hungary until it was incorporated into Romania. 
Hungarians comprise as much as 10 percent of the population of Romania, with a great 
number living in Transylvania (Minority Rights Group 1990). Bulgaria has adopted the 
policy of assimilation of Turks (9 percent of the population) since the 1970s. In general, the 
LHRs movements in these countries can be regarded as a reaction to suppression by the 
majority. 

The feature of the LHRs movements in the countries with gdc/ (Bangladesh and 
Mauritania) is not much different from that in countries with gdCL in that they are all 
characterized by a reaction to suppression by the majority. The distinctive feature lies in 
the fact that the reactions in Bangladesh and Mauritania as well was toward the 
introduction of the Arabic language into the school training as part of a religious 
educational policy. 


Further Analysis for Varying Cutoff Values 


In the previous interpretations, we assigned "1" as a value of M/ to the row that includes 
more than one country where the LHRs movement occurred. However, all rows assigned 
"1" as an output value were, in fact, contradictory rows, in the sense that each row 
includes some countries where LHRs movements occurred and some countries where 
they didn’t. Consequently, our previous assignation rule was equivalent to adopting a 
certain frequency criterion for deciding output values of contradictory rows. We call such 
criteria as “cutoff values.” We round the value of M/ into “1” when the occurrence rate is 
above the designated cutoff value and “O"” when the occurrence rate is below the 
designated cutoff value. We mean by an occurrence rate the proportion of countries 
where the LHRs movement occurred to entire countries having the same combinations of 
input values in common. Then, the cutoff value .006, which is approximately equal to 
1/159, was adopted in our previous assignation rule. This is the minimum possible 
occurrence rate in one row. By using this cutoff value, we made a finest-grained analysis 
of multiple causation of LHRs movements. As seen before, it was useful to classify the 
context of diverse LHRs movements all over the world. However, the higher the cutoff 
value becomes, the simpler the Boolean equation we will obtain. In the following analysis 
we show that different cutoff values lead us to the different fined formulations, and show 
the “robustness” of each type of combinations of conditions. 
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Now, let us check the results of how the value of M/ changes (from 1 to 0, or from 0 to 
1) for varying cutoff values. Table 7 shows the value of / for each conditional combination 


with different cutoff values. 


Table 7: The Truth Table with Different Cutoff Values 


M (with cutoff value) 


.006 


Rate 


Cc 


A3 
:15 
.00 
08 
.00 
oh 
13 
.00 
14 
.00 


67 


.00 
.00 
.00 
00 
.00 
.00 


.00 
00 
00 
.00 
.00 


00 
.00 


.00 


90 


.00 
.00 


.00 
.00 
.00 
.00 
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Firstly, if we adopt .006 as the cutoff value, then we have same equation as Equation (3). 
As seen before, Equation (3) shows that there are four types of combinations of the variables 
G and D which, together, cause the LHRs movements in a country. 

However, setting a cutoff value to .1 instead, for example, leads us to re-evaluate the value 
of M from 1 to 0 for the combinational pattern of independent variables D/Pge which might 
change the final formulation. That is, as a consequence, we have two expressions of 
equations, both of which end up with same simplicity, such as: 


M = digc + dLgC + dLPG + DIPC + DPGC 


= GP(CD + Ld) + gd (CL + cl) + DIPC (4) 
M = digc + dLgC + dLPG + DIPC + LPGC 
= GP(CL + Ld) + gd (CL + cl) + DIPC (5) 


For the interpretation, we should adopt Equation (4), and then we can find that one of the 
combinations of the variables G and D, namely gD, has disappeared from the formulation. 
The occurrence rate in the world (that is, 18/159) is .113. If we adopt this numerical value as 
a cutoff value, we can obtain the same result as Equation (4) and (5). 

Suppose, next, we adopt .2 as a cutoff value, then we have: 


M = DPGC + diPgc + dLPGc + dLPgC 
=P {G (CD + Lced)-+ gd (CL + cl)} (6) 


We see from Equation (6) that P turns out to be a necessary condition, and that there are three 


types of conditions, GD, Gd and gd. 
lf we adopt .5 as a cutoff value, then we have: 


M = DPGC + diPgc + dLPGC 
=P {G(CD + LCd) + gdcl} (7) 


Similarly as in Equation (6), P is also a necessary condition and there are three types of 
conditions in Equation (7). However, facilitating conditions both of Gd and gd are different 
from those in Equation (6). Finally, if we adopt .6 as a cutoff value, then we only have: 


M = DLPG (8) 


Table 8: The Results of Minimizations for Varying Cutoff Values 





Necessary 
Value __Minimized Equation Condition Types of Condition 
.006 GP (CD + Ld) +g {DPI + d (CL + cl)} None GD, Gd, gD, gd 
1 GP (CD + Ld) +gd (CL + cl) + DIPC None GD, Gd, gd, DIPC 
2 P{G (CD + Lcd) + gd (CL + cl} E GD, Gd, gd 
5 P{G (CD + LCd) + gdch P GD, Gd, gd 
6 DLPGC DLPGC GD 


The results indicate the “robustness” of each type of conditions (Table 8). That is, gD is a 
“weak” condition, while GD is a “strong” condition. In general, a condition is said to be 
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“strong” if and only if it persists all way through the varying cutoff values we choose for a 
dependent variable. A condition is said to be “weak” if and only if it holds for a subset of 
different cutoff values. A condition might be said to be “arbitrary” if and only if it pops up only 
for a particular cutoff value. We shall be tempted to pay more attention to “strong” conditions 
generally, but we believe it hinges on our analytical purpose whether “strong” conditions 
deserve more attention. When we want to explore the possibility of the occurrence of an 
event, no matter how minimal, all of the “weak” conditions must also be considered. When 
we want to know the greater probability of occurrence, we would pay more attention to the 
“stronger” conditions. In our analysis, the condition GD is regarded as the “strongest” 
implying that LHRs movements are more likely to occur in countries characterized by higher 
gross national income AND linguistic diversity. 


Conclusions and Future Research Tasks 


Thus far, we have presented an analysis of the determinants of the LHRs movements after 
World War Il by using a Boolean approach. Results show that there are four types of 
combinations of the variables G and D which, together, cause the LHRs movements in a 
country. We conclude that each combination of domestic economic condition and domestic 
linguistic condition causes different types of LHRs respectively. This means both conditions 
determine the nature of LHRs movements. A further analysis for varying cutoff values makes 
it clear that each combination has the different degree of robustness in causing LHRs 
movements, and the “strongest” combination in four types are GD, namely, higher gross 
national income AND linguistic diversity. This analysis of robustness of a given combination of 
independent variables reduce the instability of Boolean approach which might have been 
caused by the small number of relevant cases under consideration. 

Indeed, much remains to be done in future studies. For example, we should find 
hidden conditions that might cause the LHRs movements and bring these conditions 
explicitly into the analysis or alternate the existing variables. For example, the economic 
status of a language minority group among society at large is one such candidate 
conditions. We paid much attention on a level of gross national income of a society at 
large where minority groups are living. However, the higher or lower economic status of 
a language minority group might give different affect occurrence of LHRs movements, 
given a certain level of income at a national level. Even within a high income country, as 
we noted earlier, those minority people who barely survive economically may conform 
themselves to majority codes including language code without initiating a movement. In 
order to capture and create the relative economic status of a minority group within a 
society at large as an explicit variable, further information is required at the initial stage of 
data creation. 


Notes 


1. This is a reformulated version by Skutnabb-Kangas and Phillipson (1994). The original 
formulation by Skutnabb-Kangas had “mother tongue” rather than “language” in the first 
three points. 


2. As for the cases of “movement for self-determination,” we did not include the case which 
occurred before the present regime in each country. That is, the case such as the 
independence movement in the colonial period was excluded. 
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3. Charles C. Ragin has analyzed determinants of ethnic political mobilization in Western 
Europe by a Boolean approach (Ragin 1987:133-49). His study was of importance to us 
in our research. 


4. Francesco Capotorti (1979) studied laws affecting minorities of each country. This study 
was done for the United Nations. 


5. Asimilar movement can be witnessed even in various places in France, Italy and Spain 
such as Corsica, Basque, Alsace and Bretagne. 
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Welcome to World Peace 


Neil Englehart, Bowling Green State University 
Charles Kurzman, University of North Carolina, Chapel Hill 





Why study terrorism? To many the answer might seem all too obvious, but in fact terrorist 
attacks are relatively rare events that kill relatively few people — far fewer than civil wars, 
homicides or traffic accidents. The social disruptions generated by terrorist attacks are largely 
due to the disproportionate attention we devote to them. Indeed, a truly dispassionate cost- 
benefit calculation might conclude that in a world of big problems and limited resources to 
address them, the best response to terrorism could be to ignore it. 

So why devote a special section of Socia/ Forces to the issue? We propose three related 
reasons to take the study of terrorism seriously. One is that the form of organized, politically 
motivated violence that has long interested activists and scholars the most — inter-state war 
— is in decline. Students of political violence are thus shifting their attention to non-state 
actors, who assume a new importance in an era of relative peace and stability. Second, non- 
state actors have returned over the past quarter century to terrorist methods, using seemingly 
random violence and the pulpit afforded by global media coverage to press their political 
agendas. Third, terrorism has become such a potent political issue in many countries that the 
academic study of terrorism has important ramifications for electoral competition and public 


policy. 


World Peace and Non-State Violence 


Hard as it may be to believe, given constant headlines about conflict in Iraq and elsewhere, 
we live in a time of world peace. The ancient scourge of war has disappeared, at least in the 
sense of governments fighting one another with armies. As of this writing in early 2006, there 
are no wars between states, and there have been none since 2003, when the United States 
and its allies removed Saddam Hussein's government in Iraq, and India and Pakistan instituted 
a ceasefire. 

The present interlude of world peace is the culmination of a steady decline in interstate 
war in the aftermath of the Cold War. Since World War II, the number of interstate wars with 
20 or more deaths per year rose to nine in 1987, then dropped to one in 2002, two in 2003, 
and zero in 2004, according to the International Peace Research Institute, Oslo (PRIO) (see 
Figure 1).' This trend has not gone unnoticed among scholars of international relations, who 
have engaged in a lively debate about the decline of war and its causes. It is noteworthy that 
their analyses often involve factors that are unlikely to be reversed in the near future: 
increased world trade, the spread of democracy, growing international norms against war, a 
set of international institutions that mediate disputes and discourage conflict, and the 
emergence of a single hegemonic military power (Keohane and Nye 1977; Doyle 1983; 
Mueller 1989; Jervis 1991; Ruggie 1998). If any of these explanations are correct, we may not 
see a sudden increase in the incidence of interstate war anytime soon. 

Equally dramatic is the decline in the number of civil wars, which rose steadily throughout 
the Cold War, peaked in 1991 at 51, and then dropped to 30 in the last year of the data, 2002, 
according to PRIO data (Figure 1). Analogous trends are visible in other counts (Fearon and 
Laitin 2003; Marshall and Gurr 2005). This trend has not been well explained, or even widely 
noted. An uptick in civil wars after the end of the Cold War led many analysts to conclude that 
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Figure 1. Wars, 1946-2002 
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Legend: Conflicts with 20 or more war-related deaths per year. Civil wars defined as conflicts with one state party (hollow rectangles); interstate wars 
defined as conflicts with two or more state parties (solid ovals). Source: International Peace Research Institute, Oslo (2005). 
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we were entering an era of increased intra-state violence (Holsti 1996; Kaplan 2000). Yet this 
gloomy prediction appears to have been premature: unlike interstate war, civil wars have not 
disappeared entirely, but they are in decline. 

The reduction of war has been particularly pronounced in the “Western world,” where 
military-related fatalities are far fewer in number than other causes of death (Mueller 2004b). 
According to mortality statistics from the World Health Organization, war operations in the 78 
countries for which complete data is available resulted in only 206 deaths per year on average 
during the period 1980-1995. Homicides, by contrast, averaged 71,100 deaths per year in the 
same countries (World Heath Organization 2005). Furthermore, deaths from military 
operations are trending downwards, while homicides are trending up: in 1995 these countries 
reported only 59 deaths from military operations but 82,805 deaths from homicide. Similar 
points could be made about automobile accidents, preventable diseases and so on. Our 
interest in war, therefore, is not due the number of people killed. 

Nonetheless, we remain fixated on war. The United Nations General Assembly recently 
passed a resolution calling on member states to respect the traditional Olympic Truce during 
the Winter Games in Turin, disregarding the fact that none of the member states were actually 
fighting with one another (United Nations General Assembly 2005). Scholars, journalists, 
activists and politicians seem to focus more attention on war than on other global challenges, 
even as war has declined in number and importance. Certainly this is due in part to the 
bellicose campaigns of jihad, the war on terror and the high-profile clashes between them. 
In addition, we may be suffering from a form of cultural lag: we have not yet digested and 
adapted our thinking to the recent outbreak of world peace. Perhaps, also, we focus on war 
because it seems relatively easy to address: it involves only a small number of decision- 
makers, whose orders to fight or stand down have a disproportionate impact on the amount 
of violence in the world. Small-scale, autonomous fighting units are harder to study and 
harder to influence than the centralized bureaucracies of interstate conflict. 

But small-scale, autonomous fighting units are the main forms of collective political 
violence that the world faces after the end of war. In place of contests between states or 
state-builders, the remnants of war “are more nearly opportunistic predation waged by packs, 
often remarkably small ones, of criminals, bandits, and thugs.” (Mueller 2004a:1) Terrorism, 
human rights abuse by local bosses and warlords, attacks and oppression by militias and 
paramilitaries are the problems now, rather than the concentrated destruction of war. There is 
little systematic information on non-state political violence, a symptom of its sprawling nature, 
its practitioners’ desire for secrecy (with the exception of media-oriented terrorism), and its 
relative neglect by scholars. It is thus difficult to establish trends in small-scale political 
violence (see also, Mack 2005). 

What partial information we have suggests that non-state political violence has increased 
in recent years, not just relative to war but also in absolute terms. The most extensive data- 
gathering on non-state political violence relates, as one might expect, to international 
terrorism. While definitions of “international” and “terrorism” may be contested, Figure 2 
shows that the trend in two major datasets — one from the ITERATE project (Mickolus et al. 
2003), as reported by Robison, Crenshaw and Jenkins in this issue, and the other compiled by 
the RAND Corporation and made available by the National Memorial Institute for the 
Prevention of Terrorism (Terrorist Knowledge Base 2006) — appears to be consistent: a 
downturn after the end of the Cold War, followed by an upturn after 2000. The Terrorism 
Knowledge Base also contains partial data on domestic terrorist incidents since 1998, which 
show a huge leap in the number of attacks in the early 2000s, from approximately 1,000 
incidents per year in 1998-2000 to almost 4,000 incidents in 2005, nearly half of them in Iraq. 
Terrorism is not a new tactic — in the modern period, it dates back to the 19" century — but in 
recent years it has re-entered the repertoire of radical political groups, in particular radical 
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Islamist groups. To paraphrase Clausewitz's aphorism about war, terrorism is war conducted 
by other means. 


Non-State Violence and the Study of Terrorism 


in the torrent of attention that terrorism has attracted since September 2001, the field of 
terrorism studies has to a large degree recapitulated, in an accelerated manner, the 
progression of related fields such as the study of social movements and civil wars. Both of 
these neighboring fields underwent parallel shifts over the past quarter century, in reaction 
to the leading theories of the preceding era. Social movement studies emerged during this 
period as a critique of the “collective behavior” approach to the study of collective action, 
which had emphasized the combination of large-scale social dislocations and individual 
psychological characteristics. As activists of the 1960s generation entered academia and 
took up the study of social movements, they built a more sympathetic theoretical approach 
that treated protest as rational political engagement (Kurzman 2004). In the study of civil war, 
too, older studies emphasizing large-scale social change were replaced by neorealist 
approaches that were hardly more sympathetic to their subjects than earlier scholars had 
been, but that treated civil war in terms of rational calculations and security dilemmas (David 
1997: Sambanis 2002). The rational actor approach always had its critics, but these 
challenges began to accumulate at the turn of the millennium, a somewhat belated echo of 
the cultural turn that affected many other aspects of the social sciences in the preceding 
decade. In social movement studies, a significant number of scholars started to insist on the 
importance of cultural, ideological and other intangible elements of protest, while attempting 
to avoid a return to the large-scale generalizations of the 1960s and before (Goodwin and 
Jasper 2004). In the study of civil war, as well, a somewhat smaller number of scholars have 
begun to question the rational choice assumptions and methods that dominate the 
mainstream of their field (Mack 2002; Human Security Centre 2004). 

Since 9/11, the study of terrorism has undergone a forced march through these 
intellectual generations. Explanations focusing on large-scale social dislocations and 
grievances dominated the most influential works to appear in this period, building on 
several decades of work in this area. Within two years, rational-choice analyses of terrorism 
began to appear, culminating in the books by Pape (2005) and Bloom (2005), which explore 
the strategies that may have been adopted by terrorist organizations. Other rational choice 
analyses wrestled with the individual act of self-destruction, as enacted in terrorist suicide 
attacks, which would seem to provide vivid counterexamples to the principle of preference- 
maximizing action, unless death and the conditions of the afterlife are allowed to enter into 
the calculation. 

The papers in this special section contribute to an emerging third generation within the 
study of terrorism, analogous to recent trends in the study of social movements and civil 
war. These studies aim to temper rational choice approaches by focusing on context. In 
each of these studies, the emphasis is less on universalistic mechanisms involving rational 
preference-seeking behavior, but rather on the formation and activation of preferences. The 
goal is not to deny the presumption of rationality to anybody, or to return to an era of grand 
generalizations about socio-cultural characteristics of various populations, but rather to 
examine local and cross-national variation in preferences, perceptions and strategies. As it 
happens, the four papers in this special section focus on four distinct contexts for decisions 
to engage in terrorist activities: socio-political, ideological, local network and motivational. 

Jeff Goodwin's paper offers a social-scientific theory of terrorism that emphasizes the 
socio-political context for terrorists’ strategic decisions. Goodwin's paper argues that 
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terrorism is a form of revolution, and asks why some revolutionary movements take this 
form while others do not. Building on recent work by Black (2004) and others, Goodwin 
suggests that key factors in a political campaign's turn to terrorism — “categorical terrorism,” 
indiscriminate attacks on a civilian population — include the social distance that potential 
terrorists feel from their potential targets, the complicity they consider the targets to have 
with the government and its policies, and the perceived likelihood that the targets could 
never be convinced to side with the revolutionary movement. This approach suggests that 
the choice of terrorist tactics is in large part a function of how insurgents perceive their 
enemies. These perceptions cannot be deduced in the abstract or generalized across all of 
humanity: they must be studied case by case in order to be understood. 

The paper by Kristopher K. Robison, Edward M. Crenshaw and J. Craig Jenkins explores 
the ideological context of international terrorism over the past generation. The paper takes 
an updated version of the well-known ITERATE dataset of transnational terrorist attacks and 
codes the ideology associated with each attack, focusing for present purposes on leftist 
and Islamist organizations. The paper finds that ideology matters: the trends for leftist and 
Islamist attacks head in different directions over the past quarter century, with leftist 
terrorism diminishing after the end of the Cold War and Islamist terrorism continuing on a 
roller-coaster pattern, with the new millennium marking a renewed upswing. The paper's 
multivariate analysis concludes that some of the large-scale socio-economic factors that 
were emphasized in the earliest generation of terrorism studies affect the two ideological 
trends differently: urbanization, for example, is associated with Islamist terrorism but not 
with leftist terrorism. These results imply that terrorist organizations may provide an 
ideological lens through which large-scale causation is filtered. 

Ami Pedahzur and Arie Perliger’s paper describes the local network context of a 
particular form of terrorism (Suicide attacks against civilians) in a particular location 
(Israel/Palestine). Using Israeli media and other public sources, Pedahzur and Perliger are 
able to map out the friendship, familial and other relations among individuals in four suicide 
attack networks. The paper argues that there are at least three distinct contexts in which 
networks matter. First, a political context: the network structures appear to evolve in 
response to increased pressure by the Israeli government, becoming less hierarchical, less 
homogenous and more resilient. Second, a neighborhood context: networks compete with 
one another for preeminence, in keeping with the one-upmanship strategy hypothesized by 
Bloom (2005), but on a local, more personal scale. Third, an internal network context: 
suicide bombers appear to have peripheral positions in the attack networks, relatively 
unconnected from other participants. The implication of this work is that these terrorist 
attacks — and by extension others as well? — were the product of small-scale vortices of 
politics and personal relations. 

Finally, the paper by Robert J. Brym and Bader Araj emphasizes the motivational context 
of Palestinian suicide terrorists. Drawing on the final testaments of the suicide bombers 
and statements by the organizations that claimed responsibility for the attacks, as reported 
in Palestinian newspapers, the paper argues that this form of terrorism is best understood 
as retaliation against Israeli government attacks. The bombers and their backers express 
this motivation frequently, and the timing of the attacks is correlated with Israeli 
assassinations and other Palestinian casualties. The paper takes on the rational choice 
perspective directly, arguing that suicide terrorism may not be strategically sound in an 
abstract cost-benefit sense, given the heightened level of repression that ensues from each 
attack. Rather, terrorism seems to grow out of a cultural logic of retaliation — a cultural logic 
that is familiar to its enemies, who engage in a similar logic themselves, generating a 
seemingly endless cycle of revenge. 
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Future Directions 


Where do we go from here? These papers collectively suggest it will be most fruitful to study 
terrorism, and non-state violence in general, with a multi-faceted approach that ties together 
individual motivations and capabilities, social and organizational networks, and the larger 
social and global context in which groups operate. This is a tall order, but non-state violence 
is intrinsically more difficult to study than state violence or civil war between well-defined 
sides, and fruitful approaches will necessarily need to integrate micro, macro and meso-levels 
of analysis. 


The Micro-level: Individual Motivations 


Studies of the psychological characteristics of individual terrorists have found little 
commonality among their personality types and motivations (Sageman 2004; Victoroff 2005). 
The Pedahzur and Perliger paper implies that this diversity may reflect the variety of positions 
within terrorist networks. The motives, goals and strategies of the focal leaders of these 
networks may differ from those of the suicide bombers, who occupy marginal positions 
within the networks. 

The network leaders may choose terrorist actions on the basis of strategic calculations, as 
suggested by the Goodwin and the Brym and Araj papers, or on ideological grounds, as 
suggested by Robison, Crenshaw and Jenkins. As both Pedahzur and Perliger and Brym and 
Araj suggest, terrorist leaders are playing a two-stage game in which they simultaneously 
seek to score points against their opponents, and to win support from the aggrieved 
population they purport to represent. Bombers, however, and others in non-leadership roles 
may have quite different motivations. 

The motivations of individuals in terrorist organizations are thus likely to differ according to 
their role. The same is likely to be true for other non-state violent actors, such as militias, 
mercenaries or criminals. Recognizing the different motivations of individual actors will be 
essential to understanding and addressing non-state violence. 


The Macro-level: The Post-Cold War World 


Numerous macro-level changes have conspired to create increased awareness of non-state 
violence, and possibly to increase its incidence. The end of the Cold War has certainly 
changed the ideological context of non-state violence. The decline of interstate war itself 
may have stimulated greater non-state violence. In addition, the proliferation of failing states 
has changed the post-Cold War international system in ways that foster non-state violence 
of all kinds. 

The end of the Cold War, the rise of United States hegemony and globalization have 
altered the ideological spectrum and hence the motivations and justifications for non-state 
violence. This is most clear in the Robison, Crenshaw and Jenkins paper, which tracks 
variation in leftist and Islamist terrorism over time. Terrorist organizations and other non-state 
violent actors filter these changes, although not in a simple linear pattern. The decline of leftist 
terrorism is clearly related to the decline of state socialism, but the pattern for Islamist terror 
is more complex. Other macro-level factors, including changes in the geopolitical 
environment, may be at work here. 

The decline of war itself may feed non-state violence. There are at least four mechanisms 


by which this might happen: 
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1. Downsizing: Since the end of the Cold War, military establishments 
around the world have declined in size, with few exceptions. The rapid 
reduction of national military forces led to a large pool of skilled labor in 
the private sector, trained and willing to use violence for hire (Singer 
2003; Avant 2005). The impact of these private military companies has 
been particularly pronounced in the case of Africa, where demobilized 
South African forces reformed into mercenary companies have played 
a major role in variously stabilizing or destabilizing weak states (Cilliers 
and Mason 1999; Musah and Fayemi 2000). 


2. Liquidation: Downsizing militaries have also reduced their weapons 
stocks, nearly always by selling them off. This has led to a proliferation 
of weapons in many countries. Of particular concern are military small 
arms, which have led to a bloody “democratization of violence” in 
many places, and powerful explosives useful for terrorist operations. 


3. Reduction of Foreign Aid: Declining aid receipts after the end of the 
Cold War have weakened some governments, making it more difficult 
for them to control both opposition movement and _ peripheral 
members of their own regime. This may lead to an increase in 
challenges to the state, but also it may lead to the consolidation of 
petty fiefdoms by local bosses and warlords, and it may reduce the 
state’s capacity to regulate the activities of private organizations, 
including terrorists and criminals. 


4. Security: \ronically, international peace itself may help create 
conditions that increase non-state violence. Relative security from 
international threats may embolden opposition groups to challenge 
governments. It may also lead to the reduction of military 
establishments, generating military downsizing and liquidation of military 
assets, reducing the capacity of states to control the use of violence 
within their borders. It has long been recognized that weak states are 
more susceptible to civil war, because rebel leaders feel they have a 
greater chance of success (Fearon and Laitin 2003), and these chances 
go up if a state reduces its military. 


Some of these mechanisms are short-term disturbances, but they may have the potential 
to become self-sustaining. For example, the sale of surplus arms stocks has generated valuable 
foreign exchange for Russia, stimulating the arms industry to produce more weapons for export. 
This further proliferates the small arms used by violent non-state actors and weakens vulnerable 
states. Similarly, the downsizing of military forces generated a pool of skilled labor for private 
security firms. The growth of lucrative positions in these firms may then alter career paths in 
national militaries, encouraging early retirement of skilled personnel into the private sector. This 
could perpetuate the labor pool feeding these firms, further weakening state control over the 
use of force. In turn this dynamic could ensure continuing demand for private security services. 

At least as important to the changing international context for non-state violence is the 
presence of a large number of failing states in the international system. It is debatable whether 
this is really a new phenomenon, but during the Cold War it was kept in check to some degree 


by both superpowers’ interest in stability. After the Cold War many failing states have been 
permitted to fail. 
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Failing states are equally a risk factor for non-state violence that is not aimed at overthrowing 
the government. States are the primary means we rely on to control violence. We expect states 
to use militaries to defend their citizens against external threats and police to defend against 
internal ones. In states that cannot perform these functions, the vacuum is filled by non-state 
actors. Domestically, militias or gangs may exert de facto control and become the primary 
organizations that people look to for security — often at a high price. Internationally, failing states 
are the primary clients of mercenary firms and convenient transit points for criminal networks. 
For terrorists, they can become safe havens and host important facilities such as training camps. 
The case of al-Qa'ida in Afghanistan is well known, but one might equally point to Lebanon in the 
1980s or Gaza under the period of al-Fatah authority. 

With the end of the Cold War, it appears that many weak states have gotten weaker. We have 
seen total collapses of central government authority in Afghanistan and Somalia, and a number 
of other countries — Congo, Liberia, Sierra Leone, Tajikistan — have come close. These failing 
states provide a congenial environment within which violent non-state actors can organize. 


The Meso Level 


The papers collected here demonstrate the importance of looking at how the micro- and macro- 
levels intersect at the meso level, the level of the organization or network. Non-state political 
violence is not an individual phenomenon: terrorists, as well as militias, mercenaries, criminals 
and other non-state actors, rarely act alone. Non-state violence requires organization to be 
effective, bringing together specialists in cooperative action to achieve their goals. Suicide 
bombers, for example, need more than a grievance and a death wish: they also need a bomb, 
a target, training, encouragement and reassurance that loved ones will be cared for. This means 
that a suicide bomber also needs a bombmaker, a planner, a trainer, a leader and a network of 
financial supporters. It takes a network, if not quite a village. Other users of non-state violence 
are similar, in that they require organization to be effective. Evidence from South Asia suggests 
that even seemingly spontaneous riots are the result of planning and organization, often by 
professional freelance riot promoters (Brass 1997). 

The meso level appears to be a particularly fruitful direction for future studies of terrorism 
and other forms of non-state violence: how groups generate and mobilize grievances in a 
changing international environment to produce collective violence (or not). Ideology and 
grievances do not by themselves produce violence. There are many hatreds in the world, both 
ancient and new, that could motivate violence, but only a minority of them actually do so. It is 
only when they are mobilized by groups that they become associated with violence. The 
question in explaining collective violence is not why it erupted in any particular case, but rather 
in each case, what was it that led to violence when in many other, similar cases it did not. The 
role of organizations and networks on both sides of the conflict is crucial to answering this 
question. 

The study of terrorist organizations and other non-state violent actors will be intrinsically 
more difficult than the study of war. The dominant approaches to the study of conflict between 
states paid relatively little attention to the internal dynamics of states, despite the existence of 
several extraordinary studies of the internal dynamics of states in conflict, such as Snyder 
(1984) and Allison and Zelikow (1999). The most prominent approach to inter-state conflict, 
Realism, explicitly eschewed domestic politics, treating states as “black boxes” whose actions 
could be understood entirely on the basis of their military and economic resources and their 
place within the international system (Waltz 1979). The primary opposing theory, Liberalism, 
took into account whether or not states were democracies, but also concerned the structure 
of the system more than the particular characteristics of individual states. 
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This neglect of the organizational characteristics of states may have been possible in part 
because states in the late 20th century have become relatively homogenous units (Meyer et al. 
1997). Violent non-state actors are not. Terrorist organization, militias, criminal networks, 
mercenary firms and other violent non-state actors vary enormously. In order to adequately 
understand them, it will be important to take those differences seriously, in order to understand 
how and when they generate grievances and violence. 


Note 


1. This count differs slightly from PRIO’s own published tally (Harbom and Wallensteen 
2005), which includes an additional category of “internationalized” conflicts. 


References 


Allison, Graham, and Philip Zelikow. 1999. Essence of Decision: Explaining the Cuban Missile Crisis, 2°° 
edition. Longman. 


Avant, Deborah D. 2005. The Market for Force: The Consequences of Privatizing Security. Cambridge 
University Press. 


Black, Donald. 2004. “The Geometry of Terrorism.” Sociological Theory 22:14-25. 
Bloom, Mia. 2005. Dying to Kill: The Allure of Suicide Terror. Columbia University Press 


Brass, Paul R. 1997. Theft of an Idol: Text and Context in the Representation of Collective Violence. Princeton 
University Press. 


Cilliers, Jakkie, and Peggy Mason. 1999. Peace, Profit or Plunder? The Privatisation of Security in Warlorn 
Africa. Pretoria, South Africa: Institute for Strategic Studies. 


David, Steven R. 1997. “Internal War: Causes and Cures.” World Politics 49:552-76. 


Doyle. Michael W. 1983. “Kant, Liberal Legacies and Foreign Affairs.” Philosophy and Public Affairs 12: 205- 
35, 323-53. 


Fearon, James D., and David D. Laitin. 2008. “Ethnicity, Insurgency, and Civil War.” American Political Science 
Review 97:75-90. 


Goodwin, Jeff, and James Jasper, editors. 2004. Rethinking Social Movements. Rowman & Littlefield. 


Harbom, Lotta, and Peter Wallensteen. 2005. “Armed Conflict and Its International Dimensions, 1946-2004." 
Journal of Peace Research 42:623-35. 


Holsti, Kalevi J. 1996. The State, War, and the State of War. Cambridge University Press. 


Human Security Centre 2004. “Understanding Civil War: Quantity Versus Quality. Towards More Effective 
Collaboration Between the Quantitative and Qualitative Conflict Research Communities.” 
Conference Report, Bellagio, Italy, April 5-7. Human Security Centre, University of British Columbia, 
Vancouver, Canada. 


International Peace Research Institute, Oslo (PRIO). 2005. 7he PR/IO/Uppsala Armed Conflict Dataset: Armed 
Conflict, Version 3-2005b. Oslo, Norway: Centre for the Study of Civil War at the International Peace 


Research Institute, Oslo (PRIO); and Uppsala, Sweden: Department of Peace and Conflict 
Research, Uppsala University. 


Jervis, Robert. 1991. “The Future of World Politics: Will It Resemble the Past?” /nternational Security 16: 39-73. 


Welcome to World Peace * 1967 


Kaplan, Robert D. 2000. 7he Coming Anarchy: Shattering the Dreams of the post-Cold War. Random House. 


Keohane, Robert O., and Joseph S. Nye. 1977. Power and Interdependence: World Politics in Transition. 
Little, Brown. 


Kurzman, Charles. 2004. “Social Movement Theory and Islamic Studies.” Pp. 289-303. /slamic Activism: A 
Social Movement Theory Approach. Quintan Wiktorowicz, editor. Indiana University Press. 


Mack, Andrew. 2002. “Civil War: Academic Research and the Policy Community.” Journal of Peace Research 
39:515-25. 


editor, 2005. Human Security Report 2005: War and Peace in the 21* Century. Vancouver, Canada: 
Human Security Centre, University of British Columbia. 


Marshall, Monty G., and Ted Robert Gurr. 2005. Peace and Conflict 2005: A Global Survey of Armed Conflicts, 
SelFDetermination Movements, and Democracy. University of Maryland, College Park: Center for 
International Development & Conflict Management. 


Meyer, John W.,, John Boli, George M. Thomas and Francisco O. Ramirez. 1997. “World Society and the 
Nation-State.” American Journal of Sociology 103:144-81. 


Mickolus, Edward, Todd Sandler, Jean M. Murdock and Peter A. Flemming. 2003. /nternational Terrorism: 
Attributes of Terrorist Events (ITERATE) (07/27/04 version). Vineyard Software. 


Mueller, John. 1988. Retreat from Doomsday: The Obsolescence of Major War. Basic Books. 
. 2004a. The Remnants of War." Cornell University Press. 
. 2004b. “Why Isn't There More Violence?” Security Studies 13:191-203. 


Musah, Abdel-Fatou, and J. Kayode Fayemi, editors. 2000. Mercenaries: An African Security Dilemma. Pluto 
Press. 


Pape, Robert A. 2005. Dying to Win: The Strategic Logic of Suicide Terrorism. Random House. 


Ruggie, John Gerard. 1998. Constructing the World Polity: Essays on International Institutionalization. 
Routledge. 


Sageman, Marc. 2004. Understanding Terror Networks. University of Pennsylvania Press. 


Sambanis, Nicholas. 2002. “A Review of Recent Advances and Future Directions in the Quantitative 
Literature on Civil War.” Defence and Peace Economics 13:215-43. 


Singer, PW. 2003. Corporate Warriors: The Rise of the Privatized Military Industry. Cornell University Press. 


Snyder, Jack. 1984. The /deology of the Offensive: Military Decision Making and the Disasters of 1914. 
Cornell University Press. 


Terrorism Knowledge Base. 2006. Tulsa, OK: National Memorial Institute for the Prevention of Terrorism, 
http://www.tkb.org. 


United Nations General Assembly. 2005. “General Assembly Urges States to Observe Olympic Truce During 
Upcoming Winter Games in Italy, Use Sport as Too! for Development.” New York: United Nations, 
GA/10415. 


Victoroff, Jeff. 2005. “The Mind of the Terrorist: A Review and Critique of Psychological Approaches.” Journal 
of Conflict Resolution 49:3-42. 


Waltz, Kenneth. 1979. Theory of International Relations. Addison-Wesley. 


World Health Organization. 2005. WHO Mortality Database. Geneva, Switzerland: WHO. Statistical 
Information System, http:/Mvww.who.intAwhosis. 


; i oe a eee oe ae 
TNE © Ge WOM EAN ierieke 4 Sage” eee 
\ H : 


, & 
ey. 
_ 


aM ttl bees 1 vari, oe ee se Faiedieel 
ees Cy a ee RRA 
npn arts ph CR SOE RS MR a 
. ' new aay a 
ay wins aj, ae 
oem iy 
_ 1) 



























aaa: 


i 


¥ 


Porter So “ewe, tl meiitie aoe vreb re eee 
tegs iy al Uitied via! ama (Parnes Petree ere 7 


ettieel nes, een ced ee i aan 7 


a x 
WA mucow «eer yO, vege?! ee NL ee satiate Nix ibe. © 


bea rea, Abia eS A ee 
sayin: aie ite Pieris, © wiht Sage ibe Pe a ARERR Dior Be Y 7 


Se ee eed fees Geert OR va ts iene . 
ie yocnts "hoe 2 inher er veedral Y o 


2 ae ; 

a yu Spee Lee YL!) feet 25 oe: OO abcmend? 14 ap Gos tytat A wk 

aa ; (a, Lat “hy pcre ara NENA PnP a, I : 
8 uns 3 ee 

ieziuvwes Cred ton FOU =e 5. Aw" neni We, ee DT rhe “pew 


re. * ROS i etrys Put Ha WN oye eas 
be 5 <0 ot ahs Grae vues ce Bt eS 
esp Raueedd age CG ogni ras har ih cesta Shes 


* 5 
ev 7s . pore) ne y ‘i he yi 


cre dy Bi seals: ate sacmwnry avo Averabt Om wate OE. * 


e 


a 9 » ‘i bd ? . agen = « ~ i ie 
tra” eran oo hit ap ae eS Os Ed ES S irenlaboal the ie 


es : . —— ry 
rr. : apa a en a. nic ane —) om acorns 


Atha dintay Pheer ee ews o Moab Re are AAR GRE ena tel; 
repent, 


sw te. fa a J 7 mea) 7 4 Pe aa nee 
2) je ger eter yee! en, ae oat tea 


ae hamaaiea ear Nae mnie lh wnat Sy ae ie Saar tbh 
: j fait ARUN sae | oh ee ae ay’ 


| neat Ra ath MGS i se Pop RD ng al 
- be: ae a era RE: ee) 0 bei 


Pag! 


Tygt ev tere " aD 7 o 
<ameving cote Whar anol #2 
i .* 
ey (G ru ans 2 ee rer 
gil aloe aes one: ra i) * eh = aid 
par ihe! Dig gat eae 


- 


mAs 


ee ee eee ee ee 
Shae oon ae a 


Ken tod QO) (no ae, Bann 
< —— a a tg (OR 
_ 7 


Poser De ClO 


4 


va erry Low; 
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Abstract 

Social scientists have explained the rise of suicide bombing since the early 1980s by focusing 
on the characteristics of suicide bombers, the cultural matrix in which they operate, and 
the strategic calculations they make to maximize their gains. We offer an alternative 
approach that emphasizes the interaction between Palestinian suicide bombings and Israeli 
government actions, analyzing the motivations, organizational rationales and precipitants 
for the 138 suicide bombings that took place in Israel, the West Bank and Gaza between 
October 2000 and July 2005. Using several sources, including Arabic newspapers, we find 
that much of the impetus for Palestinian suicide bombing can be explained by the desire 
to retaliate against Israeli killings of Palestinians; and that much of the impetus for Israeli 
killings of Palestinians can be explained by the desire to retaliate for suicide bombings. 


When men are angry, they commonly act out of revenge, and not 
ambition. 
— Aristotle, Politics 


Oppressed people cannot always be expected to behave in a 
reasonable manner. 
— Sheikh Muhammad Hussein Fadlallah, 
spiritual founder of Hizballah 


Explaining Suicide Bombing 


In April 1993 a suicide bombing took place in the Jordan Valley settlement of Mekhola. It was 
the first of 20 such attacks over the next four years in Israel, the West Bank and Gaza. 
Between 1993 and 1997 suicide bombers were responsible for the deaths of 175 people 
(including 21 suicide bombers) and the injury of 928 others. A second and more lethal wave 
of suicide bombings began on Oct. 26, 2000. By July 12, 2005, suicide bombers were 
responsible for the deaths of an additional 657 people (including 148 suicide bombers) and 
the injury of 3,682 others. Table 1 outlines the annual toll and the grim totals. In the early years 
of the 21% century, Israel, the West Bank and Gaza became the region of the world with the 
highest frequency of — and the highest per capita death toll due to — suicide bombing. 
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Table 1: Suicide Bombings, Deaths and Injuries, 1993-97 and 2000-05 


ES 


Attacks Deaths Injuries 
1993 1 2 7 
1994 4 46 118 
1995 4 36 211 
1996 a 61 185 
1997 6 30 407 
Subtotal, 1994-97 20 175 928 
2000 3 3 6 
2001 31 109 1001 
2002 om 284 1575 
2003 26 169 641 
2004 18 81 316 
2005 (to 12 July) 3 11 143 
Subtotal, 2000-05 138 657 3,682 
Total 158 832 4,610 





Sources: al-Quds (2000-05); al-Quds al-‘Arabi (2000-05); International Policy 
Institute for Counter-Terrorism (2004); Israeli Ministry of Foreign Affairs (2004); 
Johnston (2003); New York Times (2005). 


Social scientists have sought to explain the growing incidence of suicide bombing in the 
past quarter of a century by focusing on the alleged psychopathology of suicide bombers, the 
deprivations they supposedly experience, the religious mi/ieux from which they presumably 
originate, and the degree to which suicide bombing serves their strategic interests. Let us 
critically examine each of these arguments. 


Psychopathology 


In the late 19" century social scientists first proposed that an irrational or pathological state of 
mind typically precipitates collective violence (Le Bon 1969 [1895]: 28). Since then, that idea 
seems to have been repeated in the aftermath of nearly every major eruption of collective 
violence against authority, suicide bombings included (Reich 1990). For example, immediately 
following the suicide bombings of American and French military barracks in Beirut in 1983, 
psychologists characterized the bombers as “unstable individuals with a death wish,” although 
they lacked any evidence, clinical or anecdotal, of the bombers’ state of mind (Perina 2002). 
Similarly, following the Sept. 11, 2001 suicide attacks on the United States, U.S. government 
and media interpretations underscored the supposed irrationality and even outright insanity of 
the bombers, again without the benefit of corroborating data (Atran 2003: 1535-6). 

Despite such claims, interviews with prospective suicide bombers and reconstructions of 
the biographies of successful suicide bombers do not suggest a higher rate of 
psychopathology than in the general population (Davis 2003; Reuter 2004; Stern 2003; Victor 
2003). A recent study of all 462 suicide bombers who attacked targets worldwide between 
1980 and 2003 found not a single case of psychopathology (depression, psychosis, past 
suicide attempts, etc.) among them and only one case (a Chechen female) of probable mental 
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retardation (Pape 2005: 210). Evidence collected by other experts suggests that “recruits who 
display signs of pathological behaviour are automatically weeded out for reasons of 
organizational security.” (Taarnby 2003: 18) It seems reasonable to conclude that individualistic 
explanations based on psychopathology are of no value in helping us understand the rising 
incidence of suicide bombing. 


Deprivation 


Another explanation of suicide bombing focuses on the extraordinary deprivations, either 
absolute or relative, that suicide bombers supposedly suffer. Absolute deprivation refers to 
longstanding poverty and unemployment, relative deprivation to the growth of an intolerable 
gap between expectations and rewards (Gurr 1970; Moore 2003). Presumably, deprivation of 
one sort or the other frustrates some categories of people until they are driven to commit self- 
immolating acts of aggression against the perceived source of their suffering. 

Evidence does not support the deprivation theory. One researcher found education and 
income data on about 30 percent of Arab suicide bombers between 1980 and 2003. He 
reported that they were much better educated than the populations from which they were 
recruited. They were typically from the working and middle classes and were seldom 
unemployed or poor (Pape 2005: 213-15). Another scholar discovered that suicide bombers 
from Egypt and Saudi Arabia have come mainly from middle- or upper-middle-class families 
(Laqueur 2004: 16). As is well known, the perpetrators of the Sept. 11, 2001 attacks on the 
United States were well educated, middle-class men. Such evidence lends no credence to 
the notion that suicide bombers are especially deprived in any absolute sense. 

Arguments about relative deprivation are purely speculative. To date, no researcher has 
measured the degree to which suicide bombers are relatively deprived and compared their 
level of relative deprivation with that of non-suicide bombers. However, efforts to construct 
socio-demographic profiles of suicide bombers are of some relevance to the theory. In the 
1990s profilers discovered that suicide bombers in Israel, the West Bank and Gaza tended to 
be single, socially marginal, unemployed or underemployed men in their mid-20s with a deep 
attachment to Islamic fundamentalism — just the sort of individuals some people might expect 
to experience deprivation most keenly and react to it most violently. Israeli authorities 
subsequently used this information to disrupt suicide operations and prevent individuals 
matching the profile from entering the country. Insurgent organizations then started recruiting 
university graduates, married men, people with a secular background, women and youths as 
suicide bombers. For example, our data show that between 2000 and 2004, the percentage 
of female Palestinian suicide bombers rose from zero to 12 percent of the total. As the socio- 
demographic profile of suicide bombers came to resemble that of the Palestinian population 
as a whole, it became more difficult to sustain the hypothesis that suicide bombers are 
especially relatively deprived. And in fact the consensus in the literature today is that there are 
no ideal typical suicide bombers whose socio-demographic characteristics lead them to 
experience extraordinarily high levels of deprivation, either relative or absolute (Stern 2003: 
50-2; Taarnby 2003: 10-12; Victor 2003). 


Culture 


The idea that culture, especially religion, influences behavior net of other causes dates back 
to Max Weber's classic essay on the subject (Weber 1958 [1904-5)). Accordingly, in the late 
1980s, some social scientists began to attribute much of the collective violence in the world 
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to a “clash of civilizations” between Islam and the West (for syntheses, see Huntington 1996 
and Lewis 2002: and for a sustained critique, Hunter 1998). From their point of view, Islamic 
culture inclines Muslims to fanatic hatred of the West, violence, and in the extreme case, 
suicide attacks. For example, the martyrdom of Hussein at the battle of Karbala in 680 Ge 
was a signal event in Islamic history and it is often said to have reinforced the readiness of 
Muslims, especially Shiites, to sacrifice their lives for the collective good in the face of 
overwhelming odds (Reuter 2004: 37-9). 

While such cultural resources likely increase the probability that some groups will engage 
in suicide attacks, one must be careful not to exaggerate their significance. One difficulty with 
the “clash of civilizations” argument is that public opinion polls show that Arabs in the Middle 
East hold strongly favorable attitudes toward American science and technology, freedom and 
democracy, education, movies and television, and largely favorable attitudes toward the 
American people. They hold strongly negative attitudes only toward American Middle East 
policy (Zogby 2002). This is less evident of a clash of civilizations than a deep political 
disagreement. The notion that an elective affinity exists between Islam and suicide bombing 
is not supported by the fact that “much of the so-called Islamic behaviour that the West terms 
terrorism is outside the norms that Islam holds for political violence.” (Silverman 2002: 91) 
One must also bear in mind that some secular Palestinian groups (e.g., the nationalist Fatah 
and the Marxist Popular Front for the Liberation of Palestine) and some non-Muslim groups 
(e.g., the Liberation Tigers of Tamil Eelam in Sri Lanka) have employed suicide bombing as a 
tactic. Among the 83 percent of suicide attackers between 1980 and 2003 for whom data on 
ideological background is available, only a minority — 43 percent — was religious (Pape 2005: 
210). According to one analyst, even in Lebanon, Israel, the West Bank and Gaza between 
1981 and 2003, fewer than half of suicide missions were conducted by religious individuals 
(Ricolfi 2005). This hardly increases confidence in the view that religion in general or Islamic 
fundamentalism in particular is the wellspring of suicide bombing. A final difficulty with 
cultural interpretations is that suicide attacks are by no means a constant in Islamic history. 
They appear in 11" century northern Persia, in the 18" century in parts of India, Indonesia and 
the Philippines, and in the late 20t" century in various parts of the Muslim world. The episodic 
nature of suicide attacks suggests that certain social and political circumstances may be 
decisive in determining which cultural resources are drawn upon at a given time to formulate 
tactics for collective violence. For example, in the 18" century, suicide attacks were chosen as 
a tactic because little else proved effective against vastly militarily superior European and 
American colonial powers (Dale 1988). Similarly, militant Islamic groups in the late 20" and 
early 21° centuries adopted suicide bombing as a tactic only after other tactics had failed. 
Suicide bombing, it seems, is a weapon of last resort. All of this points to the difficulty of 
trying to explain political variables with cultural constants. 


Strategic Choice 


A major advance in thinking about suicide bombing took place when scholars began to 
analyze it as a strategically rational political action (Harrison 2003; Madsen 2004; Sprinzak 
2000). With the recent publication of Robert Pape’s study of all 462 suicide bombers who 
attacked targets worldwide between 1980 and 2003, this school of thought was given a 
strong empirical basis of support (Pape 2003, 2005). 

The core of Pape’s argument is that ”... every group mounting a suicide campaign over the 
past two decades has had as a major objective — or its central objective — coercing a foreign 
state that has military forces in what the terrorists see as their homeland to take those forces 
out.” (Pape 2005: 21; cf. Laqueur 2004: 19) Pape makes his case by first quoting the leaders 
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of Hizballah, Hamas, al-Qa'ida and the Liberation Tigers of Tamil Eelam on their organization's 
goals. These leaders stated plainly and forcefully that their chief aim is to liberate, respectively, 
Lebanon, Palestine, Saudi Arabia and the eastern and northern provinces of Sri Lanka from 
what they regard as foreign occupation or control (Pape 2005: 29-33). To support his contention 
that suicide bombing is a fundamentally rational strategy, Pape then notes that suicide attacks 
are not randomly distributed but occur in clusters as part of a campaign by an organized group 
to achieve a political goal. He identifies 18 suicide bombing campaigns that have taken place 
since the early 1980s, five of them, including the second intifada, ongoing (Pape 2005: 40). 
Finally, Pape argues that the rationality of suicide bombing is evident in the correlation between 
the increasing use of suicide bombing campaigns and their relative success in achieving their 
goals. He finds that suicide bombing has a roughly 50 percent success rate and regards that as 
high, since, by comparison, international military and economic coercion achieves its goals less 
than a third of the time (Pape 2005: 65). In short, Pape claims that strategic rationality is evident 
in the timing, objectives and results of suicide bombing campaigns. 

Pape’s analysis convinces us that many cases of suicide bombing are not devoid of 
strategic logic. However, our analysis of the second intifada — one of the three most persistent 
and violent series of Suicide bombing attacks in the past quarter of a century’ — leads us to 
conclude that characterizing suicide bombing as strategically rational is an oversimplification. 
Specifically: 


1. We have found that suicide bombing is an action that typically 
involves mixed motivations and mixed organizational rationales. 
Strategic thinking is only one element that may combine with others 
in the creation of a suicide bomber. It predominates in the explosive 
admixture that results in a suicide attack less frequently than Pape 
leads us to believe. 


2. Because the motivations and organizational rationales of suicide 
bombings are often mixed, suicide bombing campaigns are not 
always or even often timed to maximize the strategic advantages of 
insurgents. The timing of suicide bombings may be detached from 
strategic considerations because they take place for nonstrategic 
reasons such as revenge or retaliation or simply when opportunities 
for attack happen to emerge. As a result, their timing may not 
maximize the strategic gains of the attackers and on some occasions 
may even minimize such gains. 


3. Suicide bombing campaigns sometimes encourage targets to 
make minor concessions, but they often fail to achieve their main 
objectives and they sometimes have consequences that are the 
opposite of those intended by suicide attackers and their 
organizations. If suicide bombing pays, as Pape claims, its net returns 
are often meager and sometimes negative. 


Method, Variables, Data 
To substantiate our arguments, we collected data on 128 variables pertaining to the use of 


insurgent and state violence in Israel, the West Bank and Gaza from Oct. 26, 2000 (the date of the 
second intifada’s first suicide bombing) to July 12, 2005 (the second intifada’s last suicide 
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bombing as of this writing). Data were collected from the online database of the International 
Policy Institute for Counter-Terrorism (ICT) in Herzliya, Israel; the website of Israel's Ministry of 
Foreign Affairs; the East Coast evening edition of the New York Times; and two authoritative 
Arabic newspapers — a/-Quds, published in Jerusalem, and a/-Quas al-Arab/, published in London. 

We define suicide bombing as the use of explosives against one or more people by one or 
more attackers. The attackers enjoy organizational support and know in advance and with certainty 
that their actions will result in their deaths. By our definition, merely planning an attack does not 
quality as a suicide bombing; the attacker must be en route to his or her target. Nor is death or 
injury a necessary part of our definition since on occasion a suicide bomber is apprehended and 
disarmed after an attack has been launched but before detonation. Three suicide bombings listed 
by Israeli sources do not qualify as such by our definition. Fourteen suicide bombings by our 
definition are not listed as such by Israeli sources. 

Aside from the cases just mentioned, we found no discrepancies between Arab and Israeli 
sources with regard to the date, time and location of suicide attacks, the number of deaths 
and injuries they caused, the number of suicide bombers involved in each attack, and the age 
and gender of the suicide bombers. Our review of Arabic newspapers found 10 cases in 
which Israeli sources incorrectly list the name or residence of the suicide bomber. 

Israeli sources and the New York Times provide only sporadic information on the individual 
motives, organizational rationales and precipitants of suicide attacks. Here we relied more 
heavily on Arabic sources because they provide more consistent and detailed evidence drawn 
from interviews with organizational leaders and suicide bombers’ family members, official 
organizational statements, and statements made by suicide bombers before their attacks. 

Because we discuss only the objectives, timing and results of suicide bombing in this paper, 
we focus mainly on 12 variables here: the (1) year, (2) month and (3) day of each suicide attack; 
the (4) primary, (5) secondary and (6) tertiary motives of the suicide bomber(s); the (7) primary, (8) 
secondary and (9) tertiary rationales for the suicide bombing according to the organization(s) 
claiming responsibility for it; and the (10) primary, (11) secondary and (12) tertiary precipitants 
that, according to the responsible organizations(s), prompted the suicide attack. 

Motives are the reasons suicide bombers gave for their actions in statements they made prior to 
attacking. When such statements were unavailable, we turned to published statements of family 
members and friends or, in their absence, published statements of organizational representatives. 

We found five types of motives for suicide attacks: 


1. Desire for personal revenge or retaliation due to an Israeli action 
against the suicide bomber or his or her relatives or friends 


2. Desire for national revenge or retaliation due to an Israeli action 
against Palestinians in general 


3. Desire for religious revenge or retaliation due to an Israeli action 
against Muslims or Islam 


4, Desire to regain one’s reputation due to the suicide bomber having 
engaged in shameful behavior, such as collaboration with the enemy 


5. Desire to achieve a religious goal other than revenge or retaliation, 
such as the defense or spread of Islam 


Organizational rationales are the reasons that representatives of organizations claiming 
responsibility for suicide attacks gave for their actions. Organizational rationales are not ultimate 
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goals. The ultimate goals of insurgent Palestinian organizations are to recapture territory 
perceived as their members’ homeland and, for some organizations, to establish Islam as the 
official religion of that territory. Many strategies and tactics may be adopted to achieve these 
goals, suicide bombing among them. Rationales, in contrast, are the reasons organizational 
leaders give for undertaking specific suicide missions. Sometimes rationales and ultimate goals 
are identical but usually they are not. We found five types of rationales for suicide attacks: 


1. Desire for organizational revenge or retaliation — due, for example, 
to an Israeli attack on the organization's leaders, activists, sites or offices 


2. Desire for national revenge or retaliation — due, for example, to an 
Israeli attack against Palestinians who are not members of the 
organization claiming responsibility for the attack 


3. Desire to achieve a tactical (/.e., specific, short-term) political goal, 
such as disrupting security cooperation between Israeli and Palestinian 
authorities 


4. Desire to achieve a strategic (i.e., general, long-term) political goal, 
such as forcing |srael to withdraw trom occupied territories 


5. Desire to achieve a religious goal, such as the defense or spread of 
Islam 


Precipitants are the specific preceding events that affected the timing of suicide bombings 
according to representatives of organizations claiming responsibility for the attacks. We found 
five types of precipitants: 


1. Assassination of organizational leaders or members by Israel 


2. Killing of Palestinians other than organizational leaders or members 
by Israel 


3. Anti-Palestinian actions by Israel not involving the killing of 
Palestinians, such as house demolitions 


4. Significant political events such as an Israeli election, the visit of an 
American envoy or an Arab summit meeting 


5. Symbolically significant religious or ideological events, such as the 
anniversary of Salah al-Din’s retaking of Jerusalem from the Crusaders 
in 1187 CE. 


We collected information on primary, secondary and tertiary motives, rationales and 
precipitants, judging salience on the basis of how prominently these precursors of suicidal 
violence figured in the reports we surveyed. 

Coding was conducted by this paper's junior author, a PhD. sociology student who is fluent 
in English and Arabic. A research assistant (also with a graduate education and fluency in 
English and Arabic) also was trained to code the materials. We drew a random sample of 5 
percent of the days on which suicide bombings took place between Oct. 26, 2000 and July 12, 
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2005, and the research assistant independently coded the materials for these days. The 
research assistant’s coding was then compared with that of the junior author. We found only 
a few minor differences between the two codings. This gives us confidence in the reliability 
of our measures. 


Analysis 


Timing 


Pape defines a suicide bombing campaign as “... an intended series of attacks... aimed at 
gaining political concessions from a target government. A campaign... continues until the 
terrorist leaders deliberately abandon it, either because sufficient gains have been made 
or because the leaders believe that the effort has failed.” (Pape 2005: 39-40) Accordingly, 
Pape divides the suicide bombings that took place in Israel, the West Bank and Gaza 
between 1994 and 1997 into four campaigns lasting one to 10 months each. Inexplicably, 
however, he regards the second intifada as a single suicide bombing campaign although it 
had dragged on for 55 months as of this writing and had sustained numerous periods of 
intense suicide bombing activity as well as extended lulls, some lasting longer than the 
time between campaigns (as defined by Pape) in the 1990s. According to Pape, suicide 
bombing campaigns are strategically timed to maximize the advantages of insurgent 
groups. To test his argument, one would need to measure and explain variation over time 
in the frequency of suicide attacks. Eliminating all such variation by lumping many peaks 
and troughs under the rubric of a single campaign obscures the very data required to test 
his argument rigorously. 

Figure 1 highlights the data hidden by Pape’s classification of the second intifada as a 
single campaign. The 10-day moving average of the frequency of suicide attacks varies 
from zero to .8 attacks per day. Understandably, the peaks — especially those with values 
of half the maximum or more (i.e., .4 or greater) — first capture one’s attention. There were 
11 such peak periods of suicide bombing during the second /ntifada. The fact that 10 of 
them appear in the upper left quadrant of Figure 1 means that all but one took place during 
the second /ntifada's first half. The mean duration of the 11 peak periods was a mere 4.9 
days; cumulatively, they lasted just 3.1 percent of the days in the series. Nonetheless, these 
peak periods of suicide bombing account for 19.7 percent of injuries, 21.6 percent of 
deaths and 41.3 percent of total attacks. They were staccato bursts of extraordinary 
cumulative lethality. 

Peak periods sometimes involved the coordinated activities of several insurgent 
organizations (Hamas, Islamic Jihad, the al-Aqsa Martyrs Brigade, etc.) and were in that 
sense “national” campaigns. In terms of precipitants, motives and organizational rationales, 
however, we lack sufficient data to determine exactly how peak periods differ from other 
periods.” Until such data become available, we must restrict ourselves to the more general 
task of explaining variation in the frequency of suicide attacks using our content analysis of 
newspapers and suicide bombing databases. 

Table 2 provides insight into the timing of suicide attacks. It focuses on the precipitants 
of suicide bombings — the specific preceding events that affected the timing of suicide 
bombings according to representatives of organizations claiming responsibility for them. For 
ease of interpretation, we divided precipitants into two broad categories: reactive and 
proactive. Reactive precipitants are Israeli actions that elicited a Palestinian reaction in the 
form of a suicide attack. Such actions include the assassination of organizational leaders and 
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Table 2: Precipitants of Suicide Bombings 


SS 


Reactive Precipitants 


Assassination of organizational leaders 34 
Killing of Palestinians other than organizational leaders 30 
Other Israeli actions not involving killing 18 
Proactive Precipitants 

Significant political events 13 
Significant religious or ideological events S 
Total 100 
N 106 


a 
Note: This table combines primary, secondary and tertiary rationales. 


members, the killing of Palestinians other than organizational leaders and members, and 
other actions not involving killing, such as the demolition of houses owned by the families of 
people involved in anti-Israel activities. Proactive precipitants are political, religious or 
ideological events that elicited a suicide attack without provocation by specific Israeli actions. 
In such cases, organizations used symbolically significant anniversaries, elections or 
negotiations as opportunities to further their goals by means of suicide attacks. 

Fully 82 percent of the 106 precipitants that we identified were reactive, just 18 percent 
proactive. The great majority of suicide attacks were precipitated by specific Israeli actions. Their 
timing was in that sense not of the Palestinians’ choosing. To be sure, Israel's response influenced 
the ease with which suicide attacks could be mounted. Especially after the extraordinarily 
frequent and lethal suicide missions of March 2002, Israel's actions (including the construction of 
a wall with electronic sensors and television cameras cordoning off much of the West Bank from 
Israel) significantly decreased the number of suicide bombings and increased the time between 
precipitant and reactive attacks. Still, even during the /ntifada’s less bloody second half, suicide 
bombings were often precipitated by Israeli actions. For example, the suicide bombing in 
Beersheba on Aug. 31, 2004 was declared to be a response to the assassinations of Hamas 
leaders Sheikh Ahmad Yassin and Abdel Aziz Rantisi, but those assassinations occurred, 
respectively, four and five months earlier. In contrast, insurgent response time at the beginning of 
the intifada was typically no longer than three weeks, often less. 

Thus, our finding suggests an interpretation for the timing of suicide attacks that is widely 
at variance with Pape’s. We find little evidence to support his contention that suicide attacks 
are timed to maximize the achievement of strategic or tactical goals. Our analysis of 
precipitants leads us to conclude that most suicide bombings were revenge or retaliatory 
attacks and were advertised as such by insurgents. 


Objectives 


Pape claims that not just the timing but also the objectives of suicide attacks reveal a strategic 
logic. In his view, suicide bombing is aimed at achieving specific political goals, notably coercing 
an occupying power to end its occupation. We certainly found such logic when we examined 
the individual motivations and organizational rationales for suicide attacks during the second 
intifada. We also discovered that it figures less prominently than Pape would have us believe. 
Table 3 divides the individual motivations for suicide bombing into the now-familiar 
reactive and proactive categories. Although we were able to find information on the 
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motivations of only a minority of suicide bombers in published sources, some 71 percent 
of the motivations we recorded were reactive and 30 percent proactive. (The percentages 
do not equal 100 because of rounding.) Revenge and retaliation figured prominently in the 
suicide attackers’ vocabulary of motives. For the most part, they gave up their lives to 
avenge the killing of a close relative, as retribution for specific attacks against the 
Palestinian people or as payback for perceived attacks against Islam. This finding supports 
the educated but impressionistic conclusion that political philosopher Avishai Margalit 
reached: 


Having talked to many Israelis and Palestinians who know something 
about the bombers, and having read and watched many of the 
bombers’ statements, my distinct impression is that the main motive 
of many of the suicide bombers is revenge for acts committed by 
Israelis, a revenge that will be known and celebrated in the Islamic 
world (Margalit 2003). 


Table 3: Motivations for Suicide Bombings 





Reactive Motivations 


Personal revenge or retaliation 23 
National revenge or retaliation 46 
Religious revenge or retaliation Z 
Proactive Motivations 

To regain reputation 5 
Religious 25 
Total 101 
N 57 





Note: This table combines primary, secondary and tertiary 
motivations. The total percent does not equal 100 due to 
rounding. 


In fairness, Pape’s argument is pitched mainly at the organizational level. He holds that 
all manner of motivations may prompt individual suicide bombers to attack, but 
organizations tend overwhelmingly to operate in a calculated, strategic way. To test his 
argument, we read what organizational representatives had to say about their rationales 
for suicide attacks. Thirteen percent of the 165 rationales that we identified mentioned 
long-term strategic goals such as ending the Israeli occupation. Another 21 percent 
mentioned short-term tactical goals such as disrupting security cooperation between 
Israel and the Palestinian Authority. Seven percent mentioned that suicide bombings were 
calculated to achieve religious goals — in particular, the defense or spread of Islam. But this 
still leaves a substantial majority of rationales — 59 percent — that fall into the reactive 
category. Even at the organizational level, where, according to Pape, calculated, strategic 
considerations govern action, we find that six out of 10 rationales focused on avenging 
Israeli attacks on insurgent organizations and the Palestinian people or retaliating for such 
attacks in order to maintain organizational morale. To be sure, the organizations 
responsible for suicide attacks are governed by a higher level of strategic logic than are 
the suicide bombers themselves. It is nonetheless misleading to make the wholesale 
claim that predominantly strategic considerations regarding the re-conquest of territory 
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Table 4: Rationales for Suicide Bombings 





Reactive Rationales 


Organizational revenge or retaliation 22 
National revenge or retaliation 37 
Proactive Rationales ] 
Tactical 21 
Strategic 13 
Religious 7 
Total 100 
N 165 
Note: This table combines primary, secondary and tertiary 
rationales. 


underlie suicide bombings. Whether we examine the timing or the objectives of suicide 
attacks, the reality is more complex. 


Results 


Pape’s third assertion is that suicide bombing is strategically rational in the sense that it often 
pays. In his view, organizations engage in suicide bombing because it helps them achieve 
their strategic goals relatively efficiently — with a comparatively high rate of success in terms 
of achieving strategic goals, little financial cost and minimal loss of life on the side of the 
insurgents. Here Pape echoes the common view that Palestinian insurgents must invest just 
$150 in supplies and travel costs plus a single human life to wreak havoc in Israeli society and 
achieve their goals. 

To sustain the argument that suicide bombings cost little, one must ignore many of the 
substantial costs incurred by Palestinians for their actions. For example, in response to suicide 
bombings, Israel routinely assassinates top organizational leaders (often causing collateral 
deaths of associates and family members in the process) and imprisons and tortures second- 
tier leaders. Insurgent organizations such as Hamas get branded as terrorist organizations 
internationally and find their bank accounts frozen. These retaliatory actions are entirely 
predictable and they undermine the capacity of insurgent organizations to act. We conclude 
that the costs to insurgents of suicide bombings are in fact substantial, yet high costs fail to 
prevent some Palestinians from engaging in suicide attacks. 

Pape’s claim that suicide bombing achieves a relatively high rate of success in terms of 
achieving strategic goals is also highly questionable in our judgment. Pape defines success 
as the withdrawal of occupying forces. The second intifada witnessed just one such 
withdrawal — Israel's August/September 2005 pullout from Gaza. Can the pullout be 
construed as a consequence of Palestinian suicide attacks? That is certainly the view of 
Hamas and other insurgent organizations. Hamas's official statement following the Gaza 
pullout included the telling phrase, “Four years of resistance surpassed 10 years of 
bargaining.” And in early September 2005, the “general leader” of Hamas's military wing, 
Muhammad Deif, said to his comrades that “without... your love of martyrdom, the 
liberation of Gaza could not have been achieved.” (Palestinian Information Center, our 
translation) 
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Yet an examination of the geographical location of suicide bombings and the geographical origin 
of the bombers themselves casts doubt on this interpretation. During the second intifada, Gaza was 
neither the site of a disproportionately large number of suicide attacks nor the recruiting ground for 
a disproportionately large number of suicide bombers. Only 18 percent of all suicide attacks took 
place in Gaza, the same percentage as in the West Bank. Nearly two-thirds of suicide attacks took 
place in Israel proper (see Table 5). Only 26 percent of suicide bombers came from Gaza compared 
to 72 percent from the West Bank. The number of West Bank suicide bombers per million residents 
was 1.7 times higher than the number of Gazan suicide bombers per million residents. (For 
population data, see Palestinian Central Bureau of Statistics 2005a.) A mere two of the 84 suicide 
bombers who carried out missions in Israel proper came from Gaza. Eighty came from the West 
Bank. Thus, if suicide attacks were a decisive factor in leading to territorial concessions, one would 
expect those concessions to have been made in the West Bank, not Gaza. 


Table 5: Geographical Location of Suicide Attacks and Geographical Origin of Suicide Bombers 


Percent of Bombers per Percent of 
Region bombers million Palestinians bombings 
West Bank 72 30 18 
Gaza 26 18 18 
Israel proper 1 64 
Other 1 0 
Total 100 100 


N 86 137 


This is not to suggest that suicide bombing had no effect on Israeli actions. It did. As often 
as not, however, the effect was the opposite of what was intended by insurgents. Palestinian 
moderates want Israeli public opinion to soften, leading Israel to withdraw from the West Bank 
and Gaza. Extremists want to create deep rifts in Israeli society and to destroy Israel as a Jewish 
state. Suicide bombing cannot be construed as an instrumentally rational means of achieving 
any of these objectives (cf. Elster 2005; Weber 1947: 115-18). Thus, following four and a half 
years of suicide attacks, a 2005 BBC poll of 68 countries found that Israeli Jews have by far the 
strongest national identity in the world and are more likely than people in other countries to trust 
their military and police leaders and want their military leaders to have more power (BBC News 
2005). Israeli polls demonstrate that suicide attacks helped hardliner Ariel Sharon win the 
February 2001 election and, in general, drove Israeli public opinion to the right throughout the 
second intifada (Arian 2001, 2002; Eldar 2005; Elran 2006). Suicide bombings also encouraged 
Israel to reoccupy Palestinian population centers in the West Bank and Gaza in 2002. Israel had 
withdrawn from these population centers in 1995-97 as a result of negotiations culminating in 
the 1993 Oslo Accords. But in March 2002, 135 Israeli civilians were killed in suicide attacks, the 
most infamous of which was the so-called Passover massacre at the Park Hotel in Netanya, in 
which 30 Israelis lost their lives. Within 24 hours of the Passover massacre, Israel launched 
Operation Defensive Shield. Twenty thousand reservists were called up in the biggest 
mobilization since the 1982 invasion of Lebanon and the biggest military operation in the West 
Bank and Gaza since the 1967 war. The West Bank and Gaza were almost completely 
reoccupied within weeks. Even if the strategic aim of the suicide bombings in March was purely 
to coerce Israel to withdraw completely from the occupied territories (however defined), the 
result of those attacks was just the opposite. On a broader canvas, we note that substantial 
West Bank territory has been incorporated on the Israeli side of the wall that Israel is building to 
make it harder to launch suicide attacks. Therefore, in the long run, too, suicide bombings will 
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have made it more difficult for the Palestinians to gain territorial concessions from Israel. Of 
course, many Palestinians recognize that suicide bombing is a problematic strategy that rarely 
achieves strategic territorial goals and often has unintended, negative consequences. Among 
them is Palestinian President Mahmoud Abbas, who characteristically declared the July 12, 2005 
suicide bombing in Netanya “a crime against our people.” (Al-Ra’is 2005) 

In sum, we conclude that, during the second intifada, the results, objectives and precipitants 
of suicide bombing reveal little of the strategic logic that, according to Pape, lies at its core. 


A Model of the Determinants of Suicide Bombing 


We have crudely modeled the deadly interaction between Palestinians and Israelis in Figure 2. We 
hypothesize that suicide bombings prompted Israel to take a variety of measures against 
insurgents, such as the assassination of Palestinian insurgents. In turn, assassinations led 
Palestinian insurgents to engage in more suicide bombing.‘ This positive reciprocal relationship is 
signified by the double-headed arrow linking variables 1 and 3. Suicide bombings also prompted 
Israel to take antiinsurgent measures other than assassinations, such as armed incursions. These 
operations also resulted in the death of Palestinians; and the killing of Palestinians by methods 
other than assassination led to more suicide bombings. This positive reciprocal relationship is 
signified by the double-headed arrow linking variables 2 and 3. Finally, suicide bombings 
prompted Israel to imprison a growing number of Palestinian insurgents (3 — 4). After March 
2002, however, when Israel launched Operation Defensive Shield, so many insurgents were 
imprisoned that the operation of their organizations was negatively affected and the frequency of 
suicide bombings decreased (4 — 3). Consequently, we expect the direction of the relationship 
to have shifted from positive to negative after March 2002. 

Due to its complexity, properly fitting data to our model would require another paper. 
Accordingly, we restrict ourselves here to illustrating the plausibility of the model by showing 
bivariate correlations between pairs of variables for aggregated monthly data. All the bivariate 
correlations shown in Figure 2 are moderately strong to strong. All are statistically significant 
at the .001 level or lower and their direction is consistent with our theory, although, of course, 
they in no sense confirm it. 


Toward an Interactive Approach 


In the planning offices of the Israeli security services as in the warrens of Gaza City, highly 
intelligent men strategize about how to maximize gains and minimize losses in the most recent 
phase of their 125-year battle over territory that both sides claim as their historical and religious 
birthright. One side is too weak to imagine a balance of power, so instead it concocts a scheme 
to achieve a balance of horror, justified by the idea that, “A nation whose sons vie with each other 
for the sake of martyrdom does not know defeat.” (quoted in Oliver and Steinberg, 2005: 61) The 
powertul side responds to martyrdom operations as most of its enraged population demands — 
by teaching the other side a series of lessons it won't soon forget. The weak side obliges by 
remembering well and avenging Its losses with all the fury it can muster. Some of the strategic 
thinkers in the Israeli planning offices undoubtedly recognize that murderous retribution is often 
counterproductive. They must, however, answer to their political bosses, who are in turn obliged 
to respond to public outrage by getting tough. Some of the strategic thinkers in the warrens of 
Gaza City undoubtedly know that Israel will not capitulate in response to suicide bombing. But 
they must answer to their publics too, and so they often forsake the calculation of costs and 
benefits for political expediency and a culture of mutual destruction. Hence our conclusion that 
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Figure 2: A Model of the Determinants of Suicide Bombing 
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r = .740 (to March ‘02) 


4. Monthly 
Palestinian 
prisoners 















3. Monthly 
Suicide 
bombings 


r = -.519 (post March ‘02) 





2. Monthly other 
intifada-related 
deaths 


Note: This model shows bivariate correlations between pairs of variables, all of which are 
statistically significant at the .001 level or lower. The correlation between assassinations and 
suicide bombings includes only bombings in which an assassination was mentioned as a 
precipitant. The correlation between other ipitodeaeeted deaths and suicide bombings 
includes only bombings in which other intifada-related deaths were mentioned as a 
as ogether, these bombings account for 68 of the 106 pepiants on which we 
ave information (see Table 2). Neither the monthly number of house demolitions nor the 
monthly number of Palestinians injured due to the second intifada are associated with the 
monthly number of suicide bombings. 


Data sources: Suicide bombings and Israeli assassinations (attempted and successful) of 
Palestinian members of insurgent organizations: al-Quds (2000-05): al-Quds al-‘Arabi 
(2000-05); Palestinian deaths due to the second int fada: Palestinian Central Bureau of 
Statistics (2005b); Palestinian prisoners in the Israeli prison system: Israel Defense Forces 
(2005) and Israeli Prison Service (2005). 


retribution and retaliation often trump strategic calculation in prompting suicide attacks. 

Our analysis also points to the interactive nature of suicide bombing and the necessity of 
understanding its relationship to other forms of collective violence perpetrated by states and 
non-state organizations (cf. McAdam, Tarrow and Tilly 2001; Tilly 2003). Any protracted 
conflict involving suicide bombing is marked by periods of intense violence and periods of 
relative calm. It features a complex cast of actors whose identities, goals, strategies, friends 
and enemies change over time. It involves a repertoire of tactics other than suicide bombing 
(assassinations, armed incursions by state forces, the imposition of curfews and restrictions 
on freedom of movement, land grabs, guerrilla attacks, strikes, mass demonstrations, etc.), 
the use of any one of which may make the use of others more or less likely (Lichbach 1987). 
To adequately explain the emergence of suicide bombing as a political tactic and culturally 
acceptable practice and account for variations over time in its use, one must do justice to the 
complex dynamism of a conflict like the second intifada by bringing all these actors, identities, 
coalitions, and tactics into the picture. Here we have only scratched the surface. 


Notes 


1. When Pape’s book went to press, only al-Oa‘ida’s campaign against the United States, 
which began in 1996, had lasted longer than the second intifada. Since Pape’s book went 
to press, the incidence of suicide bombings by Iraqi insurgents has exceeded by far the 
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incidence of suicide attacks during the second intifada. For a good journalistic overview of 
the second intifada, see Harel and Isacharoff (2004) and for social scientific studies, see 
Hafez (2005) and Moghadam (2003). 


2. There were 57 suicide bombings during peak periods. In our content analysis of 
newspapers and suicide-bombing databases we found peak-period data on the individual 
motives of two bombers, on the precipitants of ten bombings, and on the organizational 
rationales of 28 bombings. Ricolfi (2005: 98-101) attributes spikes in the frequency of 
suicide bombing to competition among insurgent organizations but, oddly, he defines 
cooperation as a form of competition. Like Ricolfi, we have noted that inter-organizational 
cooperation seems to be associated with spikes in the frequency of suicide bombing but 
we see no reason to view this as a form of competitive outbidding. For a critique of the 
outbidding thesis, see Brym and Araj (2006). 


3. The decisive factor determining how an occupying power will be affected by suicide 
bombings and other such attacks is its level of commitment to the territory in question. It 
took a single set of train bombings to get Spanish forces out of Iraq in 2004 but the West 
Bank is not Iraq and Israel is not Spain. 


4. We define assassinations as Israeli actions intended to kill specific Palestinians on the grounds 
that they conducted antilsrael activities during the second intifada. Ricolfi (2005: 126-8) argues 
for the asymmetry of the relationship between suicide bombing and Israeli counterterrorism in 
the period 2000-03. He shows that the correlation between suicide missions and Israeli military 
operations one week later is strong and statistically significant while the correlation between 
Israeli military operations and suicide bombings one week later is weak and statistically 
insignificant. However, his correlations are deceiving because suicide bombings have 
precipitants other than Israeli military operations, as we have shown. As Figure 2 indicates, we 
found positive and statistically significant relationships when we specified the type of 
precipitant but we chose not to lag variables because our data are monthly aggregates and the 
time between event and reaction is often less than a month. We also found a modest positive 
correlation (r = .214, sig. < .001) between (1) the monthly number of suicide bombings 
attributable to each insurgent organization (Hamas, Islamic Jihad, atAqsa Martyrs Brigade and 
other) and (2) the monthly number of assassinations of each organization's members. 


5. We would have to conduct a regression analysis using a Poisson or negative-binomial 
generalized linear model showing reciprocal effects and taking serial dependency into account. 
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Abstract 

To comprehend the developments underlying the suicide attacks of recent years, we suggest that 
the organizational approach, which until recently was used to explain this phenomenon, should 
be complemented with a social network perspective. By employing a social network analysis of 
Palestinian suicide networks, we have found that, in contrast to prevailing perceptions which 
view suicide attacks as a product of strategic decisions made within organizational frameworks, 
decisions have actually been made, to a great extent, by local activists, and struggles between 
local and family groups have proved to be the best predictor of their actions in this context. We 
also found that the peripheral nature of suicide bombers has been a characteristic common to 
most networks. Finally, we have found that the existence of cohesive subgroups, as well as the 
number of hubs in a network, has an influence on the network’s effectiveness. We conclude the 
paper by showing that the method of network analysis can also be of considerable assistance 
from the standpoint of dealing with suicide attacks. On the one hand, this is possible by 
identifying the network's central figures who are essential for its continued existence, and on the 
other, we identify the grassroots factors responsible for the emergence of these networks. 


Introduction 


In recent years, there has been a breakthrough ii) the study of suicide attacks. Robert Pape’s 
(2003) analysis of how suicide bombers have been used as a means to force countries to alter 
their policies in asymmetrical territorial disputes has paved the way for employing an 
organizational perspective and rational assumptions to explain the phenomenon. Bloom's (2004) 
study showing how organizational competition for public support helps explain the increase or 
decrease in the use of suicide bombers is a good example of this perspective. Recently, several 
studies have been published, all analyzing the interactions among the individual, the community 
and the organization (Bloom 2005; Hafez forthcoming; Moghadam forthcoming; Pedahzur 
2005; Pape 2005). Despite the wide range of factors used in these analyses, all these 
researchers view organizations and their elites as central to the initiation of suicide attacks. 
Most of these studies substantiate their claims using case studies of groups such as 
Hezbollah in Lebanon, the LTTE in Sri Lanka, the PKK in Turkey and various Palestinian groups. 
The key to this impressive theoretical advance lies in the fact that all of these groups featured 
an orderly hierarchical structure that operated according to the directives of a leader or an 
identified echelon of leaders, and that all proclaimed the liberation of territory or a struggle for 
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the rights of an ethnic minority as their raison d'etre. 

The paradox which is the current article's point of reference is that the majority of these 
groups, apart from the Palestinian ones, almost completely desisted from employing suicide 
bombers at the end of the last decade. Nevertheless, suicide attacks have not become a thing 
of the past. The number of suicide attacks in the past five years is 2.7 times greater in comparison 
to the period beginning in the 1980s and lasting until 1999 (NSSC Dataset, 2005). Even so, while 
the number of suicide attacks perpetrated by established groups with hierarchical structures is on 
the decline, the number of attacks carried out by groups lacking an easily identifiable structure 
or an established leadership is on the rise. In the past five years, the latter groups initiated 6.4 
times more suicide attacks than the former ones (NSSC Dataset, 2005). 

The major name in suicide attacks in recent years has been al-Qa'ida. The very name of the 
group (“the base” in Arabic) indicates how it differs from the groups mentioned above. Most 
studies investigating Al-Oa'ida, especially after September 11" and the American invasion of 
Afghanistan, stress that it has no clear-cut leadership or hierarchical structure, but executes attacks 
through a network of cells dispersed worldwide or in cooperation with local groups (Jones, Smith 
and Weeding 2003; McAllister 2004; Mayntz 2004). The researcher to express this understanding 
in the most coherent fashion is Marc Sageman, who has abandoned the organizational orientation 
in his research of global jihad in favor of a network analysis approach (Sageman 2004a). 

However, the dissimilarity is not only in al-Oa‘ida’s structure, but also in its goals, which are 
less distinct than those of the groups that employed suicide attacks in the 1980s and the 
1990s. The claim that the exclusive aims of al-Qa'ida and its worldwide derivatives are simply 
territorial is forced (Byman 2003; Schweitzer and Shay 2002). It appears that each of the cells 
associated with al-Qa'ida has its own aims and that their main figures are not subject to the 
directives of a central leadership (Jones, Smith and Weeding 2003). 

However, it is not only the acts attributed to al-Oa‘ida that are difficult to analyze using 
earlier theoretical models. During the al-Aqsa uprising, the Palestinian groups that, until the 
turn of the century, neatly fit these theories and even served as case studies for the chief 
researchers of suicide attacks (Bloom 2004, 2005; Crenshaw 2000; Hafez forthcoming; 
Merari 1990; Moghadam 2003; Pape 2005, 2003; Pedahzur, Perliger and Weinberg 2003) 
divested themselves of their orderly hierarchical structure. Palestinian suicide networks today 
resemble al-Qa'ida more than they do Hezbollah, LTTE or PKK. This is especially so in that 
even though Palestinian groups do not operate internationally as al-Oa‘ida does, the suicide 
attacks in the Palestinian arena are also products of local, independent, horizontal networks 
rather than of a hierarchical organizational process. As noted by a veteran Israeli general, “The 
terrorist networks in the cities and villages operate almost independently; we are not facing 
Hamas or Fatah but a Balata terrorist network and a Dheisheh terrorist network.” (Drucker and 
Shelah, 2005) (Balata and Dheisheh are both refugee camps.) 

This development may be explained by Israel's reoccupation of Palestinian territory, which 
has caused most of the Palestinian groups’ operational sectors and lower backbone to split 
up into regional cells. Israel's policy of targeted assassinations has driven their leadership, 
especially that of Hamas, underground. Having to concentrate on their survival, the leaders of 
these groups have become less and less able to remain involved in the groups’ operational 
activities. Although the Palestinian groups are still strong actors in the political arena, they 
have become much less important than the local networks in perpetrating suicide attacks. 


Suicide Attacks, A Network Approach 


To understand the suicide attacks of recent years, and especially with regard to both al-Qa'ida 
and the Palestinian groups, we suggest supplementing the organizational approach with a 
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social network perspective. Namely, we suggest shifting the level of analysis from the 
established hierarchical group to “horizontal networks,” which operate within or independently 
of the group and feature horizontal rather than hierarchical ties. 

Although the results of their suicide attacks are similar, the hierarchical groups and horizontal 
networks differ in important ways. First, while hierarchical groups have clearly identified “leaders” 
who serve over a fair time span, horizontal networks have “hubs.” Hubs are the actors at the center 
of the network with a great many connections. However, hubs are invariably local operatives who 
are not well-known, may be readily replaced, and occasionally share their influence over the 
network with other operatives or hubs. The networks they operate have no organized hierarchical 
structure, and they themselves often don't know the network's true size or all its members. 

The two types of bodies may also differ somewhat in their goals. The goals of the leaders 
of hierarchical groups, it is generally believed, combine trying to force changes in policy (Pape 
2003) with achieving objectives in the internal political arena (Bloom 2005; Bloom 2004). In the 
horizontal networks, hubs sometimes serve as local representatives of the group, receiving 
instructions, usually indirectly, from the group's leadership. In other cases, however, even as 
they continue to operate in the general context of the community struggle for political gain, 
hubs use the network's ability to dispatch suicide bombers to gain territorial control or 
political dominance in a specific region for their network or family. 

Finally, the two differ in structure and functioning. Established hierarchical groups are generally 
characterized by a systematic hierarchical structure in emulation of army units. Horizontal 
networks have a variety of structures, as shown by Arquilla and Ronfeldt (2001),' all of which are 
rather elusive and lack clear distinctions between the leadership and operatives (Makarenko 
2001). A horizontal network can materialize in a single day, go through constant changes, and 
soon disappear (Krebs 2001). The two also recruit differently. Conventional hierarchical groups 
generally initiate recruitment programs, select potential candidates, persuade or force them to 
join, and train them for various operations, including suicide missions. Horizontal networks, in 
contrast, are often formed through friendships and family relations; joining is mostly by way of “a 
friend brings a friend.” (Sageman 2004a) Training, if there is any, is generally brief and done by the 
recruiter (Merari 2004). Whereas hierarchical groups are able to isolate and train recruits in self- 
contained paramilitary programs, horizontal networks are usually part of the community in which 
they operate. In some cases, network members are able to create social enclaves and may 
succeed in isolating themselves from the environment (Sageman 2004b); in other cases, 
however, they may be influenced by the attitudes of their community and their immediate 
environment towards the idea of self-sacrifice as a part of guerilla warfare or terror. 


Social Networks Analysis 


The analysis of Palestinian networks should help in examining the majority of the hypotheses 
- raised above.2 In some cases, additional methodological approaches have been used to fill 
out the analysis. First, we try to determine the factors responsible for the initiation and 
dispatching of young men and women on suicide missions by identifying the central actors 
—the hubs — in each network and examining their motives in managing the network. Then, we 
examine how these networks operate: how they were formed, how they recruited members, 
and their relations with the surrounding community. Finally, we examine the relationship 
between the network structure and its operational capabilities. In this way, we try to examine 
what types of networks have greater potential to dispatch suicide bombers and whether 
certain features of the network can predict its effectiveness. 

Network analysis may also help to cope with suicide attacks. We will thus identify the 
network's central actors and the figures essential to its continued existence, as well as the 
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grassroots factors responsible for the emergence of these networks. We have used two types 
of analysis that are fairly common in the examination of social networks. The first are analyses 
that provide information on the status of each actor in the network on three indices of centrality: 
degree centrality,? closeness* and betweenness.° The second are mainly K-core analyses,° 
designed to help map out the network social structure by locating the subgroups within it. 


Networks of Suicide Bombers 


To map the social networks of Palestinian suicide bombers, data were culled from a broad spectrum 
of sources, including the international press, the organizations’ Internet sites and published 
documents, and studies and articles on Palestinian suicide bombers.’ Information was collected 
about the actors in each network, the ties among them and the nature of the ties. Matrices were 
then constructed for each network and processed by means of UCINET 6 software.’ We generally 
distinguished three levels of intensity of personal ties: family ties, long-term friendships and earlier 
acquaintance (such as a person met at work, school or in an Israeli prison). 

As can be seen in Table 1, which lists the networks’ characteristics, most networks came 
into existence after 2000. They were located in several geographic regions (Nablus, Jenin, 
northern Samaria and Hebron) and have dispatched a total of 42 suicide bombers, who 
constituted 21.6 percent of the entire population of Palestinian suicide bombers between 
1993 (when Palestinians started to perpetrate suicide attacks) and 2005. Another 35 suicide 
bombers who acted outside these four networks were associated with other networks and 
many more were not affiliated with any organization. These findings highlight how local 
networks have replaced established hierarchical groups in being responsible for most suicide 
attacks. Table 1 also shows an average of 2.75 hubs in each network and indicates that the 
level of density (which expresses the number of ties and their intensity) in the Hebron network 
is exceptionally high compared to that of other networks, mainly due to its multiple family ties 
—a matter that will be discussed at length below. 

It is of note that most networks are not homogeneous in their members’ organizational 
affiliation. Some members identified with or operated in different groups or did not belong 
to any group prior to becoming part of the network. To reinforce our hypothesis that the 
horizontal network began to replace the hierarchical group after intensive Israeli operations 
against Palestinian organizations beginning in 2000, we included a control group in this study 
— the Nablus network, which has been active in 1996. Although the earlier existence of this 
network seems to undermine our argument, two sets of facts bolster It. First, the Nablus 
network was the only horizontal network of suicide bombers before the al-Aqsa Intifada. All 
the other 29 suicide attacks prior to this intifada were clearly outcomes of hierarchical 
organizational processes inside Palestinian groups. After the outbreak of the intifada, however, 
we found, in addition to the three above named horizontal networks, a large number of 
smaller networks. Second, as we will demonstrate in our later analysis, the Nablus network 
was more an intermediate body that shared features with both the horizontal networks of the 
al-Aqsa uprising era and the hierarchical groups that had previously perpetrated suicide 
attacks. To demonstrate only one point, unlike the newer networks, the organizational 
affiliation of the members of the Nablus network was totally homogeneous (see Table 1). 


Motivations of Hubs 


As anticipated, all the networks examined had a limited number of hubs.’ Table 2 shows the 
hubs in each of the four networks we analyzed and their characteristics. 
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The overwhelming majority of hubs displayed high levels of closeness and betweenness. 
Hence, their level of centrality was highly correlated with their levels of closeness and 
betweenness. This means not only that the hubs are the most connected actors in the 
network, but also that they are located at strategic nodes, from where they can reach most of 
the actors with relatively few steps and few mediators. Moreover, in contrast to “regular” 
actors, who know only a few other actors in the network (mainly those relevant to their own 
tasks), the hubs were the main elements connecting network subgroups. These findings 
Suggest that once the network is created, the hubs, whether or not they themselves created 
the network, are the glue that binds all of its components: logistics, intelligence, recruitment 
and dispatching. 

To understand the factors leading to the networks’ initiation of suicide attacks we must thus 
examine the hubs’ management of the networks and the motives driving their actions. This 
does not obviate the strategic considerations of organizational elites or the personal motives of 
suicide bombers. It means, however, that these factors are less significant in networks 
because the central leadership's control over local cells has proven to be extremely limited and 
the suicide bombers’ motives were found to be relevant mainly when the bombers were linked 
to a hub who supplied them with the resources, such as logistics, explosives, training, etc. 

Our study found that most hubs had two main distinctions. First, none had previously held 
a senior position in any of the organizations. Most indeed had operative qualifications (e.g., 
Muhi al-Din al-Sharif , an engineer) or political talents (e.g., Keis Adwan headed the al-Najah 
Student Union), but they had not made an impact beyond the local arena (Frish and Waked 
2003; Hamas Website 2005; Harel and Sobelamn 2001; Rubin 2002). The Nablus network, 
however, included several high-ranking members of Hamas, most notably Mohammed Def 
and Hassan Salameh, which suggests that it was less independent in its activities than the 
newer networks. Second, all of the hubs had a long history of confrontation with Israeli 
security forces. Some had served time in Israeli jails and most had seen loved ones harmed 
or killed by Israeli security forces. For example, Muhi al-Din al-Sharif , the Nablus network hub, 
was a Close disciple of Yehiya Ayash (known as “The Engineer”), a leader of the military arm 
of Hamas in the mid-1990s. When Ayash was eliminated by Israel, Sharif set up a network of 
suicide bombers. He recruited small cells of activists in the Nablus area from the Abu Varda 
family, one of the largest families in the region, for a retaliation campaign (Dudkevitch and 
O'Sullivan 1998; Luft 2003). Hassan Salameh, a central activist in the network, stated in an 
interview that the motivating factor behind the network was not a strategic interest but the 
desire for revenge (Shiuouhki 1996). Keis Adwan, the hub of the northern Samaria network, 
had also lost a number of close associates in Israeli security forces operations (Rubin 2002). 
The death of his friend and flat-mate Zachariah Kilani early in the intifada was probably the final 
push leading him to establish a network of suicide bombers (Rubin 2002). 

Despite of the fact that hubs have been motivated by personal factors, we believe that their 
main motive was the struggle among the local networks that developed in the Palestinian 
Authority territories during the al-Aqsa Intifada. This assertion reinforces Bloom's arguments, 
but on a different analytical level. The suicide campaign stemmed from competition for political 
power, but this competition was on the /oca/, rather than the national, group level. 

An example is the Hebron network, operated by heads of the Kawasmeh clan. The activities 
of this network began as a result of the continuing struggle with the local rival network led by 
Mohammed Sidr, head of the Palestinian Islamic Jihad in Hebron, over control of the city and 
the support of the local population (Regular 2003). Accordingly, the successful attack 
committed by Sidr’s network in November 2002, in which Hebron Israeili Defense Forces 
Commander Dror Weinberg was killed — an attack that earned the network considerable 
prestige — was one of the prominent events that motivated the Kawasmeh family to begin 
recruiting activists from the local soccer club in order to establish the network (Issacharof and 
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Harel 2004:292). This network then went on to execute eight suicide bombings in the following 
six months. 

There was also fierce competition waged between the Northern Samaria and Jenin networks. 
While they were at odds over an area of overlying operational regions, the intense competition 
between the two found expression mainly in the dispatching of a series of suicide bombers in 
response to the successes of the other network's suicide bombings (Hammer 2001). 

In an interview with an aide to Jamal Abu al-Hija, the hub of the northern Samaria network, 
he likened the competition in Jenin to a type of sporting contest between local soccer clubs 
struggling for status and their spectators’ admiration (BBC Correspondent 2002). Other 
operatives repeated the fact that competition between local networks was an important 
factor leading to the launching of suicide attacks (Harel 2002). Competition was so intense 
that sometimes the networks hastened to take responsibility for operations even before it 
became evident whether these had been carried out by their own members or not — just to 
deprive the rival network of the dividends of public support. 

Thus, for example, after the suicide attack at the Sbarro restaurant in Jerusalem, the Jenin 
network was quick to take credit even though the attack was perpetrated by the northern Samaria 
network (Williams 2001). An examination of the timing of the suicide attacks by the two networks 
also illustrates the intense competition that developed between them. After the perpetration of 
four suicide attacks from December 2000 to April 2001 by the northern Samaria network, the 
Jenin network responded with five suicide attacks between May and August of the same year. 
From then on, the two networks continued to initiate suicide attacks alternatingly, each of them 
rushing to plaster the streets of Jenin with pictures of the “suicide bomber of the day” after a 
“successful” attack (Hammer 2001). The competition continued until the commencement of 
Operation Defensive Shield on March 29, 2002. In the wake of this Israeli military assault of 
unprecedented force, the networks united and operated together (Ward 2002). 

We would like to emphasize two points before concluding this section. First, our effort to 
identify factors that led to the formation of the networks and their suicide attacks should not be 
taken to imply that strategic considerations have had no impact. They have. All the networks 
have operated in the context of the Palestinian-Israeli conflict and have viewed their acts as part 
of the Palestinian struggle for sovereignty. Moreover, some of the networks’ suicide attacks did 
have strategic aims. One such aim was to show the PNA (Palestinian National Authority) that 
it could not act without taking the networks’ positions into consideration (Laub 2003)."° They 
used suicide attacks to sabotage Israeli-Palestinian negotiations, to which they objected. Thus, 
the Jenin network launched a suicide attack in May 2001 after publication of the Mitchell 
Report." The Hebron network launched a suicide attack on May 17, 2003, a day after Prime 
Minister Sharon and Palestinian Prime Minister Abu Mazen met. The other strategic aim was to 
deter Israel from assassinating Palestinian leaders. Hence, they initiated attacks in response 
to Israel's targeted killings. For example, the attack by the northern Samaria network in 
December 2001 was launched in retaliation for Israel's assassination some weeks earlier of the 
Hamas leader, Mahmoud Abu Honod (Goldenberg 2001). 

Our second point is that the hubs of the three newer networks have managed their 
networks independently, even though they maintained ties with the leadership of the groups 
in whose name they were ostensibly operating. The groups’ top leadership was located 
outside Palestinian Authority territories and Israel restricted leadership ties with the hubs to 
the provision of financial assistance and general guidance (Center for Strategic Studies 2005). 
This left operational assignments, choosing the timing for each campaign, and running the 
network in the hands of the hubs. Tabat Mardawi, the hub of the Jenin network, confirmed 
all of these details in an interview and added that the leaderships of the groups received 
reports only after attacks (Harel and Barak 2002). The main purpose of the reports was to 
enable the group with which the network wanted to be affiliated to accept responsibility for 
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the attack (Levitt 2005). The hubs’ organizational affinities were purely instrumental. Hubs 
would not hesitate to switch allegiance from one group to another to make the most of their 
changing interests.12 As already noted, the Nablus network was strongly connected to the 
Hamas leadership. According to a member of this network, Hassan Salameh, the network's 
hub, Muhi al-Din al-Sharif , was supported and sometimes directed by the leaders of ithe 
Hamas military wing (Rabin 1996). 

In short, our findings refute the prevailing view that the suicide attacks of recent years 
have been the product of strategic decisions made within organizational frameworks. They 
show that after 2000, decisions were made by local activists, and that the best predictor of 
their decisions was rivalry between local or family groups. 


Network Characteristics 


Several features were common to most of the networks we examined. First, was their size. 
All but the Nablus network had between 36 and 49 activists.'* This was large enough to 
establish an operational infrastructure that included intelligence, logistics, transport, etc. and 
could carry out suicide attacks, but small enough to maintain a certain level of secrecy. The 
Hamas support enjoyed by the Nablus network may be what enabled it to operate with fewer 
members. 

Second, all the networks had hubs who were connected to most of the actors. In other 
words, they were “scale-free.”"* This suggests that they did not develop without a guiding 
hand, but that their hubs were the main vehicle in getting actors to join the network,'® whether 
by actively recruiting them or by serving as figures whom would-be members could 
approach. On the basis of Arquilla and Ronfeldt’s typology (Arquilla and Ronfeldt 2001), we 
submit that the structure of these networks lies between the “star” form that characterized al- 
Qa’ida prior to the invasion of Afghanistan, in which one main hub (central command) has 
directed all the nodes, and the “all channel mode” network form of the current global jihad, 
where there are no hubs and all nodes are connected to all others. The Nablus network, 
which has only one hub, is significantly closer to the star network than are the three newer 
networks. This is further indication that processes of change are at work, with the newer 
networks moving towards the elusive all channel structure. 

Our claim about the hubs’ pivotal role in recruitment is reinforced by our examination of the 
networks’ recruiting methods and the prior ties between recruiter and recruit. For example, in 
the Hebron network, we identified two sources of recruitment. Some of the suicide bombers 
were recruited by heads of the Kawasmeh clan from the local mosque’s soccer team, which 
included many clan members. By visiting the soccer team members on a regular basis, one 
of the hubs, Abdullah Kawasmeh, succeeded in recruiting youngsters from outside the clan 
as well (Regular and Harel 2003a; Regular and Hare! 2003b; Tierney 2003). The second source 
of recruitment was Hebron’s Polytechnic Institute, whose offices are located in the Abu Ktila 
neighborhood, where the Kawasmeh clan lives. Their proximity to the clan made it easier for 
interested students to become network members and also enabled the hubs to recruit 
potential members through their many acquaintances in the area (Regular and Harel 2003b). 

In the northern Samaria network, Keis Adwan exploited his position as former head of the 
al-Najah Student Union to recruit students to his network. Nursi Tawalbeh, hub of the Jenin 
network, worked with other hubs to select candidates from within their extended families and 
circle of friends. Nursi Tawalbeh sent his 18-year-old brother Morad to carry out a suicide 
attack in Afula. In the end, Morad changed his mind and did not blow himself up (Hendawi 
2002). These examples suggest that the hubs used their social ties to conscript most of the 
networks’ activists from the surrounding community. 
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They also suggest the importance of collective support for the network. These 
characteristics are not unique to terrorist networks, but also can be observed in other types of 
social networks such as urban street gangs (Klandermans and Oegema 1987; McAdam and 
Paoulsen 1993). In both cases, membership in the network is supported by the community and 
serves as a source of status, identity, cohesion, self-esteem and a sense of belonging 
(Browning, Feinberg and Dietz 2005; Ruble and Turner 2000). Moreover, in both cases (Ruble 
and Turner 2000; Venkatesh 1996), the support of the community helps motivate the network to 
continue its operations and facilitates recruitment of new members by increasing what 
Klandermans and Oegema term their “mobilization potential” (Klandermans and Oegema 1987). 
Taking into account the obvious differences, it seems that the same alienation of community 
residents from the government and its officials that led to the emergence of violent street gangs 
(Venkatesh 1996) also facilitated the emergence of local Palestinian networks. In the latter case, 
the local Palestinian community became alienated from Israel and the PNA. 

The main conclusion is that, while terrorist networks differ from other less radical social 
networks or social movements in many respects, there is much resemblance in their inter- 
group dynamics and their relations with the surrounding community. 

The third, and perhaps most significant, feature that most Palestinian networks share is 
the peripheral position of the suicide bombers in the networks and their relatively low number 
of ties. Although one might expect isolated actors to be rare in networks so heavily based on 
social ties, the findings presented in Table 3 support the view, held by most researchers in the 
field, that suicide bombers are weapons in the service of the elite, which prefers to dispatch 
individuals who are of minor importance and who contribute little to the network's continued 
operation (Rothenberg 2001). 

As shown in Table 3, the average levels of the suicide bombers’ centrality, closeness and 
betweenness were all considerably lower than the network averages. The F-test for verifying the 
disparities between suicide and non-suicide bomber populations showed significant disparities 
in their levels of closeness and betweenness. This indicates that suicide bombers had no direct 
or indirect acquaintance with most network members. It also emphasizes the fact that these 
networks did not have the “all channel” structure usually found in extremist cells. 

Of course, there is an operational logic in using as suicide bombers persons with few ties 
to other network members. Since the suicide bomber is the actor most likely to be caught, it is 
imperative that he or she possess as little information as possible about the network. Moreover, 
suicide bombers cannot be nodes linking subgroups in the network; if they were, their suicide 
would lead to the network's disintegration. Figures 1, 2 and 4 in the Appendix, which represent 
suicide bomber networks in Nablus, northern Samaria and Jenin, illustrate the peripheral status 
of suicide bombers and their position on the outermost circle of the network. 

Figure 3 in the Appendix, showing the Hebron network, looks somewhat different. The 
suicide bombers in this network are not as peripheral as those in the other three networks. 
The reason is that this network is based mainly on one extended family. This difference leads 
us to conclude that the more the network is built on strong or-primordial ties, the less 
peripheral its suicide bombers. 

The fact that the status and roles of hubs and suicide bombers are identical in the different 
networks prompts us to assume that actors with similar locations in the various networks will 
adopt roles with similar features. Hence, in the next stage, we will try to identify subgroups 
with distinct roles that are present in all the networks. To this end, we conducted kK-core 
analyses, which provided an efficient method for identifying the subgroups in each network. 

These analyses showed a connection between the roles of the actors in a network and 
their internal division into subgroups. Although the results do not totally correlate with the 
actors’ roles in the network, and sometimes there is overlap among subgroups, it is still 
possible to see that the networks, apart from the one in Hebron, consist of four types of 


The Changing Nature of Suicide Attacks * 1997 





‘soydures ]TV , 
Wv=4 ech L80L 16 G82 096 9¢° 10 Gee 00° cv 91'6V (uea\) 
SsauUaaMjag 
#W9=4 OLY 0c0S 899 99°GS 26 1S OV LZ 86 0S L6eS = cb 6L 0c Or LE0G= ©0518 (ueey\) 
Ssauesoy) 
w=4 GLY 08's KANG 98°02 9Lel 61 9r 9'€ LLG vl9¢ phd Sch = GLE (ueay\) 
Ayequed 

slequiog + siequa|y Sqn siequiog siequajyy sqny sieqwiog siequiajyy sqny sieqwog siequiayy sqny 

apioing apioing apioing apiaing 

,PAOUY uluar uolqaH PLIELUES WAUON sniqen SOMON 





YIOMJON aU} Ul SIaqUIOg apLIng Jo 2[OY *¢ 2TQUL. 


1998 © Social Forces Volume 84, Number 4 * June 2006 


subgroups: (1) the hubs and main activists; (2) subgroups of collaborators on different levels 
that sometimes also include suicide bombers; (3) subgroups of suicide bombers that 
sometimes also include a number of peripheral activists; (4) suicide bombers who are so 
peripheral that they do not belong to subgroups in the network, or who, along with another 
bomber, constitute a tiny subgroup. 

The best correspondence between subgroups and the division of roles in the network was 
in the Nablus network. This indicates, once again, that this network, especially in comparison 
with the newer networks, is based less on horizontal ties and has a more hierarchical structure. 
In this network, hubs and main activists form one subgroup. Alongside, a group of 
collaborators also constitutes a subgroup. As for the suicide bombers, these belong to two 
small subgroups comprised only of bombers. In fact, every two suicide bombers formed a 
distinct subgroup. The reason for the division between the two subgroups of suicide bombers 
was related to the timing of their recruitment and where they lived. While one subgroup of 
suicide bombers was from the Nablus region, suicide bombers in the other subgroup came 
from the al-Fawar refugee camp. These circumstances, combined with the fact that there was 
no such distinction between the various subgroups in the Hebron network (all members had 
multiple family ties and lived in the same place), shows that the composition of subgroups is 
also affected by socio-demographic variables with respect to suicide bombers. This further 
reinforces the assumption that new members join the network through acquaintance, 
friendship or family ties that have developed in distinct geographic areas and not as a result 
of a recruitment process, the way it is done in established hierarchical groups. 

We conclude this section with an examination of the relationship between network 
structure and network effectiveness, defined by the network’s ability to carry out suicide 
attacks. To this end, we compared the Nablus network, which carried out only four attacks, 
with the Jenin, northern Samaria and Hebron networks, each of which perpetrated more than 
double that number. 

We found three main differences between the Nablus network and the three newer 
networks. First, as noted above, the Nablus network has fewer activists and is less based on 
horizontal ties. These features seem to have constricted its ability to exploit an important 
advantage of the horizontal network structure — the ability to absorb the loss of nodes (actors) 
without disintegrating. Second, all the more successful networks had substantially more 
cliques’® than the Nablus network. While we found only nine cliques of at least three 
members each in the Nablus network, we found 24 in the northern Samaria network, 75 in the 
Hebron network, and 118 in the Jenin network. Even though the number of cliques may be 
related to the number of members in the network, our findings showed a greater relative 
disparity in the number of cliques in the Nablus and other networks than in the numbers of 
their members. This means that the existence of cohesive subgroups within a network may 
be a predictor of its effectiveness. 

The third relevant difference, and the most important one in terms of network success, 
was the number of hubs. The Nablus network had just one hub; the other three had between 
two and five. As can be seen, the more hubs the network had, the more suicide attacks it 
perpetrated. The explanation is that the hubs assist in the preparation of the infrastructure 
before each operation. More hubs means that more attacks, including parallel attacks, can be 
carried out. 

To confirm this assumption, we identified the number of actors in each network who were 
highly connected to other members, even though they did not meet the criteria for being 
defined as hubs.’” We found that, while the Nablus network had three such actors, the 
Hebron network had 10; the Jenin network had eight, and the northern Samaria network, four. 
It is important to note that the size of the network was not an advantage if there were only a 
few hubs. A network with a large number of members but a single hub will have difficulty in 
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being effective because the regular actors, who do not have many ties in the network, will 
have limited ability in recruiting enough actors to carry out a suicide attack. 

In addition, the connection between the number of hubs and the effectiveness of a 
network also has to do with the fact that having more hubs increases network resilience. This 
point will be discussed further in the next section. 


Coping with Suicide Bomber Networks 


Scale-free networks have two basic qualities that affect their resilience. First, they can 
withstand random failures. Since there are far more regular actors than hubs, the chances of 
a random injury or elimination of a hub are low. By the same token, the loss of one of the 
regular actors will not cause the disintegration of the network because hubs are highly 
connected (Laszlo-Barabasi and Bonabeau 2003). That is why regular army units can sustain 
30 percent casualties before they become ineffective while networks have been shown to 
endure 70 percent attrition rates and remain operational (McAllister 2004). Even so, the 
second attribute of scale-free networks assumes that damage to one of the hubs will indeed 
lead to the speedy disintegration of the network so that the more hubs there are, the greater 
the resilience of the network against direct attacks. 

An illustration of this type of resilience follows: On June 23, 2003, Israel assassinated 
Abdullah Kawasmeh, a hub in the Hebron network. Basel, another hub, continued to operate 
the network. Three months later, Basel was also killed, and Imad Kawasmeh, who had not 
previously been a hub, but who did have 70 percent above the average centrality of network 
members, took over. In the case of the northern Samaria network, after the assassination of 
Keis Adwan in August 2002, he was replaced by Jamal Abu al-Hija, another hub in that 
network. In Jenin, the death of Nursi Tawalbeh during Operation Defensive Shield led to his 
replacement by lyad Sawalha, who ranked sixth in the degree of centrality in the network. 

This leads to a number of conclusions. First, it is reasonable to state that after a hub 
suffers injury or is eliminated, he will be replaced by an actor who has the highest level of 
centrality. Second, in all cases, the replacement hub will be part of the subgroup that included 
all of the network's hubs. Third, selective attacks against hubs have minor influence on the 
network, and, in most cases, do not affect its continued existence. 


Conclusions 


In contrast to most important works in this field, which have assumed that the use of suicide 
bombers is a product of hierarchical organizational systems, and, furthermore, in the wake 
of changes we observed in the features of this phenomenon in recent years, we decided to 
adopt a horizontal, rather than a hierarchical-formal, approach. The methodical analysis of 
social networks enabled us to closely examine this phenomenon from a different angle. 

The analysis of the Palestinian suicide bomber phenomenon through a network prism has led 
us to a number of main conclusions. First, networks are local and cross-organizational. They are 
based mainly on primordial ties of family and friends and many of those who joined the networks 
never belonged to any organization. In other cases, organizational affiliation was secondary to 
network membership, which, to a great extent, is a tribal quality. This confirms our contention in 
regard to the decline in the importance of hierarchical groups in producing suicide attacks. 

Second, networks do not have leaders, but rather hubs. These do not enjoy a formal 
status, but they do navigate the network's operations. In most cases, we did not manage to 
find proof demonstrating that hubs received direct orders from groups, or that their own 
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operations were based on primarily strategic motivation. Still, we did find strong evidence of 
the use of suicide bombers as a component in the struggle for political power among families 
or groups on a local level. 

Third, suicide bombers are not recruited, nor do they undergo a training process. They are 
peripheral figures in the network who join the ranks ad hoc, from the environment close to the 
network, for the purpose of carrying out a suicide attack. Fourth, we found that the structure 
of all the networks and the division of roles were almost identical. Similarly, we found a 
consistent correlation between the network's characteristics and the nature of its operations. 
Thus, for example, a large number of hubs and subgroups in a network were associated with 
a higher level of effectiveness. 

This means that, despite the amorphous image of the networks, there is a considerable 
systematic element in the structure and nature of their operations. Fifth, we can gather from 
the above on how to cope with this phenomenon, or, as shown in this paper, what 
countermeasures might seem to be ineffective. For example, we learn that targeting specific 
hubs will not — in most cases — put an end to the existence and operation of the network. 

Despite the limitations of the current research, which was based on the investigation of 
one arena and on information that was collected from overt sources only, we believe that a 
close look at the arenas of other suicide bombers through a network prism will enable theory 
to keep pace with the changing empirical reality. 


Notes 


1. They present three types of networks: (1) “chain” networks, which link two ends of the 
organization via a series of intermediaries; (2) “star” networks, in which various nodes are 
independently linked to a central hub; and (3) “all channel” networks, in which all the 
nodes are connected individually to all the other nodes. 


2. From here on, the word “networks” refers to horizontal networks as previously defined. 


3. Degree centrality is the ratio of each actor's actual ties to his potential ties (Freeman, 
Roeder and Mulholland 1980). 


4. The closeness measure helps us assess the importance of each actor's place in the 
network by measuring the distance between him and all the other actors. In contrast to 
degree centrality, which involves only direct ties, it also allows us to examine the indirect 
ties between actors (Borgatti 2005). 


5. Betweenness assists in identifying those actors who connect between subgroups or lone 
actors within the network (Borgatti 2005:59-60; Freeman 1977). 


6. A“K-core” is amaximal group of actors, all of whom are connected to some number (k) of 
other members of the group. 


7. \sraeli Media: Jerusalem Post (www.jpost.co.il); International Media: Washington Post 
(www.washingtonpost.com), New York Times (www.nytimes.com), Guardian 
(www.guardian.co.uk), BBC (www.bbc.co.uk), and the Information Center for 
Intelligence and Terrorism (http://www.intelligence.org.il/sp/default.htm). 


8. For information about the software, go to: http:/Awww.analytictech.com. 


The Changing Nature of Suicide Attacks + 2001 


9. Actors whose level of centrality is more than twice the average of all the network actors 
(Wasserman and Faust 1994:178-180). 


10. Also see interview with Hamas spokesmen Abd-al-Aziz al-Rantisi in “Hamas Official Vows 
to Continue to ‘Incite’ Against Israeli ‘Killers.’ " A/-Jazeera TV, 13:14 GMT, April 29, 2003. 


11. The Mitchell Report, published in April 2001, investigated the factors that led to the al- 
Aqsa Intifada and called on both sides to stop their violent acts. 


12. For example, Tabat Mardawi (Hendawi 2002) and Nursi Tawalbeh (Harel and Barak 2002) 
began their terrorist involvement in Fatah and then switched allegiance to the Palestinian 
Islamic Jihad. In Table 1, we indicate their affiliation at the peak of their network's terrorist 
activity. 


13. Only actors who had some role in a network suicide operation were included in the 
network. We used this criterion to avoid including friendship and family ties, which could 
involve hundreds of people who took no part in the terrorist activities. 


14. Scale-free networks include “regular” actors with an average number of ties as well as 
several actors who have well above the average. In this type of network, we can assume 
that the structure is growing due to the hub’s activity and his or her abilities to create new 
ties and expand the network (Laszlo-Barabasi and Bonabeau 2002). 


15. The fact is that in most social movements — not just violent ones — prior contact between 
the recruiting agent and the recruit is necessary. Hence, social networks are important 
factors in the growth of such groups (Snow, Zurcher and Ekland-Olson 1980:789). 


16. In a subgroup that is a clique, every actor is connected to all the other actors. 


17. Their degree of centrality was not twice the average of all the network actors (as with the 
hubs), but only 50 percent higher. 
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Abstract 

We evaluate the argument that Islamist terrorist attacks represent a distinctive “4" wave” 
of transnational terrorism that has supplanted Leftist terrorism. Drawing on ITERATE 
data for 1968-2003, the annual count of Leftist attacks has declined since the end of the 
Cold War while Islamist attacks have persisted and spiked upward in 2002-03. Pooled 
cross-sectional time-series regression demonstrates that the generation of Islamist terrorism 
is more deeply rooted in the social strains created by modernization, the competition 
between Islam and other religions, and the growth of secular government. Leftist terrorism 
was uniquely stimulated by Cold War rivalry (and subsequently declined thereafter). Both 
forms of transnational terrorism display a kindred nature, however, in that both are 
encouraged by the social strains of transitional development and the political opportunities 
created by increasing political rights. Moreover, foreign direct investment is associated with 
reduced transnational terrorism over time, calling into question theories that stress global 
order and anti-systemic violence against international capitalism. 


Between 1968 and 2003 there were more than 6,100 transnational terrorist attacks, causing 
more than 36,000 deaths and injuries (Mickolus, Sandler, Murdock and Flemin 2008). For the 
United States, the watershed event was the 2001 destruction of the World Trade Center and 
the attack on the Pentagon, which set off a new round of speculation about the genesis of 
transnational terrorism. Alongside standard explanations emphasizing inequality and the 
social strains of transitional development, many now suggest globalization (Lake 2002), 
political repression (Hafez 2003), the cultural clash between Western and Islamist values 
(Huntington 1996), and the pro-Israeli stance of the United States ak 2005) as the major 
causes of contemporary transnational terrorism. 

One salient argument conceptualizes transnational terrorism in “waves.” Arguing that the 
collapse of the Soviet Union undermined the legitimacy of revolutionary Marxist ideology 
(Chirot 1991; Snow 1996), some contend that the Iranian revolution of 1979 marked the 
beginning of a “4th wave" of international terrorism rooted in Islamist ideologies (Rapoport 
2004; Laqueur 2001; Snow 1996). Where “3rd wave” Leftist terrorism traditionally used a 
national liberation framework to make appeals to the middle and lower classes of less 
developed countries, this new Islamist terrorism is seen as making broader multi-class 
appeals, using more lethal tactics justified in religious terms, and is more organizationally 
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consolidated (Rapoport 2004; see also Hoffman 1998; Enders and Sandler 2000; Laqueur 
2001; Stern 2002; Juergensmeyer 2003). 

Another way of looking at this change in transnational terrorism is as a clash of civilizations 
(Huntington 1998). In this view, the end of the Cold War ushered in a new geopolitical era in 
which the traditional superpower clash and related debates between Marxist and capitalist 
ideologies have been supplanted by cultural clashes defined in religious and civilizational 
terms. This has generated changes in the way nationalism is used to justify terrorism. In place 
of secular theories of nationalism, religion has become a new basis for nationalist ideology 
(Juergensmeyer 1995) and is therefore used to justify transnational terrorism. 

This perspective suggests continuities and significant differences in the “root” causes that 
drive the two kinds of transnational terrorism. Both may be essentially anti-colonial struggles, 
at least in the minds of the terrorists themselves. Moreover, as cultural traditionalism erodes 
under the rapid onslaught of social change, social disorganization likely instigates both forms 
of terrorism. On the other hand, the orientation of the followers of these two ideologies 
toward the state should differ. Leftist terrorists are not hostile to the idea of the secular 
national state, whereas Islamist terrorists may well be. For example, the power or 
intrusiveness of the state may not stimulate Leftist terrorism, who might well cheer such 
“social progress,” whereas a strong, intrusive state might spark Islamist resistance. We would 
also expect differences in the social frames and the interpretation of symbolic global events. 
Specifically, the Iranian revolution might have inspired Islamist terrorism and led to 
proselytizing attempts to export this revolutionary model to other Muslim societies, while the 
collapse of the Soviet Union might have reduced Leftist terrorism. 

This study examines whether there have been distinct third and fourth waves of 
international terrorism and, second, their sociological origins. We use the ideology of the 
groups that committed transnational terrorism to identify Leftist and Islamist transnational 
terrorist attacks between 1973 and 2002. While several studies of transnational terrorism have 
focused on the country contexts in which attacks occur, our analysis focuses on the country 
contexts that produce transnational terrorism (i.e., those countries that produced the 
terrorists themselves). By transnational terrorism, we mean the “premeditated use of extra- 
normal violence or brutality by subnational groups to obtain a political, religious or ideological 
objective through the intimidation of a huge audience, usually not involved with the policy 
making that the terrorists seek to influence.” (Enders and Sandler 2002: 145-46; also Mickolus 
et al. 2003; Li and Schaub 2004; Li 2005) We apply a set of common variables to the two 
samples to detect the social factors that drive these seemingly different forms of 
transnational terrorism. 


Explaining Transnational Terrorism 


The existing literature on terrorism largely focuses on the ideology and psychology of 
terrorists (e.g., Juergensmeyer 2008, Stern 2008), terrorist recruitment (e.g., Sageman 2004) 
and types of attacks (e.g., Pape 2005, Bloom 2005), or grand debates over the impoverished 
vs. privileged backgrounds of individual terrorists (e.g., Krueger and Maleckova 2003). While 
these are useful, they do not tell us all that we might wish to know about the social, political 
and economic contexts that encourage transnational terrorist violence. Many of the current 
arguments about neo-colonial dependency, U.S. military intrusion and civilizational clash can 
best be tested with a macrosociological approach. We begin by applying some standard 
theories of political violence to the generation of transnational terrorism, and then discuss 
newer perspectives involving globalization and civilizational clash. We organize these theories 
into broad thematic “families” below. 
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Theories of Social Disorganization/Strain 


Some classic arguments posit that rapid development creates social disorganization and 
strains, thereby encouraging political violence. Terrorism (like other forms of political violence) 
is more likely at intermediate levels of economic development and in traditional societies 
experiencing rapid economic change (Huntington 1968, 1996). Traditional societies often 
exhibit a uniformity (or social segmentation) and internal cohesion that make conflict less 
likely (Simmel 1964; Coser 1964). Modern societies also exhibit equilibrium; they have greater 
functional differentiation and social interdependence, affluence and less inequality. During 
transitional development, however, rapid urbanization, growing inequalities, the presence of 
foreigners, the erosion of traditional social norms and the rapid growth of new institutions 
(such schools and urban employment) create both grievances and ideal recruiting grounds for 
terrorist organizations. 

A related issue is demographic change. While demographics alone cannot explain 
international terrorism, many say that population structure plays a role in creating grievances. 
Rapid population growth redistributes resources away from labor, which may increase 
economic grievances (Goldstone 1997). Several researchers have argued that a ‘youth bulge’ 
of unattached, unemployed men encourages generalized violence and terrorism because 
they are easily recruited by radical causes (Crenshaw 1981; Huntington 1998; Hudson and 
DeBoer 2002). Although several studies have found some effect of GDP per capita on the 
targets of international terrorism (i.e., more developed countries are more likely targets), 
neither development nor economic growth seem to affect the creation of attacks or their 
country location (Krueger and Laitin 2003; Lai 2004; Li and Schaub 2004). These findings, 
however, are based on limited time periods and samples that need further evaluation. 

Researchers have also noted that terrorists often have some university education (Schmid 
et al. 1988). In developing societies, students in formal education may be culturally caught 
between two worlds, one traditional and the other modern. Given that schools and 
universities are often incubators for idealized views of the world, they expose the young to 
ideologies that promote radical change and social activism. Thus formal educational systems, 
particularly those that are not balanced by a dynamic economy and strong civil society, may 
provide recruiting grounds for radical causes. While Kurrlid-Klitgaard et al. (2005) find no 
effects of secondary schooling on international terrorism in the late 1990s, Testas (2004) finds 
positive effects in a subsample of Muslim majority countries. 


Theories of Political Order 


Many argue that quasi-authoritarian and partial democracies are more prone to internal armed 
conflicts and civil war (Schock 1996; Marshall and Gurr 2003; Hegre et al. 2001). These states 
are semi-open, creating opportunities for dissident mobilization but are sufficiently repressive 
to stoke moral outrage at restrictions on political activities. Studies to date have found mixed 
results. Lai (2004) found that strong autocracies produce less international terrorism, while 
Krueger and Laitin (2003) found that civil liberties are negatively related to the generation of 
terrorism as well as to the likelihood of becoming its target. On the other hand, Eubank and 
Weinberg (2001) and Li and Schaub (2004) found that stable democracies are the targets of 
more attacks. As a refinement, Li (2005) suggests that democratic participation reduces 
terrorism but that political rights (in the form of constraints on executive power) encourage 
such attacks. In addition, regime transitions, especially shifts to autocratic rule, generate more 
attacks (Lai 2004; Eubank and Weinberg 1994, 1998). This latter idea suggests that repressive 
states, especially those that combine “hard” repression (coercive violence) with claims to 
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popular responsiveness, are likely to incite dissident terrorism. Several studies show that the 
effects of hard repression on nonviolent protest are curvilinear with intermediate levels 
stimulating the most protest (Rasler 1996; Francisco 1996, 2004). However, Testas (2004) 
found the opposite effect on the production of international terrorist attacks in asample of 37 
Muslim majority countries, which again suggests the need to differentiate ideological 
traditions. It is also possible that both repression and opportunities generate attacks in a 
larger sample of countries. 

Strong states (those with the fiscal and administrative capacities to contro! their 
populations) typically generate sufficient economic growth to both foster legitimacy and 
finance the armed force and intelligence needed to thwart potential rebellion (Goodwin 2001; 
Jenkins and Schock 2004). By extension, weak states that lack fiscal and administrative 
capacities should be more likely to generate transnational terrorism, but this proposition has 
received mixed results. Lai (2004) found that government expenditures as a percentage of 
GDP reduces the production of international terrorism, but Li and Schaub (2004) found that 
government capability was positively related to the production of attacks. These arguments 
need further assessment. 


Theories of Global Order/Anti-Systemic Violence 


Some theories implicate neo-colonial control, international dependency, and the political- 
economic hegemony of core nations in international terrorism. Just as earthquakes are 
telltales of continental drift, some see international terrorism as anti-systemic violence that 
indicates shifts in global hegemony and economic order. Terrorism is a political tool of the 
weak, and as global hegemony shifts or falters, the production of transnational terrorism 
increases in the poorer zones of the world-system with core countries as the major targets 
(Bergesen and Lizardo 2004). 

These arguments have received mixed empirical support. In a cross-country time-series 
analysis, Li and Schaub (2004) find no relationship between foreign direct investment and 
portfolio investment based on the country location of international terrorist attacks. Yet 
Lizardo's (2004) global aggregate time series finds that world trade and the number of trade 
agreements is negatively related to anti-U.S. attacks, suggesting that, as U.S. global 
dominance falters and trade crises occur, attacks against the core increase. In contrast, Sobek 
and Braithwaite (2005) find that greater U.S. global dominance produces more anti-U.S. 
attacks. Midlarsky et al. (1980) frame this as a simple question of global exposure: the 
increasing spread of Western targets (foreign manufacturing plants, embassies, military 
bases, tourists) creates more targets for attack. Pape (2005) contends that U.S. military basing 
policy in the Middle East is a key precipitant of anti-U.S. suicide terrorist attacks, but his 
research is narrowly focused on some 300 acts of suicide bombing. Clearly, tests of theories 
of global order may require a more finely-grained analysis of types of transnational terrorism. 


Identity/Civilizational Theories 


Although Huntington's (1996) “clash of civilizations” thesis is primarily about new fault lines in 
post-Cold War international relations, he also argues that this clash, especially that between 
“Islam and the West,” has mobilized violent attacks. Civilizations, as dense bodies of cultural 
understandings and social organization, provide a significant component of human identity as 
the largest meaningful in-groups in the world. Civilizational constellations outside one’s own 
are viewed as the “out-group,” setting the stage for confrontation. According to Huntington, 
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the collapse of the globe-striding modernity project has left in its wake a reassertion of an 
older cultural basis of global organization, which he terms “civilizational zones.” Transnational 
terrorism is one form of this struggle between civilizations, and its increase is due to declining 
Western hegemony and the resurgence of the world’s other cultures. Transnational terrorism 
should be concentrated in fault zones between the major world civilizations with targets 
defined by civilizational division. In arguing that declining Western hegemony encourages 
these attacks, this is similar to world-systems theory, but the “seismic shifts” are cultural 
rather than economic. 

This argument has special relevance to Islamist terrorism. According to the logic of 
“civilizational clash,” we would expect higher production of Islamist attacks originating in 
countries characterized by greater competition between Islam and other religions. Moreover, 
radical Islamist ideologies lead us to expect that particular social changes provide the impetus 
for terrorism. Unlike Leftists, who see the state as a tool of ultimate social reform, many 
Islamist radicals reject the secular state and its social innovations. Among the most salient 
“statist” innovations is the freedom/liberation of women. As Juergensmeyer (2003: 198-202) 
notes, because some Islamist interpretations see gender relations as divinely ordained, the 
state’s tendency to elevate the status of women is a major grievance. In addition, state 
intrusion into civil affairs, especially the regulation of religious activities, may be seen as a 
particular threat. 

Unique historical events are also likely to exert strong path dependencies on the 
trajectories of terrorism. Some scholars point to the successful Iranian Revolution of the late 
1970s as a watershed event, demonstrating to a global audience that a resurgent Islam could 
overcome secular states and their superpower sponsors (Shughart 2004; Laqueur 2001; 
Hoffman 1998), a lesson reinforced by the Soviet debacle in Afghanistan. For Leftist terrorism, 
the Soviet collapse of the early 1990s also changed the geopolitics of international insurgency. 
The perceived failure of the global socialist program deprived many “third wave” Marxist 
groups of legitimacy and the ability to play on superpower rivalry to mobilize military and 
political support (Lewis 2003: 61; Snow 1996). Dissidents had to develop a new political 
agenda to legitimate their claims, which may have facilitated the rise of some Islamist as well 
as ethno-separatist movements (Snow 1996). It also meant that many former Soviet client 
states experienced a major loss in coercive capacity and were no longer able to fend off 
insurgents. 


Methods 


Measuring Leftist and Islamist Transnational Terrorism 


To capture the structural sources of Leftist and Islamist transnational terrorism, we examine 
the number of transnational terrorist attacks by groups affiliated with these ideologies 
recorded in the ITERATE database (Mickolus et al. 2003) between 1973 and 2002. We focus 
on actual attacks, such as bombings, hijackings, hostage-taking, small arms and chemical 
attacks, and other uses of coercive force. We exclude hoaxes and threats because it is 
impossible to evaluate the credibility of these threats. Most of these likely reflect lower levels 
of insurgent organization. To qualify as a transnational terrorist attack, the attack must involve 
multiple nationalities (defined in terms of country of origin) in terms of its victims, the primary 
actors, and/or the location of the attack relative to the actor and target.' They must also be 
conducted by an autonomous non-state actor (i.e., a group that is not directly controlled by a 
sovereign state), have political goals, make use of extra-normal violence, and at least 
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ostensibly be designed so as to induce anxiety among various targets, including the general 
public (Mickolus et al. 2004: 2). 

We coded all ITERATE groups in terms of their primary ideology and selected those with a 
Marxist or general leftist orientation and those following an Islamist ideology. Figure 1 shows the 
count of annual attacks produced by this coding into Leftist and Islamist ideology. Ideology is 
identified by the group discourse outlining its major goals, self-justification and avowed 
collective identity. We relied on multiple sources for information on group ideology, including the 
published volumes of ITERATE (Mickolus et al. 2003 and various years); the Terrorism 
Knowledge Base website (www.tkb.org); the Information Project, a website maintained by 
international journalists that lists more than 4,600 groups (www.theinformationproject.org); 
Amnesty International (www.amnesty.org); the Global Security Project (www.globalsecurity.org); 
and general internet searches for press releases and scholarly publications. (The list of some 
1,200 ITERATE groups and their dominant ideologies is available from the authors). A small 
handful of groups (N = 3) justify themselves using both Leftist and Islamist rhetoric, calling 
simultaneously for a class revolution as well as Islamist rule. In these cases, we identified a 
primary or dominant ideology, which is used in Figure 1. In the regression analysis below, we 
also examined whether classifying these groups by their secondary ideology made a difference, 
finding the same pattern of statistically significant predictors. We therefore show regression 
results using primary ideology. 

Our classification of ideology depends on the identity of the group, thus a key question is 
how accurately the ITERATE project captures group identity. In most cases, terrorist groups 
make their goals known to the general public by a communiqué to the press or government 
officials. The goal is to alert the public as to their goals and the justice of their cause. The 
ITERATE project uses multiple news sources and published documents, including those 
released by police and government officials, to establish group identities. Fifty-five percent of 
all international terrorist attacks in the ITERATE data set are linked to a named group or one with 
sufficient information to assess the ideological makeup of the group. We excluded unidentified 
attacks (e.g, groups listed as “unknowns,” “no group involved” and various “indeterminate 
{nationality}") from our data. These attacks are likely to be conducted by less organized groups, 
often “lone wolves” and small groups that dissolve or are arrested after one or a few attacks. 

Islamist groups treat Islamic faith as their primary justification for social change. They 
advocate the implementation of sharia law and the theocratic governance of society by a body 
of religious clerics in place of a secular regime (democratic or otherwise).? In the final count, 7 
percent of all transnational terrorist attacks in ITERATE between 1968 and 2002 were 
conducted by Islamist groups. 

Leftist organizations draw on Marxist and various neo-Marxist theories to justify their 
revolutionary strategy. A few groups are anarchist or anarchist-syndicalist or environmentally- 
oriented leftist groups that espouse the destruction of global capitalism. These groups are 
united by their anti-capitalist orientation and insistence on the need for a social revolution that 
frees the exploited (i.e., workers/peasants, other oppressed peoples, animals and the 
environment) from the capitalist system to become participants in a communal/collectivist or 
socialist-communist economic system. Seventeen percent of all transnational ITERATE 
attacks between 1968 and 2002 were conducted by Leftist groups. Table 1 provides 
descriptive statistics for our measures. 


Measuring Independent Variables 


The total population size of a country might matter in that a more populous country might 
simply produce more groups and individuals willing to engage in transnational terrorism. This 


Figure 1. The Number of Islamist and Leftist Transnational Terrorist Attacks, 1968-2003 





i ox 
‘ 
- 
wagons =< | 
a je 
ee 
TS 
a 
— 
ie 
deca Pirlo 
eee een 2 
ie 
ec = 


180 


Social Origins of Islamist and Leftist Terrorism * 2015 








160 





140 





120 





100 

















ta acloee ot 





€002 


6002 
L00¢ 
0002 
6661 
8661 
2661 
9661 
S661 
v661 
£661 
C661 
L661 
0661 
6861 
8861 
2861 
9861 
S861 
v86l 
€861 
C86 
L861 
0861 
6261 
8261 
L161) 
9261 
GZ6L 
vL6l 
€Z6L 
cL6) 
L461 
0Z61 
6961 


8961} 





—@— LEFTIST ATTACKS 


—g_ ISLAMIST ATTACKS 





2016 * Social Forces Volume 84, Number 4 * June 2006 


Table 1: Descriptive Statistics of Dependent and Independent Variables 


LS 


Variable N Mean _ S. Dev. Min Max 
Islamist Terrorist Attacks 3006 11 81 .00 27.00 
Leftist Terrorist Attacks 3006 36 1.83 .00 33.00 
% Urban 3006 50.18 23.86 4.13 97.39 
GDP/per capita 3006 7.60 1.56 4.44 10.76 
GDP/per capita Square 3006 ~=—- 60.21 24.44 19.71 115.68 
Political Rights (Freedom House) 3006 4.53 2.16 1.00 7.00 
Civil Liberties (Freedom House) 3006 4.41 1.84 1.00 7.00 
Trade /GDP 3006 = 70.99 38.61 6.32 282.40 
Foreign Investment /GDP 3006 2.93 5.36 00 145.20 
Western Military Dependency 3006 AT 50 .00 1.00 
Iranian Revolution (>1979) 3006 91 .28 .00 1.00 
Cold War (<1991) 3006 04 50 .00 1.00 
Government Consumption / GDP 3006 =: 116.30 6.79 2.98 64.39 
Log of % Muslim 2797 1.89 1.70 .00 4.61 
Log of % Muslim Square 2797 6.45 7.89 .00 Ziet 
Female Labor Force (%) 3006 37.44 9.07 5.52 52.72 


Female Labor Force X Govt Consumption 3006 610.49 309.88 57.41 2930.50 





also controls for logistic problems of policing large countries and other possible omitted 
covariates. We use a country’s total population size logged to correct for skewness. 

To capture strain and social disorganization, we use the quadratic of the natural log of the 
gross domestic product per capita (World Bank 2004). This captures the idea that low to 
intermediate levels of development produce increasing social strains and disorganization but 
that, as development reaches more advanced stages, such conditions ease and reduce social 
disorganization.? Second, we include a measure of the percentage of a nation’s population that 
is urban, assuming that this captures strains associated with urbanization (World Bank 2004). 
We also examined population effects (age-dependency ratio, the youth bulge) and educational 
effects (e.g., secondary school enrollments) but these variants of the disorganization/strain 
model failed to attain statistical significance (analyses available upon request). 

Theories of political order are captured by three measures. The idea that political democracy 
provides a conducive context for transnational terrorism is captured through two indicators: 
political rights and civil liberties (Freedom House 1972-2004). Political rights tap freedom of 
political participation and the right to political competition and representation, while civil liberties 
capture freedom of the press, association and speech as regulated by the state. The latter is 
especially important to religious expression and therefore key to Islamist radicals. We reversed 
the original scoring for these two measures, making “7” represent the “most free” and “1” those 
countries that are “least free.” The third measure is the size/revenue consumption of the state, 
which suggests the degree of government intrusiveness in a given society. We capture this with 
government consumption as a percent of GDP (World Bank 2004). 

Globalization is represented by three variables. Economic globalization is captured by two 
indicators: (1) trade openness (or imports plus exports as a percent of GDP); and (2) foreign 
capital penetration, measured by direct investment as percentage of GDP (World Bank 2004). 
These test the idea that dependent incorporation into the global economy produces a greater 
likelihood of transnational terrorism. 
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To capture political globalization, we use a dummy variable indicating dependence ona 
major Western power for arms supplies. The idea is that Western military presence 
undermines the autonomy and legitimacy of the state, encouraging political rebellion (Boswell 
and Dixon 1993) and the perception that the state is an agent of foreign powers, thereby 
increasing the likelihood of transnational terrorist attacks. This is measured as a dummy 
variable (1 = yes; 0 = no) representing less developed countries depending on a Western 
country for their arms supply (US ACDA 2003). 

Theories of identity and civilizational conflict suggest that Islamist terrorism may be rooted 
in political competition between religious groups, in particular competition between Muslims 
and non-Muslims. In the developing world, religious identities have become a primary basis for 
nationalist mobilization in general (Juergensmeyer 1995) and for insurgent mobilization in 
particular (Juergensmeyer 2003; Stern 2002). We gauge religious competition by using the 
quadratic of the percent of the population that is Muslim (Barrett 1982; Barrett et al. 2001). The 
linear effect should be positive, indicating that a larger Muslim population is more likely to 
spawn transnational terrorism, and the squared term should be negative, indicating a downturn 
as Muslims become numerically dominant and face less competition. We use the natural log 
to correct for skewness. Second, reflecting demonstration effects and transnational promotion 
of Islamist groups by the Iranian government and its political allies, we use a time dummy 
representing all years from 1979 onward (1 = yes; 0 = no). 

On the leftist side, we expect the Cold War years to have been the high point of 
transnational terrorism. Not only did the fall of the Soviet Union de-legitimate Marxist- 
Leninism, it also removed the Soviets’ financial and military assistance for various 
transnational terrorist groups around the world (O'Brien 1996). We therefore include a time 
dummy (scored 1 = yes; 0 = no) marking the years through 1990. 

Increased participation by women in the labor force may represent a distinctive cultural 
threat to Islamist radicals. We therefore use the percent of the labor force that is female 
(World Bank 2004). To test the Islamist response to the state’s promotion of women’s 
increased status in society, we introduce an interaction term between female workforce 
percentage and central government consumption. 


Analytic Design 


We use a pooled cross-sectional time-series design to capture the factors contributing to the 
number of transnational terrorist attacks produced by a country's citizens in a given country- 
year conducted by Leftist and Islamist groups. In contrast to other analyses that have focused 
on the country /ocation of transnational terrorist attacks (e.g., Enders and Sandler 2002; Li and 
Schaub 2004: Li 2005), we focus on the country production of transnational terrorist attacks. 
The country production is more relevant if social conditions explain these attacks. While 
country of location overlaps in about 52 percent of our attacks, this context of the attack may 
reflect more the location of the target than the social conditions stimulating the attack. 
Because our data on transnational terrorist attacks are in the form of yearly counts, we use 
negative binomial regression, a variant of Poisson regression, which is typically used to 
analyze event counts (Barron 1992; Land, McCall and Nagin 1996; Cameron and Tivendi 
1998). Negative binomial regression corrects for overdispersion in the count of events, a 
common condition that occurs when the variance of the count is greater than its mean. 
Negative binomial regression effectively controls for overdispersion by adding a stochastic 
component to the model (Land, McCall and Nagin 1996). Tests indicate that indeed our 
measure is overdispersed and that the Poisson method is inappropriate.* Tests also indicated 
the presence of autocorrelation, which we correct for by using the General Estimating 
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Equations (GEE) approach developed by Liang and Zeger (1986) and Zeger and Liang (1986). 
We used XTGEE in Stata 8.0, controlling for an AR1 term. 

We also conduct three robustness tests, comparing our GEE results with those from a 
Zero-Inflation Negative Binomial (or ZINB) model (Cameron and Tivendi 1998: 150-2; Long and 
Freeze 2001: 243-262), an OLS model with panel corrected standard errors, and regular 
negative binomial regression with fixed effects. All three robustness tests identified the same 
set of statistically significant predictors, strengthening our confidence in the results presented 
below. We lag all independent variables one year ahead to establish a stronger test of causal 
order. In our results, we report two-tailed tests of significance. Due to missing data on 
independent variables, our regression analysis is limited to 1973-2002. 


Results 


Table 2 provides the results of our general linear negative binomial models applied to Islamist 
and Leftist transnational terrorist attacks. In models 1 and 7, we present the effects of our 
strain indicators on Islamist and Leftist terrorism respectively. Economic development has a 
curvilinear effect on Leftist attacks with low to intermediate levels of development generating 
greater attacks while intermediate to greater development generates fewer attacks. The 
inflection point is about $3,752 per capita (about the development level of South Africa), at 
which point advanced development integrates potential radicals and thereby reduces Leftist 
attacks. For the main effect of development, the change in expected leftist terrorist attack 
counts ranges between 5.8 and 3.9 points for every one unit increase in development, while 
the squared term ranges between -.35 and -.24. However, development does not have a 
consistent effect on Islamist terrorism, attaining significance only in the last two full equations 
(Equations 5 and 6). This suggests that social strains associated with economic growth are 
more relevant to Leftist insurgencies that engage in transnational terrorism. 

Conversely, urbanization has a small but statistically significant positive association with 
Islamist terrorism, but has no significant effect on Leftist attacks. This suggests that social 
strains attendant on urbanization provoke a religious reaction in the form of Islamist attacks 
rather than Leftist attacks, militating against the ecological thesis that urbanization facilitates 
attacks by providing high-density targets and communications/travel networks, which should 
affect both forms of transnational terrorism. 

Next we control for globalization (Models 2-6 and 8-12). Initially, trade openness, foreign 
investment and western military dependency are all significant for both types of transnational 
terrorism. Trade openness positively affects Islamist terrorism and negatively affects leftist terrorism. 
This suggests that Islamist attacks are in part a response to global economic participation, while 
global economic integration apparently reduces Leftist terrorist mobilization. Western military 
dependence positively drives both forms of terrorism, supporting the idea that such dependence 
undermines state legitimacy and fuels insurgency. Significantly, foreign investment negatively 
affects both forms of transnational terrorism, though the effect loses statistical significance in the 
full models for Islamist terrorist attacks. This contravenes world-systems/dependency theory, 
suggesting that foreign investment may contribute to economic growth and improved social 
welfare, thereby reducing the grievances behind transnational terrorism. (For evidence on economic 
and social benefits of foreign investment, see Firebaugh 1992, 1996; Firebaugh and Beck 1994: de 
Soysa and Oneal 1999.) The subsequent introduction of controls for government consumption and 
percent female labor force eliminates the statistical significance of trade openness for both forms of 
terrorism (Models 3, 4 and 10,11), while foreign investment is washed out only in the Islamist 
models. This suggests that globalization is linked to a much broader set of development processes 
that, in general, tend to reduce the likelihood of producing terrorism. 
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Next we introduce internal political conditions. Political rights positively affect both types 
of terrorism, indicating that increased political opportunities facilitate attacks by insurgents. 
As some have noted, political democracies are especially vulnerable to terrorist attacks in 
general (Wilkinson 2001; Laqueur 2001), but far fewer have recognized that democracies are 
also much more likely to generate different forms of political mobilization, including terrorism 
(Goldstone 2004). The negative effect of civil liberties for Islamist terrorism isn't significant 
for Leftists, but this effect loses significance once we control for percent female labor force 
(Model 4), suggesting that these effects are tied up with underlying modernization processes. 
Government consumption of GDP stimulates Islamist terrorism but has no effect on Leftist 
attacks, suggesting a cultural response to the growth of the secular state (as predicted). 

Next we control for the percent of the labor force that is female, religious diversity (for 
Islamist attacks), and global political effects (Models 4 and 10). Female labor force percentage 
negatively affects Islamist attacks while having no effect on Leftist terrorism. Apparently the 
simple presence of women in the labor force reduces Islamist terrorism, although this is not 
the whole story (see below). 

Religious diversity is positively associated with Islamist attacks. This is indicated by the 
statistically significant quadratic for percent Muslim, which shows positively for the linear term 
and negatively for the squared term. This indicates that Muslim populations generate increasing 
Islamist terrorism as they move from low to intermediate societal dominance, after which the 
relationship attenuates. The inflection point for this downward turn is at about 70 percent 
Muslim, suggesting that intermediate levels of Islamist “strength” produce the most volatility. 

The Cold War dummy is positive for Leftist attacks but the Iranian revolution dummy is not 
statistically significant for Islamist attacks. This confirms that the Cold War generated more 
Leftist transnational terrorism, presumably due to the legitimacy of Marxist-Leninist ideology and 
Soviet/Chinese sponsorship of insurgent groups. Although some Islamist attacks were inspired 
and actively promoted by the Iranian government (e.g., Lebanon early 1980s), these dynamics 
cannot be generalized for the entire post-1979 period. This may reflect the heterogeneity of 
Islamist insurgencies (the Taliban movement in Afghanistan was Sunni while Hizbullah in 
Lebanon is Shia) as well as the limited transnational effects of the Iranian revolution. This 
significantly qualifies Rapoport’s (2004) “fourth wave” thesis, supporting the idea that, while the 
end of the Cold War brought a decline in Leftist transnational terrorism, the Iranian revolution did 
not apparently kick off a unique “fourth wave” of Islamist terrorism. Indeed, Figure 1 aptly 
demonstrates that Islamist attacks were relatively commonplace prior to 1979 and that, indeed, 
such attacks display peaks and valleys throughout the study period. 

Models 5 and 11 add the interaction between the percent female labor force and government 
consumption. The interaction term is statistically significant only for Islamist attacks. The main 
effect of female labor force remains negative and statistically significant, indicating that as female 
labor force grows Islamist attacks decline. Nonetheless, this relationship grows increasingly 
positive as central government consumption increases. It is possible (perhaps even likely) that a 
state’s role as employer of women elevates the public profile of women and their unique social 
agenda, thereby inciting a violent response on the part of Islamist activists. 


Robustness Tests 

We also tested these effects using zero-inflated negative binomial regression, simple 
negative binomial regression, a negative binomial regression with fixed effects, and panel- 
corrected standard errors regression (an adjusted OLS-based technique), finding the same set 
of statistically significant effects. We also added to our full model (Models 6 and 12) a year- 
count control (i.e. year 1 = 1, year 2 = 2, etc.), which was not statistically significant and did 
not alter the statistical significance of any of our effects. This suggests that there are no linear 
trended covariates that are omitted and reinforces confidence in several of our effects that are 
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trended (GDP per capita and percentage female labor force). Overall, these tests suggest that 
our findings are relatively robust. 


Conclusions 


Some argue that Islamist attacks represent a “fourth wave” of transnational terrorism that 
differs in its timing, methods, lethality and underlying social origins from earlier Cold War-era 
“third wave" leftist terrorism (Rapoport 2004; Laqueur 2001; Snow 1996). Spurred by the 
lranian revolution and post-Cold War international system in which the superpower rivalry was 
replaced by a new set of civilizational rivalries (Huntington 1998), Islamist attacks are seen as 
more lethal, more likely to target anonymous publics, and to stem from the religious 
competition between Islam and other religions. 

We found some support for this idea. Countries characterized by greater religious 
competition between adherents of Islam and other religions were more likely to produce 
Islamist attacks. Moreover, as discussed below, Islamist norms are often invoked in what appear 
to be cultural responses to the urbanization of society (rather than the ease of access to high- 
density targets in urban areas), Western military dependency, political rights/opportunities, the 
growth of the secular state, and growing female participation in the public labor force. At the 
same time, these Islamist attacks were not necessarily accelerated by the Iranian revolution, as 
symbolic as that event was. A more complex understanding of the various types of Islamist 
belief and international diffusion/sponsorship of terrorism needs to be developed. As we have 
shown elsewhere (Jenkins, Crenshaw and Robison 2004), Islamist transnational attacks are 
more likely to target anonymous/symbolic publics as well as U.S. interests in general, and they 
are more lethal, especially during the 1990s, which partially supports the “fourth wave" thesis. 

We found stronger support for the Cold War thesis, which was marked by greater Leftist 
attacks and, after 1991, by declining attacks. The end of the Cold War did reduce the frequency 
of this type of transnational terrorism. It did not, however, eliminate it, making clear that the 
idea of distinctive terrorist “waves” is overly simplistic. These ideological traditions are more 
linked by common beliefs and claims than by coordinated or diffused actions. Moreover, the 
presence of a modest number of groups that combined Islamist and Leftist ideology indicates 
the need to recognize the ideological complexity of transnational terrorist groups. 

Some of our results point to a cultural clash between Islamist radicals and certain 
modernization processes. Although transitional development contributed to both forms of 
transnational terrorism, urbanization and the growth of government consumption in 
combination with female labor force were uniquely relevant to Islamist attacks. This is 
reinforced by the aforementioned religious competition between Muslims and other religions, 
suggesting a civilizational clash driven by underlying modernization. 

We found very little evidence supporting a world systems/dependency interpretation of 
either form of transnational terrorism. Foreign investment militates against both forms of 
attacks and international trade is only relevant to Islamist attacks and loses statistical 
significance once government consumption is controlled. This suggests that, contrary to 
world systems theory, global international integration is destabilizing, at least with respect to 
Islamist transnational terrorism, only where It is anchored in cultural cleavages. Others have 
argued that foreign investment spurs economic growth (de Soysa and Oneal 1999) and may 
trickle down to improved social welfare (Firebaugh 1992, 1996; Firebaugh and Beck 1994), 
reducing transnational terrorism. This deserves further analysis that examines these 
intermediary links, the core country of origin, and the type of foreign investments. Military 
dependence on Western countries does, however, undermine the legitimacy of states in the 
developing world, thereby encouraging transnational terrorism. This conclusion is supported 
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for both types of attacks, indicating a need for the stricter separation of economic and political 
globalization processes. 

Islamist transnational terrorism is also uniquely affected by constraints on civil liberties, 
although this is not a particularly robust finding. Conservative Muslims may view these restraints 
as especially threatening to religious organization and activities and therefore respond to this 
type of state intrusion. Leftists, by contrast, do not appear to be affected by these constraints. 

Although Islamist and traditional Leftist ideology may be logically incompatible belief 
systems, our analysis indicates their deep kinship at the level of terrorist action. Both spring 
from the social strains of transitional development and are facilitated by political opportunities 
associated with increased political rights. Both are likewise spurred by Western military 
dependency and yet are reduced by foreign investment. On the other hand, civilizational 
cleavages appear to be uniquely relevant to Islamist transnational attacks, necessitating 
distinctive analyses of social contexts and origins. 

We conclude that terrorism is a single species, but with local variation. While ideologies 
uniquely shape some aspects of clandestine violence, the deeper wellsprings of transnational 
terrorism can be found in the basic contexts of social life. In short, there is hope for a unified 
theory of terrorism. 


Notes 


1. This includes attacks that have transnational implications by virtue of the target, such as 
attacks on embassies and airline hijackings 


2. In addition to the above-discussed question of double ideologies, there are also 
Palestinian ethnonationalist groups that have an exclusively Muslim membership and 
currently use Islamist rhetoric, even if secular goals dominated their early formation. We 
also tested the effects of treating these as Islamist groups but found the same statistically 
significant predictors. 


3. We also tested a quadratic for urbanization as a measure for transitional development, but 
its results were decidedly weaker. We therefore present the results for the development 
quadratic only. 


4. Univariates show that the variance is more than 10 times the mean for our dependent 
variables and a PR counts test indicates that the negative binomial method better fits the 
distribution of the data. Alpha tests for each of our models also suggest that Poisson is 
not appropriate. 
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A Theory of Categorical Terrorism 


Jeff Goodwin, New York University 





Abstract 

When revolutionaries or insurgents, broadly defined, indiscriminately attack civilians, they 
generally attack “complicitous civilians,” i.e., those categories of noncombatants which the 
revolutionaries see as benefiting from, supporting and/or having a substantial capacity to 
influence the states that the revolutionaries are attempting to displace or overthrow. Such 
“categorical” terrorism will be most extensive when revolutionaries view these states (or 
complicitous civilians themselves) as perpetrators of extensive, indiscriminate violence against 
the revolutionaries and their constituents. However, if significant numbers of complicitous 
civilians are seen by rebel groups as potential supporters (or as capable of being influenced by 
nonviolent appeals or protests), then they will not be indiscriminately attacked. Whether 
specific categories of civilians will be perceived as potential allies by revolutionaries depends 
mainly on the prior history of political interaction and cooperation between these civilians and 
the revolutionaries. Categorical terrorism is most likely where there has been little such 
interaction or cooperation, resulting in weak political alliances between the revolutionaries 
and complicitous civilians — for example, where the revolutionaries and complicitous civilians 
speak different languages, practice different religions, claim the same land, and/or are 
territorially segregated. 


The terrorist attacks of Sept. 11, 2001, have spurred many social scientists to explore the dynamics 
of terrorism, most for the first time. Before 9/11, terrorism research was the exclusive preserve, 
with very few exceptions, of small networks of political scientists and non-academic “security 
experts,” relatively few of whom were interested in social-science theory. Descriptive case studies 
abound, replete with ad hoc, case-specific explanations of terrorism. Curiously, most scholars of 
rebellion and revolution have had virtually nothing of significance to say about terrorism. More 
generally, the strategic choices of social movements — of which terrorism is one — have received 
much less scholarly attention than the causes and consequences of such movements. 

In this article, | begin with a definition of terrorism and revolutionary terrorism; | then discuss 
several extant theories of terrorism, noting their insights and shortcomings; and | outline my own 
theory of “categorical” terrorism, a type of terrorism that has been employed by insurgents with 
increasing frequency in recent years. Although my theory requires more rigorous empirical testing 
than | can provide here, | believe that it helps to explain — better than extant approaches — why some 
revolutionary movements, but not others, employ or try to employ a strategy characterized by the 
use of violence against anonymous civilians or noncombatants who share a particular collective 
identity. It is terrorism in this sense which today alternately fascinates, repels and inspires women 
and men across large parts of the globe. 


What is Terrorism? 


Like “democracy,” “power,” “class,” “revolution” and so many other “essentially contested 
concepts,” there is no commonly accepted definition of “terrorism.” And yet explanation 
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requires a clear analytic definition or demarcation of the phenomenon to be explained, even 
if, empirically, terrorism is not always easily distinguished from cognate phenomena. “Leaving 
the definition implicit is the road to obscurantism.” (Gibbs 1989:329) 

Several representative definitions of terrorism are presented in Table 1. The de/iberate use 
of violence in order to influence some audience (or audiences) is common to most of these 
definitions, but there are also several areas of divergence. First, there is no consensus as to 
who can practice terrorism: Can states as well as oppositional groups engage in terrorism? 
Several definitions (Hoffman; the U.S. State Department; Black; Bergesen and Lizardo) seem 
to imply that “state terrorism” is either a misnomer or at least something that needs to be 
distinguished from terrorism proper and presumably analyzed in its own right. Second, there 
is disagreement as to who can be the target of terrorist threats or violence: Can anyone be a 
target of terrorism — or just civilians (or “noncombatants”)? Several definitions (Carr; Ganor; 
the U.S. State Department; Black; Bergesen and Lizardo) suggest that only civilians or 
noncombatants can be the targets of terrorism, properly understood. Finally, there is 
disagreement as to whether terrorism necessarily produces terror among some audience — 
that is, extreme fear or anxiety. Several definitions (Tilly; Crenshaw; Ganor; the U.S. State 
Department; Black; Bergesen and Lizardo) make no mention of terror or fear. Carr is more 
ambiguous, defining terrorism as intending to “destroy the will” of civilians to support certain 
leaders or policies. These definitions seem to imply that terrorism may not always terrorize 
and hence may in fact influence audiences by other means. 

| believe that non-state or oppositional terrorism is most usefully defined as follows: 
Terrorism is the strategic use of violence and threats of violence by an oppositional political 
group against civilians or noncombatants, and is usually intended to influence several 
audiences. This definition, which is fairly conventional, agrees with those that suggest 
terrorism involves violence or threats by civilians against other civilians, thus differentiating 
terrorism from guerrilla warfare, on the one hand, and state violence, on the other. (This 
definition is also agnostic as to whether terrorism is intended literally to terrorize.) What we 
must explain in order to explain terrorism is not why political groups sometimes resort to 
violence, but why they employ violence against civilians or noncombatants in particular 
(Goodwin 2004). Following this definition, revolutionary terrorism may be defined as the 
strategic use of violence and threats of violence by a revolutionary movement against civilians 
or noncombatants, and is usually intended to influence several audiences. | employ the 
concept of “revolutionary movement” broadly here, meaning any organization or network, and 
its supporters, which seeks to change the political, and in some cases the socioeconomic, 
order in more or less fundamental ways. The audiences that revolutionaries attempt to 
influence with terrorism generally include the category of civilians whom they have targeted 
for violence, the state(s) that they wish to change, the revolutionaries’ constituents, the 
revolutionaries’ own ranks, and those other states and publics that might influence the 
outcome of the conflict. 

These definitions do not imply that state violence or “state terrorism” does not exist or 
should not be studied. On the contrary, state terrorism is an enormously important subject; 
it is incontestable, for example, that state terrorism has claimed many more victims than has 
terrorism as | define it here. (See, for example, Herman and O'Sullivan 1989: chaps. 2-3; 
Gareau 2004). But state terrorism and oppositional forms of terrorism need to be clearly 
distinguished if only to understand better the relationship between the two. 

My definition of revolutionary terrorism also has the advantage of demarcating a widely 
recognized political strategy that has been employed by revolutionary groups with some 
frequency, especially since the 1960s — a strategy clearly distinct from, albeit sharing family 
resemblances with, such violent oppositional strategies as coups d'état, conventional and 
guerrilla warfare (directed at a state's armed forces), and economic sabotage. Indeed, at least 
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Table 1: Definitions of Terrorism 


Government by intimidation as directed and carried out by the party in power in France during 
the Revolution of 1789-94 ... 2. A policy intended to strike with terror those against whom it is 
adopted; the employment of methods of intimidation ..." - Oxford English Dictionary 


Terrorism “means an activity that ... appears to be intended to (i) intimidate or coerce a civilian 
population; (ii) to influence the policy of a government by intimidation or coercion; or (iii) to 
affect the conduct of a government by assassination or kidnapping.” - U.S. Code (cited in 
Chomsky 2001:16) 


Terrorism is “the deliberate creation and exploitation of fear through violence or the threat of 
violence in the pursuit of political change.” — Bruce Hoffman (1998:43) 


Terrorism is “asymmetrical deployment of threats and violence against enemies using means 
that fall outside the forms of political struggle routinely operating within some current regime.” 
— Charles Tilly (2004:5). 


Terrorism is “the contemporary name given to, and the modern permutation of, warfare 
deliberately waged against civilians with the purpose of destroying their will to support either 
leaders or policies that the agents of such violence find objectionable.” - Caleb Carr (2003:6) 


Terrorism is “the premeditated use or threat of symbolic, low-level violence by conspiratorial 
organizations.” — Martha Crenshaw (1981:379) 


Terrorism is “organized political violence, lethal or nonlethal, designed to deter opposition by 
maximizing fear, specifically by random targeting of people or sites.” - Austin T. Turk 
(1982:122). 


“Terrorism is the intentional use of or threat to use violence against civilians or against civilian 
targets, in order to attain political aims.” - Boaz Ganor (1998) 


Terrorism is “premeditated, politically motivated violence perpetrated against noncombatant 
targets by subnational groups or clandestine agents, usually intended to influence an 
audience.” - U.S. State Department (cited in Hoffman 1998:38) 


“Pure terrorism is self-help by organized civilians who covertly inflict mass violence on other 
civilians.” - Donald Black (2004:16) 


Terrorism is “the use of violence by nonstate groups against noncombatants for symbolic 
purposes, that is, to influence or somehow affect another audience for some political, social, or 
religious purpose.” — Albert J. Bergesen and Omar Lizardo (2004:50). 





since the 1960s, terrorism has become part of the “repertoire of contention” (Tilly 1995), 
which is culturally available to virtually all insurgent groups around the globe, whatever their 
political goals may be. 

Finally, my definition of revolutionary terrorism emphasizes, conventionally, that the groups 
that employ terrorism are oppositional political groups which view terrorism, for the most part, 
as a political strategy. These groups may also be nationalist, ethnic, religious, class-based 
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and/or criminal in nature, but they seek — whatever else they may seek — to change the political 
(and perhaps socioeconomic) order more or less fundamentally. Revolutionary groups that 
employ terrorism as a strategy (like those which eschew it) seek, minimally, to overthrow or 
terminate a state's power within a specific territory (including colonial and military occupations) 
or to secede from a political order and set up a new state (or join a preexisting one). 
Accordingly, violence directed at civilians by vigilante or paramilitary groups that are not 
attempting to change the political order in fundamental ways would count, depending on the 
circumstances, as either state-sanctioned or oppositional terrorism, but not as revolutionary 
terrorism. ; 

Two types of terrorism need to be analytically differentiated, both of which differ from 
conventional and guerrilla warfare, insofar as the latter are directed against a government's 
armed forces, state-sanctioned paramilitaries or armed civilians (see Table 2).’ Of course, as 
Donald Black points out, “those popularly known as guerrillas may Sometimes engage in 
terrorism [when they attack civilians], and those popularly known as terrorists may sometimes 
engage in guerrilla warfare [when they attack military facilities or personnel].” (2004: 17) 

One type of terrorism, which we may call “selective” or “individualized,” is directed against 
noncombatants who are targeted because of their individual identities or roles; in fact, these 
individuals are typically known by name to at least some members of the revolutionary 
movement. These individuals typically include politicians and (unarmed) state officials, usually 
those held responsible, directly or indirectly, for the social and political arrangements and 
government policies that the revolutionaries oppose (including “counter-terrorist” policies). 
They may also include competing oppositional leaders and political activists, presumed 
government collaborators and spies, unsympathetic intellectuals and journalists, and common 
criminals who prey upon the revolutionaries’ supporters. This type of terrorism — essentially a 
strategy of “targeted assassination” - was employed by some 19th-century Russian 
revolutionaries, a number of anarchist groups and several radical European groups of the 
1960s. Terrorism of this type, especially when targeted at real or imagined collaborators whom 
the government cannot (or will not) protect, may result in an exceedingly large number of 
casualties, as in Peru during the 1980s and Algeria during the 1990s (see, e.g., Kalyvas 1999). 


Table 2: Three Types of Armed Struggle 


Targets of Revolutionaries 
Combatants Noncombatants 


Defense Forces Politicians Anonymous members of an 
ethnicity, religion, nationality, 
social class, etc. 


Security Forces State Administrators ‘ 
Paramilitaries/ Leaders/Activists of Competing 
Armed Civilians Oppositions 


Presumed Collaborators 
Common Criminals 


1. Conventional or 2. Selective or Individualized 3. Categorical or Indiscriminate 
Guerrilla Warfare Terrorism; Targeted Assassination — Terrorism 
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Targeted assassination or selective terrorism is very different from “indiscriminate” or what 
| term “categorical” terrorism, which is directed against anonymous individuals by virtue of 
their belonging (or seeming to belong) to a specific ethnic or religious group, nationality, social 
class or some other collectivity. This type of terrorism — the focus of this article — is typically 
called indiscriminate or “random” terrorism because it makes no distinctions among the 
individual identities of its targets. In another sense, however, such terrorism is very 
discriminate, being directed against specific categories of people and not others. For this 
reason, “categorical terrorism” is a more accurate label than “indiscriminate terrorism” for this 
Strategy. Revolutionaries who practice terrorism, whether selective or categorical, also display 
varying degrees of tolerance for harming anonymous bystanders, that is, people who are not 
the individual, or who do not belong to the category, targeted for violence. The U.S. military 
refers to such casualties as “collateral damage.” 

Following the general definition of terrorism given above, categorical terrorism may be 
defined as the strategic use of violence and threats of violence, usually intended to influence 
several audiences, by oppositional political groups against civilians or noncombatants who 
belong to a specific ethnicity, religious or national group, social class or some other collectivity, 
without regard to their individual identities or roles. In much, if not most, popular discourse, as 
well as for many scholars (e.g., Turk 1982; Senechal de la Roche 1996; Black 2004), 
“terrorism” fundamentally /s categorical terrorism in this sense. “Indiscriminate” violence is 
seen by many as an essential property of terrorism. Terrorism in this sense, particularly as it is 
practiced (or not) by revolutionary movements, is what | seek to explain with my own theory 
of categorical terrorism. 

There is substantial variation in the extent to which revolutionary movements employ 
categorical terrorism as a strategy. Table 3 lists some of the more important revolutionary 
groups that arose during the period since World War II as well-as their principal violent 
strategies (excluding economic sabotage). Of course, revolutionary groups generally employ 
a number of both violent and nonviolent strategies in pursuit of their goals, and their mix of 
strategies typically changes over time. For this reason, it can be quite misleading to describe 
some such groups as “terrorist organizations” or others as “guerrilla groups.” These labels 
falsely “essentialize” revolutionary movements not just in terms of their strategic orientation — 
a rather limited view of what any movement is all about — but also in terms of just one of their 
strategies at any given moment. 

The first five sets of revolutionary groups listed in Table 3 perpetrated or continue to 
perpetrate extensive categorical or indiscriminate terrorism. The other listed organizations 
were (or are) generally much more selective or individualized in their use of terrorism; at least 
one group, the Sandinista National Liberation Front in Nicaragua, employed virtually no 
terrorism to speak of. The Irish Republican Army (IRA) and Basque Homeland and Freedom 
(ETA) are borderline cases. Before it went on cease-fire in 1997, the IRA typically engaged in 
attacks on security forces as well as some selective terrorism, but it also occasionally carried 
out bombings and other sectarian killings of ordinary Protestants in both Northern Ireland and 
Britain, especially during the height of “the Troubles” in Northern Ireland during the mid-1970s 
(English 2003). Historically, ETA has directed most of its violence against the Spanish military 
and police presence in the Basque region and against politicians of parties that oppose 
Basque independence. However, it has also engaged in occasional bombings and attempted 
bombings against civilians (Clark 1984). 

Clearly, any adequate theory of categorical terrorism needs to explain the wide variation 
shown in Table 3. That is to say, it is necessary but not sufficient to explain why some 
revolutionary movements have practiced categorical terrorism; an adequate theory must also 
explain why other revolutionary movements have not carried out categorical terrorism or have 


done so relatively infrequently. 
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Table 3: Coercive Strategies of Revolutionary Organizations 


Country Dates Main Organization(s) 


Algeria 1954-1962 National Liberation Front (FLN) 
categorical and selective terrorism; guerrilla warfare 


Palestine/Israel 1964-present Palestine Liberation Organization (PLO), Popular 
Front for the Liberation of Palestine, Hamas, 
Palestinian Islamic Jihad 
categorical and selective terrorism; guerrilla warfare 
Sri Lanka 1983-2002? Liberation Tigers of Tamil Eelam (LTTE) 
categorical and selective terrorism; conventional and 
guerrilla warfare 
Various countries 1988-present al-Qa‘ida 
categorical and selective terrorism; guerrilla warfare 
Chechnya/Russia  1996-present Chechen separatists 
categorical and selective terrorism; guerrilla warfare 


Nicaragua 1961-1979 Sandinista National Liberation Front (FSLN) 
very limited selective terrorism; guerrilla warfare 
South Africa 1961-1990 African National Congress (ANC) 


very limited categorical terrorism; some selective 
terrorism; guerrilla warfare 
Basque Country/ 1968-2006? | Basque Homeland and Freedom (ETA) 


Spain very limited categorical terrorism; selective terrorism; 
guerrilla warfare 
N. Ireland/U.K. 1969-1997 Irish Republican Army (IRA) 


some categorical terrorism (esp. mid-1970s); 
selective terrorism; guerrilla warfare 

El Salvador 1980-1992 Farabundo Marti Front for National Liberation 
(FMLN) 
limited selective terrorism; guerrilla warfare 





Extant Theoretical Approaches 


When social scientists attempt to explain why revolutionaries employ terrorism, how do they 
do so? Here, | briefly review three theoretical claims: (1) terrorism is a product of the 
weakness and desperation of revolutionaries; (2) terrorism is a response to state terrorism; 
and (3) terrorism is a result of extreme “social polarization” between groups. While these 
claims offer important insights into terrorism, they are ultimately unsatisfactory. 

Before examining these claims, however, | should note that a typical explanation for 
terrorism in much public discourse, and by many scholars, is grievance-based (Stern 2003). 
That is to say, many people suggest that the “root causes” of terrorism are found in the 
specific and variable grievances of those who utilize terrorism as a strategy. But this argument 
is extremely problematic. The removal of grievances that presumably motivate revolutionaries 
should end their acts of terrorism. But those who make this argument never explain which 
kinds of grievances require or somehow cause terrorism to be utilized by revolutionaries as 
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opposed to other political strategies. Grievances may be a necessary cause of collective 
action, but it is less clear how they cause people to act in the ways that they do. In fact, ends 
do not explain means any better than they justify them. Thus, as they are typically articulated, 
grievance-based accounts of terrorism are at best incomplete and at worst quite misleading 
(Goodwin 2004). 

After grievance-based explanations, perhaps the most frequently cited hypothesis is that 
terrorists are very weak, lack popular support, and yet are desperate to redress their 
grievances. This claim — or rationalization — also seems very popular among many groups that 
employ terrorism. The core idea here is that groups that lack the capacity to organize 
nonviolent protests or conventional or guerrilla warfare against repressive governments, or 
which fail to attain their goals when they do employ these strategies, will turn to terrorism as 
a “last resort.” Martha Crenshaw, for example, suggests that “Terrorism is the resort of an 
elite when conditions are not revolutionary... terrorism is most likely to occur precisely where 
mass passivity and elite dissatisfaction coincide.” (1981:384) For these reasons, Rubenstein 
(1987) calls terrorists “alchemists of revolution.” 

Disaffected elites, according to Crenshaw, turn to terrorism because it is easier and 
cheaper than strategies based on mass mobilization, especially when government repression 
makes mass mobilization difficult if not impossible. “In situations where paths to the legal 
expression of opposition are blocked, but where the regime’s repression is inefficient, 
revolutionary terrorism is doubly likely, as permissive and direct causes coincide.” (Crenshaw 
1981:384) Revolutionary groups will presumably employ categorical terrorism, moreover, 
because it is generally even cheaper than selective terrorism (Kalyvas 2004). There may be 
only so many opportunities available for killing a particular politician or competing opposition 
leader, for example, but setting off a bomb in a pub or bus may be relatively simple and will 
also produce more casualties. “The observation that terrorism is a weapon of the weak,” 
Crenshaw concludes, “is hackneyed but apt. At least when initially adopted, terrorism is the 
strategy of a minority that by its own judgment lacks other means. When the group perceives 
its options as limited, terrorism is attractive because it is a relatively inexpensive and simple 
alternative, and because its potential reward is high.” (1981:387) (For similar views, see 
Rubenstein 1987; Irvin 1999:chap. 2; Pape 2005:92-4; Bloom 2005.) 

There are a number of logical and empirical problems with this theory of terrorism. Most 
importantly, this theory by itself does not explain why attacking or threatening ordinary 
civilians would be perceived by revolutionaries as beneficial instead of detrimental to their 
cause. Terrorism may be cheaper than many other strategies, but why employ it at all? We 
need to know what revolutionaries believe they will gain by attacking specific categories of 
civilians. What are their strategic goals and expectations? Would not attacks on civilians 
simply undermine the popularity of revolutionaries or their own morale? To say that the 
revolutionaries are weak begs these questions. 

Second, there does not seem to be a particularly strong empirical relationship between the 
organizational strength of revolutionary groups and their use (or not) of terrorism. Some quite 
formidable revolutionary movements have employed terrorism. For example, the Liberation 
Tigers of Tamil Eelam (LTTE) in Sri Lanka are, by all accounts, a very powerful revolutionary 
group. The LTTE has sometimes even employed methods of conventional warfare against Sri 
Lankan government forces. Yet the (predominantly Tamil) LTTE has also occasionally engaged 
in indiscriminate attacks on ordinary ethnic Sinhalese civilians, and it has done so long after 
it decimated rival Tamil nationalist groups (Bloom 2005:chap. 3). Why? 

One can also point, conversely, to quite weak revolutionary organizations that have 
eschewed terrorism. Perhaps the best example of this is the armed wing of the African 
National Congress (ANC) in South Africa. In 1961, as many of its leaders were being arrested 
and many others driven into exile, the ANC established an armed wing called Umkhonto we 
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Sizwe ("Spear of the Nation” or MK). The ANC explicitly adopted “armed struggle” as one of 
its main political strategies. By most accounts, however, this armed group failed to become 
an effective guerrilla force, as the South African Defense Forces were simply too strong and 
effective. And yet MK did not adopt a strategy of terrorism, despite the fact, as Gay Seidman 
points out, that, “In a deeply segregated society, it would have been easy to kill random 
whites. Segregated white schools, segregated movie theaters, segregated shopping centers 
meant that if white deaths were the only goal, potential targets could be found everywhere.” 
(2001:118) However, as Stephen Davis notes, “since the exile leadership sought to portray the 
ANC as a principled and responsible contender for power, it imposed restrictions against 
terrorist tactics that specifically targeted noncombatant whites.” (Davis 1987:121) The then- 
president of the ANC, Oliver Tambo, “even went to the extent of signing a protocol of the 
Geneva Convention which legally bound the ANC to avoid attacks on civilian targets, and to 
‘humanitarian conduct of the war,’ marking the first time a guerrilla group had ever done so.” 
(Davis 1987:121-22) 

In short, weak revolutionary groups do not necessarily adopt a strategy of terrorism, and 
strong revolutionaries do not necessarily eschew this strategy. As Turk concludes, “Because 
any group may adopt terror tactics, it is misleading to assume either that ‘terrorism is the 
weapon of the weak’ or that terrorists are always small groups of outsiders — or at most a 
‘lunatic fringe’ . . . Terror is organized violence, but the nature of the organization cannot be 
specified in defining terror.” (1982:122) 

Some of the many investigations of the socioeconomic backgrounds of insurgents who 
have engaged in terrorism, to the extent that they bother to address issues of causation, also 
point to a type of weakness as the source of terrorism. One idea in this literature is that poor, 
and poorly educated, people are especially likely to become terrorists because they are 
desperate for resources, status or power, but lack the ability to use other strategies for 
securing these. The evidence for this claim, however, is quite weak. In fact, as Crenshaw’s 
thesis suggests, much research demonstrates that the members of revolutionary groups that 
practice terrorism are just as likely, if not more likely, to come from elite social strata as from 
working- or lower-class backgrounds (Krueger and Maleckova 2003). 

The insight of the “weakness theory” of terrorism is that oppositional groups do generally 
seem to take up arms after they have concluded that nonviolent politics cannot work or that 
it works too slowly or ineffectively to redress urgent grievances. But notice that this does not 
tell us why armed groups would employ terrorism as opposed to guerrilla warfare. Moreover, 
the argument that attacking “soft” targets such as unprotected civilians is cheaper and easier 
than waging guerrilla warfare against government forces does not explain why a// oppositional 
groups do not engage in terrorism. It would seem irrational not to do so. 

Another hypothesis claims that terrorism is a response to state terrorism. Leftist analysts 
of terrorism often make this claim, and it is mentioned by Herman and O'Sullivan (1989). They 
suggest that the “retail” terrorism of oppositional groups is caused or provoked by the 
“wholesale” or “primary” terrorism of states, especially powerful Western states, above all the 
United States. The terms “wholesale” and “retail,” which have also been employed by Noam 
Chomsky, are meant to remind readers that state terrorism has been much more deadly than 
oppositional terrorism. This claim certainly has an intuitive plausibility. Why else would 
oppositional groups turn to violence except when they confront a government or state that 
is itself unmoved by and indeed uses violence against peaceful protesters? Avenging such 
state violence, or trying to deter its future use, would seem to be reason enough for 
opposition groups to employ violent strategies. And yet, as an explanation of terrorism, this 
hypothesis Is also beset by both logical and empirical problems. 

It is certainly true that indiscriminate state violence, especially when perpetrated by 
relatively weak states, has historically encouraged the development of revolutionary 
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movements (Goodwin 2001). But why would these movements attack and threaten ordinary 
civilians as opposed to the state’s armed forces? In other words, if they are responding to 
state terrorism, why would revolutionaries not target only the state? State terrorism would 
seem to be a better explanation for a strategy of guerrilla (or conventional) warfare than for a 
strategy of terrorism. 

Empirically, one can also point to revolutionary organizations that have arisen in contexts 
of extreme state violence which have nonetheless eschewed the strategy of terrorism. For 
example, Central American guerrilla movements of the 1970s and 1980s, including the 
Sandinistas (FSLN) in Nicaragua and the Farabundo Marti Front for National Liberation (FMLN) 
in El Salvador, confronted states that engaged in extensive violence against noncombatants, 
yet neither revolutionary group engaged in categorical terrorism, and the Sandinistas engaged 
in virtually no terrorism of any type. Another such example is, once again, the ANC in South 
Africa. Interestingly, Herman and O'Sullivan's book devotes considerable attention to both 
South African and Israeli state terrorism (1989:chap. 2). And yet, while they note the “retail” 
terrorism of the Palestine Liberation Organization (PLO) during the 1970s and 1980s - 
emphasizing that Israeli state terrorism was responsible for a great many more civilian deaths 
during this period — they do not discuss the oppositional terrorism in South Africa which their 
theory would seem to predict. In fact, as we have noted, the ANC simply did not carry out 
much terrorism. State terrorism, clearly, does not always cause or provoke categorical 
terrorism. 

Having said this, it is indeed difficult to point to a revolutionary group that has carried out 
extensive terrorism which has not arisen in a context of considerable state violence. Leaving 
aside for the moment the case of al-Qaida, all of the groups in Table 3 that have engaged in 
extensive categorical terrorism are drawn from, and claim to act on behalf of, populations that 
have themselves suffered extensive and often indiscriminate state repression (in French 
Algeria, the West Bank and Gaza, Sri Lanka and Chechnya). The question is what to make of 
this correlation. Why, in these particular contexts, have revolutionaries attacked certain 
categories of civilians as well as government forces? 

A final hypothesis worth considering is that terrorism is the result of extreme “social 
polarization” between groups. Such polarization is said to exist when such groups are 
relationally distant (i.e., they have little if any intimate contact), culturally distant (i.e., they differ 
in terms of language, religion, dress and other “expressive” characteristics), functionally 
independent (i.e., they do not cooperate with or depend on one another for their well-being), 
and extremely unequal in terms of wealth, status and power (Senechal de la Roche 1996). 
“Enduring grievances” against, or “intractable offenses” by, socially distant groups allegedly 
provide the motive for organizing terrorist attacks against them. Thus, “terrorism is most likely 
in polarized conflicts where the grievance endures.” (Senechal de la Roche 1996:120; see 
also Black 2004: 18) By contrast, “closer civilians such as those of the same or similar ethnicity 
are largely immune to terrorism, especially its deadlier forms. If closer collective conflicts lead 
to violence at all, they produce different forms with fewer civilian casualties, such as riots, 
assassinations, kidnappings, and guerrilla warfare.” (Black 2004:20) 

This theory is also intuitively plausible. It is indeed difficult to imagine that two populations 
would engage in a violent conflict if they have extensive face-to-face contact, belong to the 
same ethnic group, speak the same language and worship the same god, depend on one 
another for their livelihoods, and are more or less equal in terms of wealth, status and power. 
In fact, we would presumably not recognize two such populations as distinct “groups” at all. 
Furthermore, it is quite easy to imagine that members of a subordinate group would come to 
hate, and even view as inherently evil, members of a dominating group with whom they have 
little intimate contact and who belong to a different “race,” speak a different language, 
practice a different religion, and are much wealthier and more powerful. Empirically, there 
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does seem to be a huge “social distance” separating those revolutionaries who practice 
categorical terrorism (and those they claim to speak for), on the one hand, and their civilian 
targets, on the other. The conflicts in French Algeria, Palestine/Israel, Sri Lanka and Chechnya 
were or are characterized by extreme polarization — in terms of wealth, status, power, ethnicity 
and religion. 

Nonetheless, the social-polarization theory of terrorism is plagued by a number of 
ambiguities. To begin with, how could “enduring grievances” or “intractable offenses” arise 
between groups that are in fact functionally independent, groups that do not cooperate with 
one another economically, politically, militarily or otherwise? (Senechal de la Roche 1996:111) 
Indeed, would not functional independence actually discourage violence between groups by 
making it difficult for one group to plausibly blame the other for its grievances? If my own 
livelihood really does not depend in any way on some other group, why would | care if its 
members have more money than me or dress differently or do not recognize me on the 
street? Extremely socially distant groups care little about one another, so why would one 
blame the other for its ills, let alone attack it? 

Proponents of the social-polarization thesis are vague about the precise forms of social 
distance that encourage extreme hatred, moral repulsion or at least callous indifference 
between groups — sentiments that would seem to be necessary for groups to support 
terrorist violence against civilians. They speak in very general terms when describing the goals 
of revolutionary groups, and they say virtually nothing about the strategic goals of terrorist 
violence. Accordingly, it is difficult to determine from this literature precisely why revolutionary 
groups would indiscriminately attack civilians, be they “socially distant” or otherwise, instead 
of employing other means to attain their goals. Will just any type of “enduring grievances" 
lead to terrorism? According to Senechal de la Roche, “Invasions, military dictatorships, and 
other patterns of domination provide fertile conditions for terrorism.” (1996:119) But would 
not military invasions and military dictatorships encourage their opponents to adopt a strategy 
of conventional or guerrilla warfare against military forces? Why attack civilians in these 
contexts? “Terrorists typically demand a restoration of the past,” claims Donald Black, “such 
as political independence, lost territory, or a customary way of life.” (2004:18) But, again, why 
attack civilians indiscriminately in pursuit of such goals? How do these goals require or 
encourage terrorism? 

Finally, there have in fact been situations in which groups have massacred people who are 
seemingly “socially close” to them, even neighbors. Jan Gross (2001), for example, recounts 
how the Christian residents of the Polish town of Jedwabne brutally and indiscriminately killed 
their Jewish neighbors (or stood by silently) in July 1941 — 1,600 men, women and children 
in all. Previously, according to at least one (non-Jewish) resident, the Jews of Jedwabne were 
“on good terms with the Poles. Depending on each other. Everybody was on a first-name 
basis, Janek, Icek . . . Life here was, | would say, somehow idyllic.” (Gross 2001:18) There 
seem to have been extensive economic relations between Jedwabne’s Jewish and non- 
Jewish residents (most Jews were craftsmen and merchants), as well as considerable 
residential integration. 


A Theory of Categorical Terrorism 


We can begin to move toward a better understanding of categorical terrorism by considering 
the precise categories of civilians which revolutionaries (sometimes) target for violence. How 
revolutionaries “socially construct” and attribute blame to their enemies is something the 
aforementioned theories generally do not examine. Yet, clearly, revolutionaries do not 
indiscriminately attack just any civilians or noncombatants. Indeed, revolutionaries are also 


A Theory of Categorical Terrorism * 2037 


usually interested in winning the active support or allegiance of certain civilians. So which are 
the “bad” or blameworthy civilians whom they attack? 

When they employ a strategy of categorical terrorism, revolutionaries generally threaten 
and attack what we might call “complicitous civilians.”* Revolutionaries view these categories 
of civilians as complicitous insofar as they are believed to (1) routinely benefit from the actions 
of the government or state that the revolutionaries oppose, (2) support the government or 
state, and/or (3) have a substantial capacity to influence or to direct the government or state. 
Such complicitous civilians are akin to what Charles Tilly calls “polity members,” that is, 
groups which “can routinely lay claim to the generation of action or yielding of resources by 
agents of the government.” (Rule and Tilly 1975:55) 

The precise categories of civilians that revolutionaries view as complicitous depend on 
how revolutionaries construe the extant political order that they are trying to change. Different 
types of complicitous civilians are generally associated with different regime types (see Table 
4). Precisely how revolutionaries construe or socially construct political regimes, hence 
complicitous civilians, is a complex process, one that depends, among other things, on an 
organization's ideology, collective memory and practical experience (as filtered through its 
ideology). One group might view the United States, for example, as a genuinely 
representative democracy in which the general citizenry is complicit in government policies, 
whereas another group might view the United States as a “bourgeois democracy” in which 
only the bourgeoisie or wealthy businesspeople are complicitous. To take another example, 
some Palestinians believe that Israeli Jewish settlers in the West Bank and Gaza are the 
complicitous civilians of a settler regime, while other Palestinians believe that a// Israeli Jews, 
including Jews living within Israel's pre-1967 borders, are the complicitous civilians of a settler 
regime, understood in a very different sense. In any event, one will not be able to understand 
why certain revolutionary groups target particular categories of civilians without grasping the 
revolutionaries’ understanding of the political order that they confront and the complicitous 
civilians associated therewith. 


Table 4: Targets of Categorical Terrorism: “Complicitous Civilians” 


eve ee ariarerye aie re fad) Wit eae ee ee eS ee 
How do revolutionaries construe the extant political regime? 


Political Regime Complicitous Civilians 

Autocracy Cronies of the autocrat (usually not anonymous), the wealthy 
Oligarchy or “Bourgeois Dominant economic class, businesspeople, the wealthy 
Democracy” 


Single-party Authoritarian Party members 


Regime 

Ethnocracy Dominant ethnic group 

Settler Regime Settlers 

Colonial or Neocolonial Expatriate and comprador bourgeoisie, metropolitan 
(‘Puppet’) Regime bourgeoisie, general citizenry of the metropolis 


Democracy General citizenry 
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But why indiscriminately attack or threaten complicitous civilians? To answer this question, 
we need to understand not only how revolutionaries socially construct specific civilians as 
enemies but also how they weigh the costs and benefits of violently attacking such enemies 
once they have been defined. What, then, are the potential strategic benefits of terrorist 
attacks against complicitous civilians? The main strategic objective — the primary incentive — 
of categorical terrorism is to induce complicitous civilians to stop supporting, or to proactively 
demand changes in, certain government policies or the government itself. Categorical 
terrorism, in other words, mainly aims to apply such intense pressure to complicitous civilians 
that they will demand that “their” government change or abandon policies that the 
revolutionaries oppose. Research on terrorism suggests that there are several other strategic 
objectives behind indiscriminate attacks on complicitous civilians, including: 


* Jo provoke a violent overreaction by the government against the 
revolutionaries and their presumed supporters, ultimately benefiting 
the revolutionaries 

* Jo undermine efforts at peace or reconciliation between the 
government and competing oppositional groups, ultimately 
benefiting the revolutionaries 

* To attract, retain and/or boost the morale of revolutionaries and 
Supporters 

* To avenge specific acts for which complicitous civilians are held 
responsible 

° Jo seize or recover territory trom the government 


The fact that a group or movement may have one or more of these strategic objectives or 
incentives does not automatically “cause” categorical terrorism. Whether such goals will 
actually induce revolutionaries to adopt a strategy of categorical terrorism depends on a 
number of other factors, including whether revolutionaries believe they have the 
organizational and technical capacity to wage an effective campaign of terrorism. More 
generally, the strategic choice of revolutionaries to employ categorical terrorism Is much 
complicated by the fact that this strategy also has many potential costs as well as benefits. 
Research on terrorism suggests that there are several reasons why revolutionaries might 
reject a strategy of categorical terrorism: 


° Complicitous civilians may be potential members or allies of the 
revolutionary movement. 

¢ Nonviolent appeals or protests may influence complicitous civilians 
more effectively than threats or violence. 

* Jerrorism may anger or repel the members or supporters of the 
revolutionary movement. 

* Jerrorism may harm or prevent alliances with actually or potentially 
sympathetic third parties. 

° Jerrorism may provoke state repression for which the revolutionaries 
will be blamed by their constituents. 

* Terrorism may provoke state repression that will severely weaken or 
even destroy the revolutionary movement itself 


Because there are often very good reasons to employ and reject a strategy of terrorism, 
revolutionaries usually confront a number of “strategic dilemmas.” (Jasper 2004:7-10) There 
may be no simple solution to these dilemmas, because revolutionaries may have imperfect 


A Theory of Categorical Terrorism * 2039 


information about the political regime, complicitous civilians or even their own presumed 
constituents, and thus cannot easily predict reactions to terrorism. Because these dilemmas 
are not always easily resolved, moreover, they do not clearly direct revolutionaries along a 
particular line of action. This may help account for the sheer unpredictability of some terrorist 
attacks as well as the seemly quixotic or self-defeating character of others. © 

One strategic dilemma, which Jasper labels “naughty or nice” (2004:9), has to do with 
whether collective action is more effective when it involves friendly persuasion or coercion. 
For revolutionaries, one concern is whether complicitous civilians will be more effectively 
influenced by nonviolent appeals or protests or by violence and threats. Nonviolent appeals 
are relatively cheap, but they may not work; and violence, even when efficiently executed, 
may backfire, inducing complicitous civilians, for example, not to pressure “their” government 
to change its ways but to redouble their support for it. The less familiar revolutionaries are 
with the cultural beliefs and assumptions of complicitous civilians (i.e., the greater the cultural 
distance between these groups), the more likely they are to strategically miscalculate. 

Another strategic dilemma, which Jasper calls “reaching out or reaching in,” concerns the 
“issue of whether to play to inside or outside audiences once they are defined.” (2004:10) As 
noted above, one reason why revolutionary groups sometimes employ terrorism is to attract, 
retain and/or boost the morale of activists, but this may only serve to alienate potential allies. 
On the other hand, heeding such allies and rejecting terrorism may undermine the morale of 
activists and risk losing the political initiative to more violent organizations (Bloom 2005:chap. 
4). This dilemma is evident among Palestinian groups. Terrorism against Israeli Jews has 
sometimes won the approval of many Palestinians (and other Arabs), but at the cost of 
alienating potential allies outside the Middle East.° 

Revolutionaries’ calculations about whether they should employ categorical terrorism as a 
strategy are not made in a vacuum, outside of any social, political or cultural context. A theory 
of categorical terrorism needs to specify the key contextual factors that create incentives or 
disincentives for revolutionaries to engage in such terrorism. Sometimes these factors pull 
in both directions, but sometimes they may convince revolutionaries that their strategic 
dilemmas may be more or less adequately resolved by a consistent course of action. 

The key factors in a theory of categorical terrorism must be those that either b/ur or brighten 
the cognitive, affective and moral distinctions between the political order that a revolutionary 
group seeks to change and the civilians who live under that political order. Factors that b/ur this 
distinction encourage categorical terrorism, insofar as the latter rests upon the revolutionaries’ 
failure or refusal to draw a moral, affective or even conceptual distinction between a hated 
political regime and “its” citizens or some subset of them. Factors that brighten this distinction 
discourage terrorism by dissociating civilians from the targeted regime and its policies. 

My theory proposes that three key contextual factors strongly influence the decision of 
revolutionaries either to employ or not to employ categorical terrorism (see Table 5). First, and 
most importantly, terrorism is encouraged when revolutionaries perceive that certain 
categories of civilians — complicitous civilians — benefit from, support, demand or tolerate 
extensive and indiscriminate state violence or state terrorism against the revolutionaries and 
their presumed constituents. This perception is more or less strongly encouraged by the 
ideologies of revolutionary organizations, but also by institutional arrangements and everyday 
practices that (a) blur the boundaries between the government and these complicitous 
civilians (e.g., elections) and/or (b) blur the boundaries between the military and complicitous 
civilians (e.g., universal conscription). In other words, practices that tend to elide the 
distinction between state and citizen, on the one hand, or between combatant and 
noncombatant, on the other, also tend to elide the willingness or even capacity of 
revolutionaries (and others) to make moral, affective or even cognitive distinctions between 


these categories. 
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Table 5: Key Variables Affecting Revolutionaries’ Use of Categorical Terrorism 


a eD 


+b _ 
1. Complicitous civilians support extensive state vs. State violence is limited and/ or 
violence/state terrorism conspicuously opposed by 


complicitous civilians 
(a) Boundaries between the state and citizens are blurred vs. State autonomy from society 


(b) Boundaries between the military/ combatants and vs. Military autonomy from society 
civilians/ noncombatants are blurred and/or the political regime 

2. Complicitous civilians are numerous and relatively vs. | Complicitous civilians are few and 
unprotected well protected 

3. Weak or absent political alliances or cooperation vs. Significant political alliances 
between revolutionaries and complicitous civilians (i.e., between revolutionaries and 
strong linguistic, religious, and/or territorial complicitous civilians 


segregation) 


| noted earlier that revolutionary groups that have employed a strategy of categorical 
terrorism are usually drawn from, and claim to act on behalf of, populations that have suffered 
extensive and often indiscriminate state repression (in French Algeria, the West Bank and 
Gaza, Sri Lanka and Chechnya). In each of these cases, moreover, there was or is a perception 
by the revolutionaries of substantial civilian support for (or toleration of) that repression (by 
European settlers, Israelis, Sinhalese and Russians, respectively). Indeed, the governments 
that carried out the repression in these cases had a substantial measure of democratic 
legitimacy among complicitous civilians; some still do. Democratic rights and institutions are 
often effective at creating the impression (especially at some social distance) of substantial 
solidarity between citizens and “their” states. When extensive and indiscriminate state 
violence is seen to be supported by civilians and/or orchestrated by democratically elected 
governments, it is hardly surprising that revolutionaries would tend to view both repressive 
states and the civilians who seem to stand behind those states as legitimate targets of 
counter-violence, typically justified as “self-defense.” Nor is it surprising that retribution for 
such violence would be directed at civilians as well as at the state’s armed forces. And it 
would also be reasonable under these circumstances for revolutionaries to conclude that 
attacking civilians might cause the latter to put substantial pressure on “their” states to 
change their ways. Extensive state (“wholesale”) terrorism begets extensive oppositional 
(“retail”) terrorism, in other words, only when there exists a citizenry with significant 
democratic rights. The latter would appear to be a necessary precondition for extensive 
categorical terrorism (Pape 2005, Goodwin 2006). 

A second contextual factor that encourages categorical terrorism is the existence of a 
large and relatively unprotected population of complicitous civilians. By contrast, categorical 
terrorism is discouraged when the category of complicitous civilians is comparatively small 
and, accordingly, such civilians are few and far between and/or well protected (usually 
because of their wealth and/or political status). Hence, categorical terrorism is much more 
likely when an entire ethnic group or nationality is viewed by revolutionaries as complicitous 
as compared, for example, to a small social class or the cronies of an autonomous, “above 
class” dictator. And, in fact, all major cases of categorical terrorism seem to have entailed the 
use of violence and threats against a large ethnic or national group. 

This factor helps to explain why the Sandinista Front in Nicaragua carried out virtually no 
terrorism during their armed conflict with the personalistic Somoza dictatorship, an otherwise 
bloody insurgency during which some 30,000 people were killed (Booth 1985). Complicitous 
civilians in this context consisted of a tiny number of Somoza cronies and a loyal elite 
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opposition, both of which were drawn mainly from Nicaragua's small bourgeoisie. Virtually 
all other civilians in Nicaragua, from the lowliest peasant to Somoza’s bourgeois opponents, 
were viewed by the Sandinistas as potential allies, and indeed many would become such 
(Everingham 1996). Had the Somoza dictatorship been supported by more or larger social 
strata or by a particular ethnic group of substantial size, the Sandinistas would cele: 
have engaged in more categorical terrorism against these groups. 

The third main contextual factor that encourages terrorism is a particular type of “social 
distance,” namely, the weakness or absence of political alliances between revolutionaries and 
their presumed constituents and complicitous civilians. In fact, where one finds this type of 
political distance as well as mass-supported state violence, ideologies and/or cultural idioms 
that depict complicitous civilians as blameworthy and perhaps even inherently evil are especially 
likely to resonate among the victims of that violence. Simply put, these victims will not have 
experienced any positive political interactions with complicitous civilians to weigh against the 
palpable evil that the latter are seen to countenance or directly perpetrate. There has in fact been 
a huge political distance — in this particular sense — separating those revolutionaries who practice 
categorical terrorism (and those they claim to speak for) and their civilian targets. 

By contrast, categorical terrorism is discouraged when there are significant political 
alliances or forms of cooperation between revolutionaries and complicitous civilians. (For a 
similar argument about ethnic rioting, see Varshney 2002.) In other words, when revolutionary 
groups and their constituents have a history of collaborating politically with significant 
numbers of complicitous civilians, they are not likely to classify complicitous civilians as 
enemies. To attack such civilians indiscriminately would jeopardize politically valuable 
alliances and the resources and legitimacy attached to them. 

| believe that this last factor is extremely important for understanding why the ANC in 
South Africa rejected a strategy of categorical terrorism. The ANC eschewed this strategy 
even though the apartheid regime that it sought to topple employed very extensive state 
violence against its opponents. This violence, moreover, was clearly supported (or tolerated) 
by large segments of the white, especially Afrikaner, population. The Nationalist Party 
governments that unleashed the security forces against the regime's enemies were elected 
by the white population. So why did the ANC adhere to an ideology of “multiracialism” and 
refuse to view whites as such as enemies? The answer lies, | believe, in the ANC’s long 
history of collaborating with white South Africans, especially of British background — as well 
as with Indian and “colored” (mixed race) South Africans — in the anti-apartheid struggle. 
Perhaps especially important in this respect was the ANC’s long collaboration with whites in 
the South African Communist Party. Tellingly, an important, long-time leader of MK, the ANC’s 
armed wing, was Joe Slovo, a white Communist. (Try to imagine an Israeli Jew leading 
Hamas's armed wing or an American Christian directing al-Oa’ida.) For the ANC to have 
indiscriminately attacked South African whites would have soured this strategic relationship, 
which, among other things, was essential for securing substantial Soviet aid for the ANC. In 
sum, given the long-standing multiracial — including international — support for the anti- 
apartheid movement, a strategy of categorical terrorism against complicitous (white) civilians 
made little strategic or moral sense to ANC leaders. 

In sum, my theory of categorical terrorism predicts — and seems to retrodict accurately — 
that when revolutionaries indiscriminately attack civilians, they generally attack those large 
and unprotected categories of civilians or noncombatants, and only those who are perceived 
as benefiting from, supporting and/or having a substantial capacity to influence states that 
employ extensive, indiscriminate violence against revolutionaries and their constituents. 
However, if revolutionaries perceive significant numbers of these “complicitous civilians” as 
actual or potential supporters (or as capable of being influenced by nonviolent appeals or 
protests), then they will not be attacked. Whether specific categories of civilians will be 
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perceived as potential allies by revolutionaries depends mainly upon a history of political 
interaction and cooperation (or lack thereof) between these civilians and the revolutionaries. 
Categorical terrorism is most likely where there has been little such interaction or cooperation, 
resulting in weak political alliances between the revolutionaries and complicitous civilians — for 
example, where the revolutionaries and complicitous civilians speak different languages, 
practice different religions, claim the same land, and/or are territorially segregated. 

The essential elements of this theory can be represented by means of a conceptual space, 
represented in Figure 1. The first variable is a measure of the extent to which revolutionaries 
perceive civilian support for (and/or toleration of) extensive and indiscriminate state violence 
or state terrorism against themselves and their constituents. This variable can be said to be 
"strong" if state violence is great and revolutionaries believe that large numbers of civilians 
support or tolerate it. This variable would be “weak” if state violence itself is limited or if 
revolutionaries believe that relatively few civilians support it, however extensive or limited that 
violence may be. The second variable is the strength of the cross-cutting political alliances 
that connect (or fail to connect) revolutionaries and their constituents and complicitous 
civilians. Categorical terrorism occurs when and where revolutionaries perceive strong civilian 
support for extensive state violence and cross-cutting political ties between the 
revolutionaries and complicitous civilians are weak (i.e., when revolutionary groups fall into 
the upper left region of the conceptual space in Figure 1). Otherwise, revolutionaries will 
adopt strategies of conventional and/or guerrilla warfare and/or targeted assassination. 


Figure 1. A Theory of Categorical Terrorism 


Cross-cutting political alliances and/or cooperation 
between revolutionaries and complicitous civilians 


Weak Strong 
Strong 
French Algeria Apartheid South Africa 
Perceived civilian support for Israel/Palestine 
extensive state violence against Sri Lanka 
revolutionaries and their Chechnya 


constituents [categorical terrorism] 


Northern Ireland (1970s-80s) | Basque Country 


Weak _ El Salvador (1980s) Nicaragua (1970s) 


Conclusion 


Al-Qa‘ida and 9/11 


By way of conclusion, let me turn to the question of whether the theory outlined here explains 
why al-Qa'ida and affiliated Islamist groups have carried out extensive categorical terrorism, 
including the attacks of Sept. 11, 2001. | believe that it does. To be sure, al-Qa‘ida and its affiliates 
differ from other revolutionary organizations that have practiced categorical terrorism insofar as 
they represent a genuinely transnational revolutionary movement. That is to say, al-Qa'ida not 
only has militants in more than one national society, which is by no means unusual among 
revolutionary groups, but it also opposes and seeks to overthrow not just one, but several 
political orders (Gunaratna 2002). Al-Oa’ida is also unusual insofar as it has employed extensive 
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categorical terrorism against the citizens of states that it is trying to influence but not overthrow. 

Al-Qa‘ida has a pan-lslamic revolutionary project, viewing itself as the vanguard and 
defender of the transnational umma or Muslim community. Unfortunately, in al-Qa’ida’s view, 
this multiethnic, transnational community is currently balkanized and violently oppressed by 
“apostate” secular and “hypocritical” pseudo-lslamic regimes, from Morocco to Mindanao, as 
well as by the “Zionist entity” in Palestine. And standing behind these regimes — and now 
occupying Iraq — is the powerful U.S. government (and, to a lesser extent, other Western 
governments, especially the United Kingdom). This perception that the United States is the 
ultimate power which is propping up repressive, un-Islamic regimes in the Muslim world is 
the fundamental source of al-Qa’ida’s conflict with the United States. Al-Qa’ida believes that 
until the U.S. government — the “far enemy” — can be compelled to end its support for these 
regimes — the “near enemy” — and withdraw its troops from Muslim countries, local struggles 
against these regimes cannot succeed (Anonymous 2002, Doran 2001). 

But why kill ordinary Americans in addition to U.S. armed forces? Why would al-Qa'ida 
target the World Trade Center, for example, in addition to U.S. military installations? Shortly 
after 9/11, Osama bin Laden described the rationale for the attacks in an interview that first 
appeared in the Pakistani newspaper Ausaf on Nov. 7, 2001: 


The United States and their allies are killing us in Palestine, Chechnya, 
Kashmir, Palestine and Iraq. That’s why Muslims have the right to 
carry out revenge attacks on the U.S... . The American people should 
remember that they pay taxes to their government and that they voted 
for their president. Their government makes weapons and provides 
them to Israel, which they use to kill Palestinian Muslims. Given that 
the American Congress is a committee that represents the people, 
the fact that it agrees with the actions of the American government 
proves that America in its entirety is responsible for the atrocities that 
it is committing against Muslims. | demand the American people to 
take note of their government's policy against Muslims. They 
described their government's policy against Vietnam as wrong. They 
should now take the same stand that they did previously. The onus Is 
on Americans to prevent Muslims from being killed at the hands of 
their government. (Quoted in Lawrence 2005: 140-1.)° 


Bin Laden seems to be saying here that because the United States is, in his view, a 
genuinely representative democracy, ordinary citizens are responsible for the violent actions of 
“their” government (and, indirectly, of governments supported by the United States) in Muslim 
countries (Wiktorowicz and Kaltner 2003:88-9). Al-Oa‘ida views ordinary American citizens as 
complicitous civilians — morally culpable for the U.S.-sponsored “massacres” of Muslims in a 
number of countries. This idea was also expressed by Mohammad Sidique Khan, one of the 
four suicide bombers who killed more than 50 people in London on July 7, 2005. In a videotape 
broadcast on a/-Jazeera television on Sept. 1, 2005, Khan said, “Your democratically elected 
governments continuously perpetuate atrocities against my people all over the world. And your 
support of them makes you directly responsible, just as | am directly responsible for protecting 
and avenging my Muslim brothers and sisters.” (BBC News 2005) 

To be sure, al-Oa’ida’s precise strategic goal in attacking U.S. citizens remains unclear: Was 
9/11 a reprisal for massacres carried out or supported by the United States? Was 9/11 meant 
to “wake up” Americans to what their government was doing in the Islamic world, in the hope 
that they would force it to change its policies? Or was the goal perhaps to provoke a violent 
overreaction by the U.S. government, luring it into Afghanistan, where it would become 
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bogged down (like the Soviet Union before it) in an unwinnable war? Or some combination of 
these goals? What is certain, however, is al-Oa’ida’s belief that it is logical and indeed just for 
it to attack ordinary Americans in order to bring about change in “their” government's policies. 

As in similar cases in which revolutionaries have turned to a strategy of terrorism, al-Oa‘ida 
perceives that the state violence and oppression which it and its constituents suffer has 
widespread civilian support in the United States. At the same time, al-Qa'ida and its Islamist 
sympathizers obviously do not have the type of history of political collaboration with American 
citizens which might lead them to reject a strategy of categorical terrorism; language, religion 
and geography have created a formidable chasm between the two. The confluence of these 
factors, as elsewhere, has strongly encouraged, and continues to encourage, al-Oa’‘ida’s 
terrorist strategy against the United States and its allies. 


Notes 


1. Some authors seem to assume that the tactic of suicide bombing is inherently terrorist in 
nature (e.g., Bloom 2005; Pape 2005). This is incorrect. Suicide bombings may either be 
part of a strategy of conventional or guerrilla warfare (if they are aimed at military and/or 
political targets) or a strategy of terrorism (if aimed at ordinary civilians or noncombatants) 
(Goodwin 2006). 


2. Several indiscriminate bombings were carried out by ANC cadres during the mid-1980s, 
contrary to ANC guidelines, but these caused relatively few casualties and were publicly 
denounced by the ANC leadership. There were also many incidents of violence against 
cadres in ANC camps outside of South Africa, including the killing of dissidents and 
presumed dissidents. But these were targeted killings, not the kind of indiscriminate 
violence that | wish to explain in this article. 


3. Of course, | do not mean to claim that this massacre is an example of categorical 
terrorism. It was not perpetrated by an oppositional political group; on the contrary, It was 
encouraged by the governing Nazi occupation forces in Poland. Accordingly, the 
Jedwabne massacre is not a case of terrorism as | am using the term, although it clearly 
has affinities with terrorism, like other state-sanctioned atrocities. 


4. My thanks to Steven Lukes for suggesting this term. 


5. See the various polls conducted since 1993 by the Palestinian Center for Political and 
Survey Research (http:/\www.pcpsr.org/index.html). 


6. Prior to 9/11, ina December 1998 interview that appeared on a/-Jazeera television, bin Laden 
remarked, “Every American is our enemy, whether he fights directly or whether he pays 
taxes. Perhaps you have heard the recent news that three-quarters of the American people 
support Clinton in attacking Iraq. This is a people whose votes are won when innocents die, 
whose leader commits adultery and great sins and then sees his popularity rise — a vile 
people who have never understood the meaning of values.” (quoted in Lawrence 2005:70) 
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Abstract 

We examine how the shift from state socialism affects gender inequality in the labor market 
using multivariate models of employment exit, employment entry, job mobility and new 
job quality for 3,580 Russian adults from 1991 through 1997. Gender differences changed 
in a complex fashion. Relative to men, women gained greater access to employment, but 
female disadvantage in the quality of new jobs widened. Although these two trends appear 
to be opposite, they are closely related. Both are connected to the introduction of market 
institutions, not gender differences in human capital or structural location in the labor 
market. 


How does the shift from state socialism affect gender inequality in the labor market? 
Many predicted that surging unemployment and the reallocation of jobs across economic 
branches and sectors would hurt women disproportionately. Others argued that market 
reforms would benefit women. Most studies addressing this question analyze gender 
wage gaps with cross-sectional data. We advance the debate by examining how gender 
differences in rates of /abor market transitions (entry to or exit from employment, job 
mobility) and in the quality of new jobs change over time. We analyze employment 
histories spanning 1991-1997 from a representative sample of 3,580 Russians using 
hazard models for labor market transitions and logistic regressions for new job quality. 

Women are disadvantaged on the labor market to the extent that, relative to men, they 
have higher rates of layoff and voluntary employment exit, /ower rates of employment 
entry and job mobility, higher odds that their new jobs are low-quality positions, and /ower 
odds that they are high-quality. We model changes in the “gross” (zero-order) and “net” 
(controlling for covariates) effects of gender on these outcomes after market reforms. We 
compare gross and net effects to see if observable human capital and structural location 
in the labor market produce gender inequality. We find gross and net gender effects of 
similar magnitude throughout the period. These effects change over time, but in a manner 
that defies simple characterization as beneficial or detrimental to women. Relative to men, 
women gained greater access to jobs; however, female disadvantage in the quality of new 
jobs widened. 
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author. We thank Hana Ayalon and Leslie McCall for helpful comments on earlier drafts presented 
at the Research Committee on Social Stratification of the International Sociological Association (RC 
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tgerber@ssc.wisc.edu. 
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Gender, Labor Market Outcomes and Post-Socialist Transition 


The Soviet Legacy and Market Reforms 


Soviet women never reached earnings parity with men, despite the egalitarian claims of 
socialism. The Soviet-era gender wage gap was comparable to levels found in advanced 
capitalist societies (Brainerd 2000). Official Soviet ideals and chronic labor shortages pushed 
women into the labor force. State-sponsored childcare and maternity leaves helped women 
participate in the labor market at nearly the same rate as men. The centralized system for 
determining wages left managers little leeway to differentially allocate pay, making sex 
segregation by occupation and economic branch the primary sources of observed wage 
differentials. On average women earned about 70 percent of what men did, but the narrow 
Soviet wage distribution meant a lower gender gap in absolute wages than in societies with 
wider distributions (Flakierski 1993). 

In January 1992 the new Russian government introduced sweeping market reforms. 
They removed most controls on prices, wages, trade and currency exchange (exposing 
Russian firms to foreign competition), eliminated planning and mandatory state orders, 
curtailed enterprise subsidies, removed restrictions on private ownership of productive 
assets, and privatized small firms. Mass privatization of medium and large state firms 
started in 1993. 

Russia's reformers hoped competitive pressures would compel managers to respond to 
market signals by improving efficiency and re-structuring their product lines and workforces. 
Enterprises would be divested of their social distribution functions and transformed into 
profit-making, competitive entities. Unemployment would increase, as inefficient enterprises 
closed and successful enterprises shed excess labor. In fact, the reforms, in combination with 
fluctuating monetary policies and prior structural conditions, set off hyperinflation, steep 
declines in output and real wages, steady growth in unemployment, and a sharp rise in 
inequality until growth resumed after 1998 (Blasi, Kroumova and Kruse 1997). 


Increasing Gender Inequality? 


Many observers expected the hardships wrought by the institutional changes and resulting 
recession to hit women harder than men (Posadskaya 1994; Standing 1994; Koval 1995; 
Bridger and Kay 1996; Linz 1996). These predictions translate into a simple hypothesis of 
growing gender inequality during post-socialist transition: 


Hypothesis 1 — Over time, women’s gross labor market disadvantages 
increase in magnitude. 


Theories identify three factors that could produce such a growth in female disadvantage: 
(1) gender differences in human capital; (2) occupational, branch and sector segregation by 
sex (inherited from the Soviet era); and (3) increased employer discrimination against women. 

Human capital theory attributes female labor market disadvantages to lower levels of 
human capital on the part of women (Mincer and Polachek 1974). According to “market 
transition theory” (Nee 1996), market institutions intrinsically reward human capital more than 
state socialist institutions do. If women in post-socialist societies lag behind men in human 
capital endowments, increasing economic returns on human capital imply greater female 
labor market disadvantage (Shu and Bian 2003). Although Russian women equaled and 
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surpassed men in average educational attainment in the 1970s (Gerber and Hout 1995), they 
concentrate in less lucrative fields and forms of study (Gerber and Schaefer 2004). Thus, 
women and men with the same level of education may differ systematically in their specific 
skills. Women’s position as primary family caretakers may have afforded them fewer years 
of work experience (Ogloblin 1999). Women were underrepresented in the Communist Party 
of the Soviet Union (CPSU); CPSU membership is a reasonable proxy for unobserved human 
capital (ambition, motivation, organizational skills) not captured by education and experience 
(van der Lippe and Fodor 1998; Gerber 2000, 2001).' 

If human capital differences account for gender inequality on the labor market, then 
statistical controls for education, experience, and CPSU membership will reduce estimates of 
both stable and dynamic gender gaps in labor market outcomes: 


Hypothesis 2a — Women’s labor market disadvantages, net of human 
capital, are smaller than their gross disadvantages. 


Hypothesis 2b — Women’s labor market disadvantages, net of human 
capital, increase more slowly than their gross disadvantages. 


Our measures may inadequately capture gender differences in human capital, particularly 
given gender parity in levels of education, which masks gender differences in the types of 
degrees received within each level (Gerber and Schaefer 2004). Therefore, we might observe 
some residual female disadvantage, net of such controls, even if the human capital 
explanation is valid. However, the returns on human capital appear not to have increased 
during Russia's market transition (Gerber and Hout 1998; Gerber 2002). Thus, even if gender 
differences in human capital persist, they may not drive changes in gender inequality in the 
labor market. 

The second explanation for women’s anticipated labor market disadvantages during 
market transition emphasizes structural segregation — gender differences in locations within 
occupations, branches and (private vs. public) sectors (Ogloblin 1999). The sex-typing of 
occupations and industries leads to women’s earnings disadvantages (Kilbourne et al. 1994). 
State socialist economies exhibited high levels of sex-segregation by occupation (Trappe and 
Rosenfeld 2004). Women held lower routine non-manual, low-grade technical, and poorly paid 
professional occupations. During market transition wages stagnated in these types of jobs 
(Gerber and Hout 1998). Women also have been concentrated in services. While services 
have expanded dramatically since the onset of market reforms, those service industries 
where women predominate (trade, catering, personal services, healthcare, education) may 
have lower wages or slower growth (Ahlander 2001). Finally, data from 1991-1995 show that 
Russian women were underrepresented in private sector firms, which usually pay higher 
wages than state-owned firms (Gerber and Hout 1998). 

Gender differences in structural locations within occupations, branches and sectors might 
adversely affect not only women’s relative earnings, but also their labor market opportunities. 
Low earnings in an occupation could reflect low effective demand for that occupation relative 
to supply2 In that case, incumbents in low-paying occupations should have higher layoff and 
quit rates, as employers respond to contracting demand for their products during market 
transition. The segmentation by branch and sector of Russia's labor market (Gerber 2002) 
implies that women's initial locations in poorly performing branches and the state sector limit 
their mobility to dynamic branches and the private sector. 

lf gender differences in labor market location explain women’s disadvantages then 
statistical controls for occupation and firm characteristics will reduce estimates of stable and 
dynamic gender gaps in labor market outcomes: 
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Hypothesis 3a — Women’s labor market disadvantages, net of 
structural location, are smaller than their gross disadvantages. 


Hypothesis 3b - Women’s labor market disadvantages, net of 
structural location, increase more slowly than their gross 
disadvantages. 


The structural explanation also directs our attention to job transitions because gender 
differences in frequency and type of job shifts perpetuate sex segregation by occupation, 
branch and sector. 

The third theory predicting women’s growing disadvantages posits increased 
discrimination against women. The transition removed external constraints on gender 
discrimination by employers. Patriarchal attitudes of male superiority persisted under the 
socialist regime, but they were suppressed by the Soviet state (Ogloblin 1999; LaFont 
2001). Following the Soviet collapse, official support for gender equality gave way to social 
and political pressure on women to assume traditional roles of mother and homemaker 
(Degtiar 2000). State institutions and policies that limited gender discrimination — e.g., 
centralized wages, protective provisions in labor law and state-provided maternity and 
child care benefits — receded (Khotkina 2001; Rzhanitsyna 2001; CEDAW 1999). Latent 
patriarchal attitudes were given free reign, resulting in overt discrimination against 
women. Russian trade unions do little to protect women from discrimination 
(Podshibiakina 2001). The nascent women’s movement has struggled (Sperling 1999). 

Whether or not employers are pre-disposed to discriminate due to taste or culture, they 
may have increased incentives to statistically discriminate against women, if they perceive 
women as less ambitious and committed to work, more oriented toward family concerns, 
or more likely to demand maternity and sick leave. The competitive pressures of the 
market system and the withdrawal of state subsidies (though not formal mandates) for 
maternity leave and childcare increased the costs of hiring employees with such traits. 
Managers interviewed by Clarke and Kabalina (2000) said business needs justified hiring 
men because female employees are less flexible, less ambitious, and more expensive. A 
study of 225 privatized enterprises revealed violations of women’s labor and social rights, 
massive layoffs of women, and reluctance to hire women following privatization 
(Liborakina 2001; CEDAW 1999). Thus, the growth of the private sector might augment 
women's disadvantages. 

This perspective attributes growing gender inequality to discrimination — not human 
capital or structural location — and thus contradicts hypotheses 2b and 3b: 


Hypothesis 4-— Women’s disadvantages net of human capital and 
structural location increase as rapidly as their gross disadvantages. 


Net gender effects on some outcomes — particularly voluntary exits from work — may 
result from women's own choices, which reflect, among other things, socially conditioned 
cultural norms. As Ashwin and Bowers (1997) observe, some argue that Russian women 
typically resent having to work and prefer to remain at home now that they can. Yet, their 
new freedom not to work may be offset by increased economic pressure: most 
households need multiple incomes to survive during the post-socialist recession (Ahlander 
2001; Khotkina 2001). Therefore, voluntary job quits could reflect pressures to quit — 
perhaps in response to wage arrears, loss of benefits or harassment — rather than a 
preference to stay home. 
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Change of Direction: Can Market Transition Reduce Gender Inequality? 


Some research on Russia (Ashwin and Bowers 1997; Monousova 1998; Reilly 1999) and 
Eastern Europe (Fodor 1997; van der Lippe and Fodor 1998) suggests market reforms actually 
reduce gender inequality. These studies indicate that women maintain their employment more 
effectively than men, despite having fewer opportunities in the most promising areas of the 
economy. In this view, women have lower rates of job loss because they are concentrated in 
branches (like services) that expand after market reforms. Some Russian employers may 
discriminate statistically in favor of women, viewing them as more reliable and more sober 
workers. Women’s lower wages make them cheaper to hire and retain. Women are more likely 
to have forms of specialized human capital (Tam 1997) such as skills in clerical work and 
accounting that become increasingly valued in the market environment. The competitive 
theory of discrimination (Becker 1971) holds that employers who discriminate cannot compete 
for long. Unlike the situation under state socialism, in Russia's nascent market economy 
competitive disadvantages from discrimination against female employees and applicants could 
put employers out of business. In contradiction to hypotheses 1-4, human capital, structural 
location and employer discrimination may all favor women in post-socialist Russia. 

Another perspective implies no change in gender effects on labor market transitions: the 
status of women deteriorates in the post-socialist economies mainly due to change in the 
shape of the overall wage distribution, not gender-specific factors (Brainerd 2000; Glinskaya 
and Mroz 2000). Market reforms decentralized wage-setting, producing a more skewed wage 
distribution. Women’s initial concentration at the bottom of the distribution meant their wage 
disadvantage relative to men increased as the distribution expanded at top and compressed 
at the bottom. Structural changes in the wage distribution could explain gender wage gap 
trends only if men and women maintain their relative positions in the wage distribution. But 
systematic gender differences in labor market transitions and the quality of new jobs almost 
certainly change the gender balance at different points of the wage distribution. 


Variation by Education 


In post-Soviet Russia, the gender earnings gap is proportionally smaller among university 
graduates than among those with less education (Gerber and Hout 1998; Gerber and 
Schaefer 2004). If the typical skill mix of highly educated women is more valuable in a market 
economy (Fodor 1997), it implies: 


Hypothesis 5 — Women’s labor market disadvantages are lower 
among those with university education. 


By checking for an interaction between gender and level of education, we also improve our 
measure of human capital, which yields stronger tests of Hypotheses 2a and 2b. 


Data and Methods 


Our data come from the Survey of Employment, Income and Attitudes in Russia (SEIAR). 
Omnibus “Monitoring of the Socio-economic Situation in Russia,” surveys fielded by the All- 
Russian Center for Public Opinion and Market Research (VTsIOM) in January and March 1998 
incorporated a pre-tested battery of questions on the timing of all changes in respondents’ 
employment status and jobs, as well as the occupation, sector, branch and firm size of each 
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job since January 1991. For each survey, a sample representing the Russian population aged 
16 and older was drawn using multistage techniques.’ Altogether, 4,818 interviews were 
completed: 2,410 in January, 2,408 in March. 


Outcomes and Models 


We estimate continuous-time hazard models (where the time metric is months) for four labor 
market transitions: inter-firm job shifts, job losses (layoffs), voluntary employment exits (quits) 
and employment entries (from non-employment). Inter-firm shifts generally represent 
“nositive” outcomes. Because they incur a cost to the worker, they must generally bring some 
advantage that outweighs the cost (Gerber 2002). Preemptive shifts to avoid impending 
layoffs, which we cannot distinguish from freely chosen shifts, are still preferable to dismissal. 
For obvious reasons, we treat employment entry as a positive outcome and job loss as 
negative. We provisionally treat quits as negative outcomes, because they can result from 
poor working conditions, wage arrears or pressure from employers. But they might also proxy 
confidence in future labor market prospects or favorable household finances. 

Our models treat the baseline hazard as invariant across months within years and explicitly 
parameterize various specifications of year-to-year change. Adult, working-age respondents 
(women 18-55, men 18-60) with completed education are at risk for the first three events 
during months when they are employed for hire and not on maternity leave or under 
conscription.’ 3,472 respondents were in this risk set at some point during the period 
analyzed. Respondents who change jobs remain at risk for all three outcomes. The risk set for 
employment entry includes working-age adults with completed education who were 
unemployed (by the ILO definition) or out of the labor force: 878 respondents met these 
criteria in some month. Of our original 4,818 respondents, 3,580 have at least one valid spell 
in at least one of the event history analyses.° For all analyses we apply post-sampling weights 
that reproduce the four-way cross-classification of birth cohort, education, gender and type of 
residence (urban vs. rural) from the 1,994 Russian micro-census. 

We estimate logistic regressions for the odds that an observed job shift is “high quality” or 
“low quality.” We treat, somewhat implausibly, the quality of a shift as exogenous to its 
occurrence, to simplify analysis and avoid conflating the probability of any shift with the 
probability of a particular quality of shift. Conceptually, a new job's “quality” is based on what 
it pays. Lacking data on earnings prior to the survey, we measure job “quality” based on the 
average earnings associated with, respectively, the sector (public vs. private), branch and 
occupation of a particular job. Accordingly, we treat a new job as high-quality if it is ina new 
private firm, a privatized firm, a firm in a high-wage branch or a better-paying occupation. New 
jobs in low-wage branches or worse-paying occupations are low-quality.’ The logit models for 
transition to new private, privatized, high-wage industry and low-wage industry firms are 
estimated on the sample of inter-firm transitions with non-missing data on, respectively, the 
property form (N = 907) and industry (N = 913) of both origin and destination jobs. Those for 
upward (N = 1017) and downward (N = 1014) earnings mobility are estimated on the sample 
of all job shifts (including within-firm), excluding those ineligible for upward (downward) due 
to ceiling (floor) effects. 

Our hypotheses postulate changes in gender differences following market reforms, but 
theory does not clearly specify a particular temporal pattern. Therefore, for each outcome we 
test different specifications of the interaction, woman*f(y) — a dummy variable for woman 
multiplied by some function of y, years elapsed since 1990. We consider four plausible 
specifications of f(y): (1) fy) = (V > 0), a dummy variable indicating all years subsequent to 
1991, represents a “one-time shock therapy” effect immediately following the 1992 reforms 
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with no subsequent change; (2) f(y) = /nfv+7) denotes a “diminishing shock therapy” scenario 
of reform effects that continue but decelerate as the transition proceeds; (3) fy) = y2 reflects 
an “accelerating impact” pattern, whereby the influence of market reforms starts out weak 
but increases exponentially over time; and (4)fy) = Infy+7) + y2, -the log-quadratic 
specification, can capture an “impact reversal” process of non-monotonic change. 

To test for change in gross gender differences, our baseline model includes only dummy 
variables for women and for all years subsequent to 1991. We assess changes in the zero- 
order gender effect'by adding, in turn, each of the four specifications of woman *fy). If none 
of these specifications improve the log-likelihood significantly, we conclude the effect of 
gender is Static. If at least one specification significantly improves model fit, we adopt 
whichever specification yields the greatest improvement.® That model provides our best 
estimate of the gross gender differences during any year from 1991 through 1997. 

To assess change over time in net gender differences, we test the woman*f(y) specifications 
in the same manner using a preferred multivariate model as the initial baseline. Our preferred 
multivariate models include the significant effects of education, experience, CPSU membership, 
employer tenure, place of residence (locality size, regional unemployment, average wages and 
service sector development), employer characteristics (branch characteristics, sector and size), 
occupational percent female (a measure of the degree to which an occupation is “feminized”), 
time and significant interactions among some of these variables.° We also test a 
woman*college interaction to assess hypothesis 5. For each outcome, we obtain preferred 
models including only significant effects using a procedure described in the Appendix and 
illustrated in Table A1. Employer and occupational characteristics do not apply in the models 
for work entry; instead, we examine effects of non-employment status (unemployed vs. non- 
participant) and its interaction with gender. From the preferred models for each outcome we 
calculate expected gross and net gender differences for each year from 1991 through 1997, 
which we use to assess hypotheses 1-4. 


Descriptive Statistics 


Snapshots from the employment histories of the SEIAR respondents taken at the outset of 
1991 and 1998 permit comparisons of men and women and of pre- and post-transition 
periods with respect to human capital, structural location, labor force status and labor market 
activity (Table 1). Because we censor observations at retirement age, the men in our sample 
have more average work experience in any given year, though the gap is offset somewhat by 
higher male mortality. Men outstrip women in CPSU membership, both at rank-and-file and 
cadre (administrative) levels. But women hold an advantage in educational credentials, which 
grows from 1991 to 1998. Although different forms of human capital vary by gender in Russia, 
there is no consistent male advantage or disadvantage. 

Our data capture the decline of hired employment and the rise of non-participation, 
unemployment and self-employment for both men and women following Russia's transition to 
the market. From 1991 to 1998 female unemployment grew at a faster rate (814 percent) than 
male unemployment (529 percent). But women remained less likely to be unemployed than 
males by 1998. Conversely, the increase in voluntary non-employment for males (362 percent) 
outstripped that for females (186 percent), yet by 1998 men were still less likely to be voluntary 
non-participants. Cross-sectional unemployment and non-participation rates reflect job loss, 
quit and employment entry rates, so these cross-sectional data are difficult to interpret. 

Our data show clear gender differences in structural locations within branches, sectors, firm 
sizes and occupations, both pre- and post-transition. Prior to the market reforms men were over- 
represented in extractive industry, agriculture, construction, transportation/communications and 
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public administration, women in trade/catering, health/social services, education/science and 
credit/finance/insurance. These differences tended to remain stable or grow from 1991 to 1998. 
Women remained over-represented in the state sector throughout despite some inroads into 
private firms. 5 

Naturally, women work in occupations where the percentages of female incumbents are 
much higher. More surprisingly, even larger proportions of both male and female employees 
work in occupations that, in the long term, have higher percentages of females than males. 
During the market transition the occupational structure has shifted toward more feminized 
occupations and away from male-dominated occupations. 

All four transitions we analyze were more frequent in 1997 than in 1991, for both sexes. 
Women had higher layoff rates than men in 1991, but in 1997 men had higher layoff rates than 
women. Women caught up to men in inter-firm shifts and job entry. In 1991 no respondent 
entered employment from non-participation, and unemployed males and females entered new 
jobs at about the same rate. In 1997, unemployed males entered less frequently, but non- 
participating males entered more frequently. Women caught and exceeded men in shifts to 
new private firms, but fell far behind in shifts to privatized firms. Most strikingly, women already 
had fewer shifts to high-wage branches and more to low-wage branches in 1991. These 
disparities grew in 1997. Finally, women had less upward occupational earnings mobility in 
1991 but more in 1997, and they had higher rates of downward mobility in both years. 


Statistical Results 


Labor Market Transitions 


We focus on the expected gross and net gender effects based on our preferred models, which 
contain only statistically significant variables."° Using the coefficients from our zero-order and full 
models, we calculated annual gross and net “gender gaps” in the hazards of inter-firm job shifts, 
job entry, involuntary job loss and voluntary employment exit (Table 2). The “gap” equals 1-female 
hazard/male hazard for the first two (positive) outcomes, and 1-male hazard/female hazard for the 
latter two (negative) outcomes. Thus, larger positive numbers denote greater male advantage, zero 
denotes gender equality, and more negative values denote greater female advantage. 

Russian women have lower rates of inter-firm shifts than Russian men. Contrary to 
Hypothesis 1, their’ disadvantage remains stable throughout the period. The gross gender gap 
is .30. The preferred full model includes a significant interaction between gender and college 
education consistent with Hypothesis 5: the net gap is .44 among those without college, but 
only .17 among the college educated. Russian women, especially those without college, have 

fewer job opportunities outside of their current firm than Russian men. Differences in human 
capital and structural location actually favor women, contrary to hypotheses 2a and 3a: the 
average net gender gap is 9 percent greater than the gross gender gap." 

As many feared, women had higher layoff rates early in the transition. However, the gross 
and net gender gaps declined over time, actually reversing in 1996: henceforward women 
had /ower layoff rates than men. The gross and net gender effects are very similar: gender 
differences in human capital and structural location do not drive either the overall gender gap 
or its trend. The gender gap in layoffs does not vary by education (nor do the gaps in 
employment entry or employment quits). Russia's market transition changed the gender gap 
in layoffs, ultimately in a manner favorable to women. 

The same conclusion follows from the trend in the estimated net gender effect on 
employment entry. Initially, non-employed women entered jobs at significantly lower rates than 
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Table 2: Gross and Net “Gender Gaps” in Labor Market Transitions, Russia 1991-1997 


en SEL EEE 
A. Positive Outcomes (gender gap defined as 1 - female to male hazard ratio) 
1991 1992 1993 1994 1995 1996 1997 


Inter-firm Job Shift 
Gross _.30 Oey Bre00 30 30 30 30 
Net Overall* 39 39 39 139 39 39 roo 
Net Non-college  .44 44 44 44 44 44 44 
Net College Grads .17 mk he ne ate cl AL 
Employment Entry 
Gross .74 iS 10 65 55 39 11 
Net Overall* .81 19 10 66 56 44 21 
Net Unemployed __.43 At .36 .26 08 = -.20 -.66 
NetNLF  .86 86 84 82 18 M1 .60 


ee eee re epee eer 
B. Negative Outcomes (gender gap defined as 1 - male to female hazard ratio) 
1991 1992 1993 1994 1995 1996 1997 


Involuntary Job Loss (layoff) 
Gross .28 as 22 A, DAF ee TS -.37 
Net .28 ee 23 ie 08  -.06 -.25 
Voluntary Employment Exit 
Gross _.00 .00 .00 .00 .00 .00 .00 
Net -.91 AB ie 20 Maier Ou esl .00 .06 


non-employed men with the same characteristics. Also, the gender gap has been consistently 
larger among those outside the labor force.” This conforms to stereotypical gender roles, which 
stigmatize men and valorize women who choose not work. But women in both labor force 
categories gained rapidly on men. Among those actively seeking work, women reached rough 
(net) parity in 1995. By 1997 they were substantially more likely than men to find a job. Among 
non-participants, women closed the gap from 86 to 60 percentage points. Comparing the 
average net and gross gender gaps, differences in human capital favor women, not men."° 

In 1991 women were /ess likely than men to quit employment, but only controlling for the 
effects of human capital and structural location: there were no gender differences in gross 
rates at any time. Considering other results, this probably reflects women’s greater concerns 
that finding a new job would prove difficult if they quit their job. Economic necessity 
apparently outweighed any preferences to abandon employment for home life. The data belie 
the widespread view that women are more prone to leave the labor force voluntarily than 
men. However, women’s net “advantage” on this count disappeared, but our interpretation 
suggests it did so due to growing parity in opportunities for new employment, not increased 
pressure on women to quit. 

Our models for the gross and net gender gaps in labor market transitions contradict 
hypotheses 1 and 4. Plots of the net gaps clearly illustrate (Figure 1) that the disadvantages 
women faced during the first years of the transition diminished, reversed or remained stable as the 
transition proceeded. No form of female disadvantage increased. By the late 1990s, Russian 
women were less likely to be laid off than men, and unemployed women were more likely to find 
a job than similar men. Women still had lower inter-firm job mobility, but no more so than before 
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market transition. As their opportunities for employment grew, so did women’s confidence in the 
prospects for new employment, hence their rates of quitting caught up with men’s. Hypotheses 
2a, 2b, 3a and 3b receive no support. If anything, gender differences in human capital and 
structural location enhance women’s access to employment, and they play no role in explaining 
the trends over time. Hypothesis 5 finds limited support: for the college educated, the net gender 
gap in inter-firm shifts is lower. 


Quality of Job Shifts 


lf trends during Russia’s market transition in access to employment favor women, the trends 
in the quality of job shifts favor men (Table 3)."* Gross and net female disadvantage increased 
steadily for shifts to privatized firms and to high- and low-wage branches. The gender gap 
increased after initially decreasing for downward occupational mobility. Net of controls, male 
job changers were about twice as likely as females to move to a better-paying occupation. 


Table 3: Gross and Net “Gender Gaps” in Quality of Job Changes, Russia 1991-1997 


LE 


A. High Quality Shifts (gender gap defined as 1 - female to male hazard ratio) 
1991 1992 1993 1994 1995 1996 1997 


To Private Firm 


Gross 54 OU 19 07 -.05 -.15 -.25 
Net Overall* 1 33 14 -.09 -.14 -.24 -.30 
Net Non-college 64 A7 oo 24 14 .06 -.02 
Net College Grads she, -.27 -.07 -83  -1.06 -1.27 = -1.47 
To Privatized Firm 
Gross .00 .03 10 22 36 50 63 
Net .00 .04 to oe 49 .65 18 
To High-wage Branch (1-F/M) 
Gross 48 .50 09 .62 10 18 85 
Net .00 .04 16 we .50 .66 19 
Upward Occupational Earnings Mobility 
Gross .00 .00 .00 .00 .00 .00 .00 
Net i00 53 (53 ABS 53 53 ae 


B. Low Quality Shifts (gender gap defined as 1 - male to female hazard ratio) 
1991 1992 1993 1994 1995 1996 1997 
To Low-wage Branch 


Gross 56 OT .60 63 68 14 19 

Net Overall* 40 43 A5 46 59 66 14 

Net Non-college 05 rO0 109 .63 .69 14 .80 

Net College Grads ~—--.10 -.07 .00 10 2a FOr Ed 
Downward Occupational Earnings Mobility 

Gross 14 18 -.32 -.45 -.22 19 ye 


Net .69 38 ming 13 22 40 .60 
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The findings on job quality support hypotheses 1 and 4. Trends of the gross and net 
effects were identical, so plots of the net effects for high-quality shifts reveal the broad 
pattern (Figure 2). Initially, women had the same odds as similar men of entering jobs in a 
high-wage industry or privatized firm. But with few privatized firms in 1991, those odds were 
low for all. As the transition progressed, women’s (conditional) mobility to privatized firms and 
high-wage industries fell sharply (and, in fact, nearly in tandem) relative to men’s, suggesting 
growing barriers to female entry to high-quality jobs. Gender inequality in upward earnings 
mobility remained stable during the reform period. 

Women did gain greater access to new private firms, where wages tend to be higher. 
Interestingly, college-educated women had more access to jobs in such firms than college- 
educated men from the start. Women with less education started out with less access than 
their male peers, but they closed the gap by 1997. This trend probably reflects the impact of 
market pressures on female employment: sma// private employers recognized that female 
labor of equal productivity comes cheaper than male labor and that they could realize a 
competitive advantage by hiring women. 

Russia's market transition — particularly from 1995 onward — increased women’s entry to 
poor quality jobs relative to men’s (Figure 3). Already in 1991 women without college were 
more likely to get low-wage branch jobs, and the odds ratio increased as the transition 
progressed. By 1997 women without college who changed employers were nearly five times 
more likely than equivalent men to land in a low-wage industry. Initially, women with college 
had parity with similar men, but by the end of the period they too disproportionately landed 
such jobs. Women who changed jobs in 1991 were 4.4 times more likely than otherwise 
similar men to move to a lower-paying occupation. Market reforms initially reduced, without 
eliminating, this disadvantage. However, the best-fitting interaction is curvilinear. By 1995 the 
gender gap in downward mobility began trending upward. By 1997 it substantially exceeded 
its 1992 level. 

Gender differences in human capital and structural location play a small and inconsistent 
role in generating women’s disadvantages." The relevant controls slightly narrow the gaps 
with respect to shifts to high-wage and low-wage branches, but only initially. Otherwise, the 
controls either do not affect the gender gap or they widen it. In the case of upward 
occupational mobility, there is no gross gender gap at all, but a net gap of .53.'° Contrary to 
hypotheses 2a, 2b, 3a and 3b, gender differences in human capital and structural location 
tend to offset, not explain Russian women’s disadvantages. For shifts to new private firms 
and low-wage branches the gender gap is smaller among the college educated. Thus, 
Hypothesis 5 is supported for three of the 10 outcomes we analyze. 


Discussion 


Table 4 summarizes our results regarding the trends in the gross gender gap (Hypothesis 1), 
the comparison of net with gross gender gaps (hypotheses 2a to 3b), trends in the net gender 
gap (Hypothesis 4), and the interaction of gender and college education (Hypothesis 5). 
Gender differences in labor market outcomes changed during the course of Russia’s market 
transition, but the changes uniformly harmed neither women nor men. 

At the end of the Soviet era, women faced disadvantages in rates of layoff, employment 
entry, and among those without a college education, inter-firm job changes. Following market 
reforms, the first two disadvantages declined and even reversed. By 1997 men had 
substantially higher layoff rates than women, and unemployed women found work faster than 
unemployed men. Improving opportunities for female employment help explain growing 
gender parity in voluntary employment exit rates. As women grew more confident that they 
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Table 4: Summary of Results from Models of Labor Market Transitions 





Net for 
College 
Educated 
Trend in Net vs. Trend in vs. Non- 
Gender Gap: Gross _ Gross Net college 
Access to Jobs 
Inter-firm Shifts stable wider stable smaller 
Employment Entry decreasing wider decreasing same 
Involuntary Job Loss (layoff) | decreasing same decreasing same 
Voluntary Employment Exit stable N/A* N/A* same 
Quality of Jobs 
Shift to New Private Firm decreasing same decreasing smaller 
samethen 
Shift to Privatized Firm increasing wider increasing same 
narrower 
Shift to High-wage Branch __ increasing then same increasing same 
Shift to Higher Earning 
Occupation stable wider stable same 
narrower 
Shift to Low-wage Branch _ increasing then same increasing smaller 
decreasing, decreasing, 
Shift to Lower Earning then then 


Occupation increasing same increasing same 


*For this outcome, females begin with a net advantage that diminishes to zero over time. 


could find future employment, they were less afraid to quit their jobs. Women’s access to 
employment increased relative to men’s during Russia's post-socialist transition period. These 
findings contradict claims (Posadskaya 1994; Koval 1995) that women have been over- 
represented among the new unemployed in Russia. 

At the same time, the gender gap in the quality of new jobs has remained stable or 
widened. With one exception — access to jobs in new private firms — Russian women have 
been increasingly excluded from high-quality jobs and channeled into low-quality jobs. Even 
as the labor market has given women more opportunities for employment, it has become 
more segmented by gender in ways that increase women's economic disadvantages. 

How can market reforms both increase women’s relative access to employment and 
channel them away from high-quality jobs toward low-quality jobs? We see both trends as 
related to one another and to the process of post-socialist transition. The combined forces 
of market competition and lower female wages underlie expanded female access to 
employment. Early in the transition, employers who wished to discriminate against women 
could do so after the state withdrew supports for female employment. Initially they perceived 
few economic incentives not to discriminate, due to lack of familiarity with market institutions 
and uncertainty as to how long market reforms would last. But as the transition progressed, 
they learned that they had to cut labor costs without sacrificing productivity to compete with 
domestic and foreign firms. Meanwhile, they realized that women offer a better bargain in 
productivity per unit cost. Prior gender discrimination implies that remaining female 
employees were more productive on average than the remaining male employees. Also, 
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female wages were lower, and increasingly so, for comparable work. Eventually the forces 
of market competition and labor market gender segmentation combined to provide strong 
incentives for hiring and retaining female rather than male employees. We doubt that Russian 
employers lost their taste for discrimination against women; instead, they-eventually found it 
too costly to satisfy. 

Apart from the trends we have identified, three other findings are consistent with this 
interpretation. First, between January 1991 and January 1998 the average occupational 
percent female increased for both men and women by 3 to 4 percentage points. This implies 
that employers favored hiring people for jobs in female-dominated occupations. Presumably, 
they were motivated by the lower wages of such occupations: the correlation between the 
mean logged earnings and the percent female in an occupation is -.54. Russian employers 
facing competitive pressure could cut costs by shifting the balance of their workforces 
toward feminized occupations, making whatever adjustments necessary in their product lines 
and operations. Expanding female-dominated occupations should increase female 
employment opportunities. 

Second, early in the transition female job changers with college degrees were more likely 
than equivalent males to land in new private firms. In contrast to privatized firms, the 
managers and owners of new private firms had to be concerned with competition and cost- 
cutting from the outset. The early timing of their preference for highly qualified females over 
highly qualified males confirms our interpretation that market pressures (combined with lower 
female wages) enhance female access to employment. 

Third, growing female disadvantage in the quality of new jobs provides evidence for the 
second part of the equation: female wages have decreased (lowering the relative cost of 
female labor) due to gender segmentation of the labor market. In turn, the decrease in 
women's wages stems proximately from their greater propensity to take jobs in low-wage 
industries, lower-paying occupations and the state sector, as well as their lower propensity 
to take jobs in high-wage branches and (net of control variables) better-paying occupations. 
Thus, increasing female disadvantage in the quality of new jobs complements women's 
improving access to employment in post-socialist Russia. As the average quality of women’s 
jobs declines, so do women’s wages, making them more attractive to employers. 

Why are Russian women more likely to take poorer quality jobs? They must have, on 
average, lower reservation wages than Russian men. Female-headed households are 
especially poor, so female heads are desperate to accept any job that is available. In dual- 
headed households, women often have secondary earner status. The gender gap in 
reservation wages may have increased during the economic crisis of the 1990s, which has 
put greater pressure on the budgets of all types of households. Gender norms inherited from 
the Soviet era could make lower female wages self-reproducing by reducing women’s 
expectations, leading them to accept jobs men would not. 

Ashwin (2002) argues, based on interviews, that the Soviet gender order encouraged 
women to work, but also to put their families first and accept secondary status at work as 
natural. Still guided by such norms, when women look for work they emphasize stability of 
income rather than to the level of pay or status. In contrast, men define their status — as in 
Soviet times — through their work, so they prioritize pay and status over providing for their 
families. Ashwin’s female interviewees saw little chance for improving their job positions, 
while the men were more optimistic. 

Other cultural factors —-e.g., sex-stereotyping of certain occupations, branches of the 
economy and types of work — could also play a role. Such factors might also account for the 
lower propensity of women to enter better-paying occupations (which tend to be male 
dominated) and high-wage branches. Of course, employers could discriminate. Our data do 
not contain the information needed to rigorously test these alternative explanations."” But 
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whatever the explanation, our results clearly show growing female disadvantage in the quality 
of new jobs, which (somewhat ironically) makes women more attractive employees in an 
increasingly competitive context by driving down their average wages. 

The gender gap in earnings cannot be attributed to changes in the overall wage 
distribution, as some have argued (Brainerd 2000; Glinskaya and Mroz 2000). Gender affects 
movements in and out of employment, job mobility and the quality of the new jobs that result 
from job mobility, all of which affect wages. The pattern whereby Russian women have 
become more likely than men to be employed, but also to be employed in poorer quality jobs, 
could account for growth in the gender wage gap, particularly if men at the bottom of the 
earnings distribution are more likely to be unemployed (and hence have no observed wages). 

The patterns we identify in Russia may not hold true for other former socialist countries. 
The impact of market transition on inequality generally depends on the particular policies by 
which reforms are implemented, .pre-existing structural conditions and the precise 
institutional trajectories in a given country (Walder 1996; Gerber 2002). Prior gender 
differences in human capital and structural location, as well as cultural and political factors 
such as the prevalence of patriarchal attitudes and the strength of feminist movements, might 
also shape how markets affect gender inequality. Only rigorous cross-national comparisons 
can reveal whether there are common patterns of dynamic gender effects in post-socialist 
labor markets. But such comparisons require more systematic empirical work on individual 
countries. Our study of Russia shows that a dynamic approach to modeling the effects of 
gender on movements in and out of employment, job mobility and the quality of new jobs can 
yield unique insights into the connection between gender inequality and market reforms. 

One important task for future research on gender differences in the Russian labor market 
will be to expand the time frame of analysis. Since 1998, the Russian economy has improved 
steadily, and market institutions have established firmer roots compared to the mid 1990s. 
Perhaps a different gender dynamic has emerged in the context of improving employment 
and wages. A second key task will be to incorporate measures (lacking in our data) of marital 
status, household size and household structure, to explicitly test whether the effects of 
gender on labor market outcomes vary by family situation (Trappe and Rosenfeld 2004). 


Notes 


1. Some attribute the advantages of former CPSU membership to social capital (Rona-Tas 
and Guseva 2001). But even if selection into the CPSU did not imply higher levels of 
unobserved human capital, participation in CPSU-related activities could have cultivated 
skills associated with increased productivity. 


2. For example, the oversupply of engineers and other professionals relative to the needs 
of the Soviet and post-Soviet economies helps explain the low earnings of many 
professional occupations in late Soviet and post-Soviet Russia (Gerber and Hout 1998: 
Gerber 2008; Gerber and Schaefer 2004.) 


3. Primary sampling units (urban settlements and rural administrative regions) were grouped into 
66 strata based on geography, administrative status, population size and settlement type. 
Twelve of Russia's 89 provinces were excluded due to political strife (war in Chechnya) or their 
remoteness and population sparseness. Target sample volume was distributed proportionally 
to the size of the adult population within each stratum. PSUs (108 in January, 109 in March) 
were selected randomly with probabilities proportionate to size and Moscow and St. 
Petersburg self-representing (automatically included in the sample). Secondary sampling units 
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were electoral districts in urban areas and settlements in rural areas. Within SSUs, addresses 
were selected using a random walk algorithm. At each address, the occupant over 15 years 
old with the closest birthday was selected for interview. The response rates were 61.9 
percent in January and 66.1 percent in March. Interviews were verified at 298 randomly 
chosen addresses in January, 180 in March. See Gerber (1999) for more details. 


Self-employment is employment outside the labor market; therefore, we do not consider 
gender differences in self-employment entry or exit. Gerber (2004) reports persistent 
female disadvantages on both counts in Russia during the 1990s. 


We excluded 1,158 respondents due to age restrictions. Another 80 did not complete 
their education by the time of the survey, retired prior to 1991 due to injury or illness, were 
self-employed throughout the entire period 1991-1997, or had missing data on 
employment status for the entire period. 


Earnings are conventionally treated as the most important aspect of job quality. We lack 
the data to consider other aspects, such as benefits and part-time or contingent status 
(Kalleberg, Reskin and Hudson 2000). 


In post-Soviet Russia average earnings are higher in private-owned firms than in state-owned 
firms, even controlling for other determinants of earnings (Gerber and Hout 1998; Gerber 2000, 
2002). “High-wage” branches include Extraction, Construction, Transport/Communications, 
and Finance/Insurance. “Low-wage branches” are Agriculture, Trade/Catering, Health/Social 
Services, Education/Science, and Culture/Entertainment. Upward (downward) occupational 
earnings mobility is defined as a change occupation where the destination occupation is at 
least one fifth of a standard deviation higher (lower) on a scale of Russian occupational 
earnings estimated at the four-digit ISCO88 level using nine Russian surveys spanning February 
1992 to March 2000 (valid N = 12,023). Gerber (2002) provides more details on these 
measures. 


Because the log-quadratic model uses two degrees of freedom, we compare its fit to the 
best fitting of the models nested with the In(y+1) and y2 specifications of fly). 


We scaled occupations at the 4-digit |SCO88 level by their proportion female based using 


data from nine Russian surveys spanning February 1992 to March 2000 (valid N = 18,825). 


We will provide details upon request. This variable, “occupational percent female” 
measures the degree to which an occupation is “feminized,” i.e., the degree to which 
women predominate among those who work in the occupation. Apart from occupational 
percent female, which we describe in the Appendix, we use the same specifications for 
these variables as in Gerber (2002). 


Because the models for employment entry and voluntary employment exit have not been 
reported previously, we provide parameter estimates and fit information for the preferred 
models in the appendix. Parameter estimates from the preferred models for the other 
outcomes are all very similar to those reported by Gerber (2002). 


The statistically significant interaction between gender and college education complicates 
evaluation of hypotheses 2a, 2b, 3a and 3b. For the purpose of a direct comparison of 
equivalent effects, we report the “average” net effect in such cases — that is, the weighted 
average of the net gender gap for college graduates and non-college graduates. 
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12. This follows from the statistical significance of a woman*unemployed interaction term in 
the preferred full model. 


13. Firm and job characteristics are not relevant to employment entry because no members of the 
risk set are employed. The average net effect for employment entry is computed using the 
annual proportions of females in the risk set who are unemployed and not in the labor force. 


14. These effects pertain to the odds that a shift is to a “high” or “low” quality job (measured 
variously), Conditional on the occurrence of a shift. This complicates interpretation, 
because the occurrence is, no doubt, partly endogenous to its quality. But gender-specific 
patterns in the quality of actual shifts offer insights into gender differences in labor market 
opportunities and orientations. Women’s higher conditional odds of shifts to low quality 
jobs reflect not only lower availability of high quality jobs, but also men’s greater 
unwillingness to change employers if doing so leads to a low quality job. 


15. Ham et al. (1999) reach a similar conclusion in their analysis of unemployment spells in the 
Czech and Slovak republics in the early 1990s. 


16. The main indirect effect is structural: women are more concentrated in poorly paying 
occupations. They have greater chances for upward occupational mobility simply because 
more higher-paying occupations are available. In the reduced-form model, this indirect 
effect masks the strongly negative direct female effect. We also found that occupational 
percent female reduces the (conditional) odds of a high-wage industry job and raises the 
odds of downward earnings mobility. 


17. Had we data on the marital status and household composition of respondents prior to the 
time of the survey, we might formally analyze how family obligations shape the labor 
market behavior of Russian women (or men). 
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Appendix 


Our variable specifications are identical to those in Gerber (2002), though we add an additional 
variable: occupational percent female. For this measure, we scaled occupations at the four- 
digit ISCO88 level by their proportion female using data from nine Russian surveys spanning 
February 1992 to March 2000 (valid N = 18,825). 

We obtained our preferred multivariate models following the steps illustrated in Table A1: 
(1) Estimate a baseline model (main effects of all variables and dummy variables for years 
1992-1997) and test experience-squared and woman*experience. (2) Test interactions 
between education and structural variables: new private firm, logged firm size, branch 
employment change, branch percent private employees, regional services index, and regional 
unemployment. (3) Evaluate change over time in the effects of experience, education, CPSU 
membership, sector, and branch percent private by testing (for each variable x) the four 
specifications of the f(y) component in the expression f(y)*x discussed above (see p.17). (4) 
Incorporate the best-fitting forms of significant interactions from Step 3, trim non-significant 
main effects, and smooth the pattern of change over time in the baseline hazard using the 
same specifications for f(y). The best-fitting specification gives us our initial preferred model. 
We build our model for employment entry using only those sub-models that are relevant. 

Once we have initial “preferred” models for each outcome, we perform two more steps: 
(5) test the woman* college interaction and the main effect of occupational percent female. 
For some outcomes — such as voluntary employment exit —- we must first restore the main 
effect of gender in the model, since it was not statistically significant originally. 6) Test the four 
specifications of the f(y)*woman interaction. 

Parameter estimates for the final full models for voluntary employment exit and 
employment entry are provided in tables A2 and A3, respectively. The preferred specifications 
of the models for the other outcomes are identical to those reported in Gerber (2002, Table 
A1), with the addition of those main and interaction effects from steps 5 and 6 that proved 
statistically significant. We will provide full information on all models estimated upon request. 
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Table Al: Model Selection Example: Hazard Models for Voluntary Employment Exit* 


Part 1: Repeat Steps from Gerber (2002) to Obtain "Preferred" Model with Main Additive 
Gender Effect 


Mode 2 
| Variables Added -2LL Contrast x df p 

Step 1: Estimate Baseline Model and Specify Experience Effect 

[1] Baseline Model 2509.72 

[1a] Experience Squared 2508.09 vs. [1] 1.62 Teens 

1b Woman*Experience 2509.60 vs. [1 re Laelia 
Step 2: Test Interactions Between Education and Structural Variables 
[2a1] [1] + College*Newpriv 2509.71 vs. [1] 01 1  .926 
[2a2] [1] + College*Ln(firmsz) 2508.67 vs. [1] 1.04 eee ate, 
[2a3] [1] + College*Br%EmCh 2505.20 vs. [1] 4.52 1 .034 
[2a4] [1] + College*Br%pri 2509.42 vs. [1] 30 1 ~ 884 
[2a5] [1] + College*Services 2508.11 vs. [1] 1.61 fe eue205 
[2a6] [1] + College*UnempRat 2509.04 vs. [1] .67 if PPe412 


(2b], Benoa equivalent sub-steps 1-6 for interactions with "less than secondary" — none 
significant. 


Bee 3: Test Interactions of Time with Experience, Education, CPSU Membership and 
roperty 


[3a1]  [2a3] + Experience*In(y + 1) 2498.69  vs.[2a3] 6.51 Lede Cd 
[3a2]  [2a3] + Experience*y” 2492.58 vs. [2a3] 12.62 1 .000 

2a3] + Experience*y + 
[3a3] xperience’*In(y + 1) 2491.64 vs.[2a3] 13.56 Z ten DOd 
[3a4]  [2a3] + Experience*(y > 0) 2505.19 vs. [2a3] 0.01 10 toh 


(30) through [3i]: Repeat equivalent substeps 1-4 for interactions involving: b. college; c. less 
than secondary; d. CPSU cadre; e. CPSU rank-and-file; f. new private firm; g. privatized firm; 
and h. branch % private. 

Step 4: Add All Interactions, Trim Non-significant Effects, Smooth Trend in Baseline 


[2a3] + Interactions [3a2], [3c2], 
[4a] and [3g1] 2484.73 vs. (2a3] 20.47 3.000 


[4b] [4a] - Non-significant Main Effects 2507.23 = vs. [4a] 22.49 17 ~~ «166 
oe - All Year Dummies (Null 





[4c] hange Over Time) . 2537.01 vs. [4b] — 29.79 6  .000 
Replace Year Dummies with Smoothed Functions of Year: 
[4d] fy)=y’ 2520.36  vs.[4b] 13.14 Semele 
vs. [4c] 16.65 Te 000 
[4e] fy) = In(y + 1) 2511.68 vs.[4b] 4.46 5 486 
vs. [4c] 25.33 1  .000 
[4f] fly) =In(y+ 1)+y’ 2911.64 vs.[4b] 4.42 4 352 
vs. [4c] 25.37 2 000 
[4g] fy=>0) 2526.53 vs. [4b] 19.30 pe OuZ 
vs. [4c] 10.48 bee 00i1 
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Table Al: continued 


Model Selection Example: Hazard Models for Voluntary Employment Exit* 


Part 2: Test for Effect of Percent Female in Occupation and Interactions of Gender with 
College Degree and Time Using Preferred Model 


Mode 


| Variables Added -2LL Contrast re df p 


Step 5: Test for Interaction of Gender with College Degree and Main Effect of Percent 
Female in Occupation 


[5] [4e] + Main Woman Effect 2509.72 vs. [4e] 1.97 tere 
[Sa] [5] + Woman*College 2509.06 ——_-vs.[4e] 2.62 Ze eet 

vs. [5] .66 1 418 
[5b] = [5] + Occ%Female 2507.31 vs.[4e] 4.38 2 At2 


vs.[5] 2.41 1 121 
Step 6: Test for Change Over Time in the Effect of Gender 


[6a] [2a3] + Woman*in(y + 1) 2505.93 vs.[4e] 5.75 2 .056 
vs[5] 3.78 1  .052 
[6b] [2a3] + Woman*y? 2508.59 vs.[4e] 3.09 2 213 
vs[5] 142 1 289 
[2a3] + Woman*y’ + Woman*in(y 
[6c] +1) 2508.59 vs.[4e] 3.09 2 213 
vs[5} 112 1 289 
[6d]  [2a3] + Woman*(y > 0) 2508.78 vs.[4e] 2.90 2 .234 
vs.[5 .94 Taree 


‘Complete fit statistics for this model and models for other outcomes available from author. 
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Table A2: Preferred Model for Voluntary Employment Exita 


ET 


B SE 
Constant ASOT sare el 
Woman 646 ™* .289 
Experience FU cae piaenmenall AU 
Less Than Secondary Education ‘GG ee ere tT 
CPSU Rank-and-File Member SOUS ie weed 
Defense Industry 418 233 
Agriculture 7 OO a ee 
Tenure > Year -.885 7. 3148 
Privatized Firm -2.687 “* 1.224 
Branch Percent Employment Change DOE Sho1e8.003 
Firm Size (In) 099 ™* 042 
Ln(year + 1) 3407) *% 148 
Interactions: 
College x Branch Percent Employment Change 030 a aceeat 
Experience x Year? POU sei eee) 
Woman x Ln(year + 1) Oza y * .188 


Privatized x Ln(year + 1) 1.740 eet 


Number of Observations 3174 
Person-months at Risk 210748 
Number of Events 378 
Log-likelihood -1253 


*Risk set: Currently employed, working-age respondents with completed education. 
Standard errors are robust. Dummy variables for missing industry, property, firm 
size, and Communist Party membership are included, but not shown. "Year! 
denotes integer years elapsed since December 1991. 

*p < .05, one-tailed **p < .05, two-tailed 
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Table A3: Preferred Model for Employment Entrya 





B SE 


Constant -3,.245 ss .230 
Woman -1.977 = 227 
| Age (-18) -.016 ie .006 
College Degree 334 ‘ 178 
CPSU Cadre 1.399 2 317 
Specialized Secondary Degree .369 a ao 
Population of Locality (In) ue 3 033 
Moscow -.576 oY .269 
Regional Mean Wage 1.312 ‘ 13d 
Regional Services Index .020 : 012 
Regional Unemployment Rate .033 021 
Year’ -.014 eS .006 
Interactions 
Woman x Unemployed 1.415 a .208 
Unemployed x Year? -.019 tere 60 
Woman x Year’ .030 ome asl: 
Number of Observations 825 
Person-months at Risk 20750 
Number of Events 473 


Log-likelihood -367.37 


*Risk set: Currently unemployed or et ee ev working-age 
respondents with completed education, Standard errors are robust. 
Dummy variable for missing Communist Party membership is. 
included, but not shown. "Year" denotes integer years elapsed since 
January 1991. 

*p < .05 one-tailed **p < .05 two-tailed 
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Refining the Measurement of Women’s Autonomy: 
An International Application of a Multi-dimensional 
Construct 


Rina Agarwala, Princeton University 
Scott M. Lynch, Princeton University 





Abstract 

Women’s autonomy has long been a central concern for researchers examining the social 
position of women in developing countries. However, little emphasis has been placed on the 
measurement of autonomy, despite its importance for assessing the validity of comparative 
research. In this research, we use confirmatory factor analyses to determine (1) whether 
items thought to measure autonomy in fact form a reliable measure of autonomy, (2) 
whether the relationship between multiple dimensions of autonomy are strong enough to 
justify a discussion of autonomy as a single underlying construct, and (3) whether 
comparative research on autonomy ts possible between two countries (India and Pakistan). 
We find that our indicators capture four distinct dimensions of autonomy that are 
moderately related, and that, while the model structures replicate fairly well across the two 
countries we study, there are measurement differences that make comparative research 
challenging. 


The concept of women’s autonomy has been an important one in sociology and social 
demography for more than two decades (e.g., Connell 1987; Cubbins 1991; Ferree and Hall 
1996; Kane and Sanchez 1994; Mason 1986). Early literature defined autonomy as “the 
degree of access to and control over material and social resources within the family, in the 
community and in the society at large.” (Dixon-Mueller 1978) More recently, the definition of 
autonomy has been broadened to include “the ability to influence and control one’s personal 
environment” (Safilios-Rothschild 1982) or “the capacity to obtain information and make 
decisions about one’s private concerns and those of one’s intimates.” (Dyson and Moore 
1983). These definitions assert a single construct that captures the multifaceted ability to gain 
control over the circumstances of one’s life. Among women, attaining such control is viewed 
as a key to improving their living conditions. An in-depth examination of this construct, 
therefore, can bolster our efforts to alleviate gender inequalities. 
In this paper, we investigate the empirical utility of the “autonomy” construct by examining 
(1) whether items thought — and commonly used — to measure several aspects of autonomy 
in fact form a reliable, unified scale and (2) assuming they do, whether the scale can be used 
successfully in comparative research. We use the term autonomy to reflect “the extent to 
-which women exert control over their own lives within the families in which they live, at a 
given point in time.” (Jejeebhoy 2000: 205) In other words, we define autonomy in its static, 
individual form, rather than as a group process. As Jejeebhoy (2000) argues, various terms, 
including “status,” “autonomy” and “empowerment,” have been used over time to capture 
some element of gender equality in the household and community. “Status” was often 
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confused with “prestige” or “esteem” in the eyes of men, while “empowerment” referred to 
a more dynamic process of challenging existing power relations and gaining greater control 
over sources of power. Unlike autonomy, the term “empowerment” emphasizes the collective 
aspect of power, in addition to the individual one.' While the collective aspect of power is no 
doubt paramount to gender equality, especially in certain cultural contexts, it is the individual 
aspect of power (captured through the term “autonomy”) that appears more frequently in the 
quantitative literature on gender and power. This paper, therefore, fits within the context of the 
existing literature on autonomy. 


Background 


Autonomy has been equated with mechanisms used to alter rigid gender stratification, a 
central concept in sociological discourse (Connell 1987; Ferree and Hall 1996). Throughout 
the 1970s, modernists attempted to narrow sexual inequalities by increasing women’s 
choices in education, employment and reproduction. In response to persisting gender 
inequalities, despite such attempts to assimilate women into modern life, a new generation 
of scholars pointed to the vital role that women’s power or autonomy plays in accessing 
and maximizing choices in the first place (Blumberg 1984; Fernandez -Kelly 1994; 
Nussbaum 2000). 

In the mid-1980s, scholars began to analyze the implications of autonomy through in- 
depth empirical research in developing countries. Employing quantitative analyses of field- 
level data, they showed that increased female autonomy was correlated with reduced fertility, 
improved child nutrition and education, and improved standards of living for women (Balk 
1994; Basu 1992; Basu and Basu 1991; Dyson and Moore 1983). Based on such findings, the 
United Nations asserted in,1995 that women’s autonomy is essential to human dignity and 
must be considered a basic human right (UNDP 1995). Since then, several new studies have 
further examined the important determinants and effects of women’s autonomy in different 
contexts. (Balk 1997; Hashemi, Schuler and Riley 1996; Hehui 1995; Jejeebhoy and Sathar 
2001; Kritz and Makinwa-Adebusoye 1999; Morgan and Niraula 1995). 

To date, however, there has been little evidence to show if and how these existing findings 
can be compared across the different contexts. Moreover, few have seriously considered 
how autonomy should be measured, within and across context. This is surprising given both 
the wealth of literature underscoring why autonomy is an important concept to study and the 
rich data available to analyze women’s autonomy in different countries. Precise measures of 
theoretical constructs, such as autonomy, can help provide empirical evidence for their 
existence and are essential to ensuring robust work on the roots and consequences of the 
constructs. In turn, such work can improve our understanding of the dynamics of gender 
Stratification as well as our policy prescriptions to narrow gender gaps throughout the world. 


Indirect Proxy Measures of Autonomy 


Since Mason (1986) first highlighted the conceptual and methodological issues that arose in 
the early studies on women’s autonomy, a consensus on how we cannot measure autonomy 
has emerged. Autonomy cannot be measured, as it often was, using a single observable 
characteristic, such as women’s education or labor force participation rates, as an 
approximate indicator of autonomy. These one-dimensional proxies, or indirect measures, are 
highly imperfect and have grave policy implications, especially when used to analyze the 
predictors and effects of autonomy (Balk 1994; Jejeebhoy 1991; Viassoff 1994). 
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First, proxies for autonomy are extremely context dependent, which makes comparative 
research on autonomy difficult. For example, while women’s education may be highly 
correlated with women's increased autonomy in one setting, it may be completely 
uncorrelated in another.? Second, proxy measures do not provide ample evidence for how 
well they capture the construct of autonomy. Without an understanding of how various 
proxies are correlated with one another and to autonomy, it is difficult to compare the results 
from studies using different proxies. Third, proxies blur the channels through which autonomy 
works. For example, when examining the impact of autonomy on fertility using education as 
a proxy for autonomy, we cannot be certain whether a decrease in fertility from increased 
education is due to the direct effects of education, or to education's effect on autonomy, 
which in turn affects fertility. Finally, proxies obscure which dimension of autonomy is being 
measured (Whyte 1978). Scholars have long argued that autonomy is comprised of multiple 
dimensions; each dimension is determined by, and predicts, different demographic and 
socioeconomic factors. 


Direct Measures of Autonomy 


Recently, scholars have turned from using indirect proxies to quantify autonomy, choosing 
instead direct measures. These direct measures consist of a combination of observable items 
or indicators that are categorized into different dimensions of autonomy, such as access to 
and control over resources, participation in economic and child-related decisions, self- 
esteem, mobility, freedom from domestic violence, and political awareness and participation 
(Balk 1994; Balk 1997; Blumberg 1994; Hashemi, Schuler and Riley 1996; Jejeebhoy 2000; 
Jejeebhoy and Sathar 2001; Morgan and Niraula 1995). 

Studies using direct measures have addressed many of the inadequacies of the earlier 
indirect-measure approach. They have explicitly quantified the mutli-dimensionality of 
autonomy, thereby clarifying the roots and consequences of each dimension. In addition, 
direct-measures have illuminated the channels through which economic and social factors 
(such as education and labor force participation) affect autonomy, rather than confounding the 
causes and effects of autonomy (Goetz and Sen Gupta 1996; Kritz and Makinwa-Adebusoye 
1999; Mason 1997). 

Some of the inadequacies of the earlier literature, however, remain unaddressed in the 
literature using direct measures. First, few have addressed the issue of context dependency; 
elements that may genuinely reflect autonomy in one context may be irrelevant or impossible 
in another (Davis and Robinson 1991; Entwisle, Henderson, Short, Bouma and Fengying 
1995: Kenworthy and Malami 1999; Lobao and Brown 1998; Orloff 1993). A failure to address 
this issue has undermined the scope of comparative research on autonomy. 

Second, recent research has not shown how reliably each direct measure, or observed item, 
reflects a particular dimension of autonomy. Most studies construct a dimension of autonomy as 
a simple summative index of a series of dichotomous, observable items. For example, the 
mobility dimension is often measured by asking women whether they are able to go alone toa 
series of places, such as the market, the health center, the next village, etc. Those who answer 
“yes” receive 1 point; those who answer “no” receive 0 points. At the end of the series of 
questions, a respondent's answers are aggregated to equal her score for mobility ranging from 
0 to the total number of questions asked on mobility. This procedure is then repeated for each 
dimension (Balk 1994; Hashemi, Schuler and Riley 1996; Jejeebhoy 2000; Morgan and Niraula 
1995). Few studies test the inter-item reliability of each index by measuring the correlations 
between the items within each dimension. In some cases, internal consistency of a dimension 
has been tested, but the measurement error within each item has not been considered (Balk 
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1994; Mason 1986). Of all research on autonomy, Mason’s works are perhaps the most 
significant in paying considerable attention to the measurement of autonomy. For example, in 
reference to the domestic violence dimension used in one of her papers, Mason writes, “This 
scale [index] is relatively weak and is used here primarily for convenience.” (Mason 1997: 3) 

Finally, while direct measures have highlighted the important multidimensionality of 
autonomy, some of the literature has undermined the justification for conceptualizing autonomy 
as a single, empirical concept by analyzing the predictors or effects of each dimension of 
autonomy separately (Morgan and Niraula 1995; Viassoff 1992). In such cases, scholars are 
effectively assuming that the correlations between the dimensions are zero, thereby weakening 
the power of their own conclusions regarding the net causes or impact of autonomy in general. 
Analyzing each dimension separately provides robust evidence only for the impact of each 
individual dimension of autonomy. 

Among the studies that do analyze the strength of the relationship between the dimensions 
of autonomy, the relationships appear to be low to moderate. This raises important questions as 
to why we should study autonomy as a single measure at all. Balk (1994) interprets the low 
correlation coefficients between the dimensions she studied as evidence for the “successful 
measurement of largely distinct dimensions of women’s status.” (Balk 1994: 43) Other scholars, 
however, interpret the weak and inconsistent associations between some dimensions as 
evidence that those dimensions reflect aspects of autonomy that differ from those reflected by 
other dimensions that are more positively related (Jejeebhoy 2000; Mason and Smith 1999). 

Alternatively, the weak correlations may be due to measurement methods. None of these 
studies accounted for measurement error within the observed items used to measure each 
dimension of autonomy. Nor did they allow the observed items within each dimension or the 
dimensions themselves to have differential weights. For example, in creating a measure for 
total autonomy, Jejeebhoy (2000) and (2001) summed all six dimensions into a single index for 
autonomy, which forces all the dimensions to have equal weights and a correlation of one. 
Consideration of measurement error and differential weights could possibly produce higher 
correlations between the dimensions and provide evidence for considering autonomy as a 
more unified construct. Alternatively, it may shed greater light on why certain dimensions are 
not as strongly correlated to the other dimensions, and to what extent we can continue to 
analyze the weakly associated dimensions as a part of an overall construct of autonomy. 


Data 


This paper uses data from the Survey on the Status of Women and Fertility (SWAF), a survey 
conducted in 1993-94 on women and their husbands in five countries in Asia (Smith, Ghuman, 
Lee, and Oppenheim Mason 2000); we analyze the data from two of the five survey countries, 
namely India and Pakistan. The survey was specifically designed to measure women’s 
autonomy and its relationship to reproductive behavior. It is one of the first surveys that has 
tried to operationalize the multiple dimensions of autonomy. 

The India survey includes 1,842 rural Indian women, covering both North and South India, 
and both Hindus and Muslims. The survey was conducted in two states — Uttar Pradesh (UP) 
in North India and Tamil Nadu (TN) in South India. Traditionally, UP has had lower indicators 
of gender equality than TN. The survey samples evenly from two more developed sub- 
districts and two less developed sub-districts in each state.’ It also draws evenly between 
Hindus and Muslims and over samples castes that are numerically small. In order to ensure 
this ethnic mix, villages were merged into clusters of 1,000-2,000 households. One cluster 
from each of the sub-districts was chosen at random, and houses in each chosen cluster 
were put on house lists, which constituted the sampling frame. Approximately 800 currently 
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married women, ages 15-39, were randomly selected for interviews in each of the four sites. 
Husbands who were present were also interviewed. 

The Pakistan survey includes 1,050 rural Pakistani women. All interviews were conducted 
in the province of Punjab, which covers 52 percent of Pakistan's population and 56 percent 
of its geographical area. The province was divided into three agro-ecological zones that were 
developed by the Pakistan Agricultural Research Council. The three zones, North Barani Belt, 
the Central Zone and the Southern Zone, represent varying degrees of development, cultural 
and linguistic traditions, and feudal regimes, all of which affect gender equality indicators. The 
North Barani Belt reflects higher indicators of gender quality than the highly feudal Southern 
Zone. Based on Pakistan’s Federal Bureau of Statistics’ master list of rural Primary Sampling 
Units (PSUs), 10 sites were randomly selected from the three zones. Prior to sampling, the 
PSU list was restricted to sites with a population size between 2,500 and 4,999 in 1991. After 
conducting a household census in each site, a sampling fraction was adopted to ensure a 
minimum of 100 interviews in each of the 10 sites. 

It should be noted that the samples are not representative of the entire population of India 
or Pakistan, and our results cannot be generalized to the national level. However, the samples 
taken in India and Pakistan are relatively similar to one another and are representative of a 
population that the women's autonomy literature is concerned with, namely rural women in 
developing countries. 


Methods 


We use a confirmatory factor analysis approach to examine the measurement of autonomy (Bollen 
1989). Confirmatory factor analysis, unlike exploratory factor analysis, places @ prior/ structure on 
the data and allows the explicit testing of competing hypotheses regarding the measurement 
properties of indicators thought to reflect a theoretical construct. In this research, we begin with a 
confirmatory factor analytic approach in analyzing the Indian data, but then refine the measurement 
models baséd on substantive considerations and modification indices. We then replicate the 
analyses with the Pakistani data following a strictly confirmatory factor analytic approach. This 
approach allows us to assess whether, given a fixed “best” model for the Indian data, the model 
fits the Pakistani data. The choice of country on which we based our initial models was arbitrary. 
In the case that the Pakistani data do not fit the Indian model, starting with the Pakistani data would 
simply lead to a different best model. This fact should be kept in mind: We do not profess that 
our scale for autonomy is the right one — rather, we propose that it is right for the rural Indian 
context, and our goal is to determine whether this model is transferable to another culture. 

We began with a set of 54 items thought to reflect autonomy. These items were chosen 
based on two criteria: (1) they were measured in our data, and (2) existing research has used 
them to represent autonomy. Because this paper aims to test existing measures of autonomy, 
we do not assert or test entirely new observable items that may represent autonomy. Therefore, 
we do not include items that meet the first criteria but not the second. Unfortunately, due to data 
limitations, we were also unable to include items that meet the second criteria and not the first.* 
Based on the 54 selected items, we then constructed the following four dimensions of 
autonomy to test on our Indian data: autonomy from violence (7 items); autonomy in family 
decisions (16 items); autonomy in community involvement (15 items); and autonomy in 
household economics (16 items). Table 1 presents a listing of all 54 items, along with the 
dimensions of autonomy they are thought to reflect, according to the existing literature. 

All items were coded as ordinal. Estimators for confirmatory factor analyses require 
covariance or correlation matrices as input for estimation. These matrices are generally computed 
using Pearson productmoment covariances or correlations. However, these measures are 
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inappropriate for ordinal data. Given that the variables measuring autonomy in this study (as well 
as most others) are measured at the ordinal level, a more appropriate approach is to estimate 
polychoric correlations between the variables and to use these resulting matrices as input into the 
structural equation modeling software. Polychoric correlations are a measure of the correlation 
between the latent continuous (and normally distributed) variables thought to underlie the crudely 
measured observed variables. Methodological studies have shown that, in addition to being a 
more theoretically appropriate measure of the association between ordinal variables, polychoric 
correlations correct for the negative bias that using Pearson correlations produces, making the 
results of structural equation analyses stronger (Joreskog and Sorbom 1986). Thus, for these 
analyses, matrices of polychoric correlations were used in estimation, and the weighted least 
squares estimator was used to estimate all measurement models. 

The analyses proceeded in four steps. (1) We first estimated simple one-factor models for 
each dimension of autonomy using the Indian data. These models were then revised based 
on inspection of the parameter estimates and modification indices. In some cases, significant 
similarity in wording of items within each dimension required the inclusion of either 
correlations between errors of similar items or the inclusion of an item-level random effect to 
eliminate the effect of wording. In other cases, the initial models indicated that the items 
reflected different sub-dimensions of autonomy. In those cases, we revised the model to 
consist of more than one substantive latent factor. In some cases, certain items did not load 
well on any factor. These items were deleted, leaving us with a revised, final set of factors. 
(2) We tested each of these final models against one roughly equivalent to the standard 
approach to measuring autonomy: using summed scales. (3) After the dimensions of 
autonomy were investigated, we combined all dimensions into a single model in order to 
determine the extent to which the dimensions of autonomy were related. Finally, (4) we 
replicated the Indian analyses using data from Pakistan in a multiple group analysis. Multiple 
group analysis allows the simultaneous estimation of models for two or more groups with the 
imposition of increasing numbers of constraints. In brief, if all measurement model 
parameters can be constrained to be equal across groups, then the groups can be said to be 
equivalent with respect to measurement (Bollen 1989). In this setting, such a finding would 
provide evidence that the measurement of autonomy may be independent of context. 

We choose to use data from Pakistan to replicate our model for one primary reason — the 
existing evidence for the highly context-dependent nature of autonomy. A comparison across two 
relatively similar cultures (India and Pakistan) provides leverage with which to determine whether 
our autonomy measures are replicable independent of confounding factors that could arise from 
substantial differences in context. This is a conservative test of the viability of comparative 
research on autonomy and can only provide a first step in a longer analysis on the subject. 


Results 


One-Dimensional Measurement Models 


Violence 

Figure 1 shows the initial model estimated for the dimension of autonomy from violence.’ As the 
figure indicates, all seven violence indicators were initially assumed to reflect a common factor. 
Table 2 shows that the first model estimated had an excellent fit, as assessed by the Incremental 
Fit Index (IFl = .99), but had a significant chi-square (361.49, 14 d.f., p < .001) and an RMSEA 
greater than .1. By these measures, the model does not fit the data well (see Bollen 1989: IFIs 
should be above .9 for an acceptable fit and above .95 for an excellent fit; chi squares should be 
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nonsignificant; and RMSEAs should be below .05 for an excellent fit and below .1 for acceptable 
fit). An examination of the model parameters revealed that items v1 and v7 loaded poorly on the 
factor. The poor loading could be due to either measurement or substantive differences between 
these items and the other items. Items v2-v6 ask whether the respondent feels a husband would 
be “justified in beating his wife” under various circumstances, while items v1 and v7 ask about 
actual fear and experience of violence. Wording differences may account for the poor loading of 
v1 and v7. Alternatively, there may be a substantively meaningful difference between experiences 
of violence and the perceived legitimacy of violence. 


Figure 1a. Initial Measurement Model for Autonomy from Violence 


1 


In our second model, we added another latent variable representing a wording effect. The 
loadings for this effect were all constrained to be 1, with the variance of the latent variable 
free, making the latent variable essentially a random effect. This model fit the data better, with 
a much smaller chi square, an RMSEA below .1, and an IFI just under 1. In a third model, we 
removed the latent variable for wording and created a separate latent variable. This model fit 
the data better than the previous model and was retained as the final model. Thus, autonomy 
from violence is represented by two factors: one that reflects perceptions of the legitimacy of 
violence in the household and the other that reflects feared or actual violence. Figure 2 shows 
the structure of the final model. 





Figure 1b. Final Measurement Model for Autonomy from Violence 
Legitimacy of 


Feared/Actual 
Violence Violence 
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Family Decisions 

As with violence, autonomy in making (non-economic) family decisions was first modeled as 
a single latent variable with all 16 items loading on it. This initial model did not fit the data well, 
with a large chi-square, RMSEA > .1, and an IFI of .78. (See Table 2 for results of the following 
sequence of models). The loadings for v22 and v23 were very low (as were their reliabilities — 
the explained variance for these indicators), and modification indices indicated that adding a 
correlation between the errors of v13 and v14 would significantly improve the model's fit. This 
error correlation is reasonable, given that both items concern the schooling of children and 
have very similar wording. In the second model, items v22 and v23 were eliminated, and the 
error correlation was added. This model fit the data considerably better, but the RMSEA and 
IFI still suggested the model needed improvement. Modification indices suggested that the 
model would be strengthened by the addition of error correlations between several of the 
items v16-v21. As with several of the violence items, these items are all similarly worded, 
asking whether the respondent feels uncomfortable speaking in front of various people. In our 
third model, we thus added a latent variable to capture a wording effect. The RMSEA for this 
model was acceptable, as was the IFI. Modification indices suggested that v9 loaded very 
poorly (as it had in the previous models), and that an error correlation between v8 and v10 
would improve model fit considerably. Thus, in our final model we dropped v9 and added the 
suggested error correlation. The RMSEA for this model was no better than that of the 
previous model, but the IFI improved slightly (from .92 to .93). Finally, we attempted a two- 
factor model in lieu of the wording effect, but that model did not have a better fit than the 
wording effect model. Figure 3 shows the final model for autonomy in family decisions. 


Community Involvement 

The base model for autonomy in community involvement had a reasonable RMSEA (.08), but 
an unacceptable IFI (.89). As with the previous set of analyses, a number of error correlations 
were suggested by the modification indices (for items v33-v37). These items ask whether the 
respondent has engaged in several activities during the previous week. Thus, in the second 
model, we added a latent variable to capture this wording effect. This model fit the data 
better, with the IFI now indicating acceptable fit (.91). Modification indices suggested the 
addition of an error correlation between v35 and v36, and in the next model we added the 
correlation. The results of that model indicated a very good fit of the model to the data, with 
the RMSEA dropping to .06 and the IFI increasing to .95. In this model, items v24 and v38 
continued to evidence poor loadings and reliabilities. In a final model, we eliminated these 
variables. The results of the final model indicated excellent fit: the RMSEA was .05, and the 
IFl was .97. As before, we attempted one final model with a substantive latent factor included 
rather than a wording effect, but those results were not better than those of model 4. Figure 
4 shows the final model for autonomy in community involvement. 


Household Economics 

The initial one-factor model for autonomy in household economic decisions fit the data very 
well with an RMSEA of .10 and an IFI of .98 (see Table 2). However, the modification indices 
suggested that the inclusion of an error correlation between items v47 and v48 would 
significantly improve the model's fit. In the second model, we added this error correlation with 
a slight improvement in the RMSEA and IFI. Modification indices suggested the addition of 
error correlations between items v39-v44, items which have very similar wording. Thus, in the 
next model, we included a wording effect for these items. This model fit the data only slightly 
better, based on the RMSEA. v64 continued to load poorly in this model, so in a final model, we 
dropped the variable. This model fit the data only slightly better (but not seen after rounding 
of the RMSEA). Figure 5 shows the final model for autonomy in household economics. 
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Figure 2. Final Measurement Model for Autonomy in Family Decisions 
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Figure 3. Final Measurement Model for Autonomy in Community Involvement 
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Comparison to Summed Scales 

Our next step in the analyses was to compare these final four models to summed scales to 
determine if summed scales produce a significant lack of fit of the models to the data. In 
these models, all factor loadings were constrained to 1, wording effects were removed, and 
all measurement errors and error covariances were set to 0. Table 3 shows the results of 
these models. In all cases, after setting the factor loadings equal to 1 for all variables and 
constraining the measurement errors to be 0, the model chi-squares (as well as the RMSEAs 
and IFls, not reported in table) showed a significant loss of fit. 


Table 3: Comparison of Confirmatory Factor Analysis Results to Summed Scales 


Model CFA x°(d.f.) Summed Scale y°(d.f.) Difference °(d.f.) 
Violence 123.52(14)*** 9933.96(25)*** 9810.44(11)*** 
Household Decisions 767.62(62)*** 4562.47(90)*** 3794.85(28)*** 
Community 

Involvement 338.35(63)*** 3521.63(90)*** 3183.28(27)*** 
Household Economics 1209.86(88)*** 5929.58(119)*** 4719.72(31)"™* 
ep 20001 


Four-Factor Combined Model 

In the next step in the analyses, we combined all four (five, counting the second substantive 
violence scale) autonomy factors into a single factor analysis model in an effort to determine 
the extent of the relationship between the latent factors. Table 4 shows the correlations 
between the substantive factors. These results reveal several interesting patterns. First, the 
correlations between the second violence factor (perceptions of the legitimacy of violence) 
and the other factors are either very weak (between the first violence factor and the second) 
or statistically 0 (between the second violence factor and the other latent factors). Second, the 
correlation between the family decisions factor and the household economic decisions factor 
is very close to 1. Third, the remaining correlations reveal moderate relationships between all 
other factors. 


Table 4: Correlations Between Dimensions of Autonomy 


Violence 1 Violence 2 Family Community _ Economics 


Violence 1 Oana 


Violence 2 nes oF" 

Family Age o7*** oe 

Community sO 0 67" 95*** 

Economics A etd 0 9g*** 66t 60 





Note: Variances on diagonal; correlations off diagonal; model fit: chi-square = 
14,886.48(1064)***; RMSEA = .08; IFI = .98 
tp < 001 
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India-Pakistan Comparison 

Finally, we replicated the results for the Indian data with virtually identical data from Pakistan. 
Several studies have argued for the important role that context plays in autonomy (Dyson and 
Moore 1983; Jejeebhoy 2000; Jejeebhoy and Sathar 2001; Mason 1986; Mason and Smith 
1999). As stated earlier, a comparison across two relatively similar cultures (India and Pakistan) 
provides leverage with which to determine whether our autonomy measures are replicable 
independent of confounding factors due to major-differences in context. In order to do the 
replication, we conducted two sets of analyses. First, we re-estimated the models using the 
Pakistani data alone. Second, we estimated a multiple group model that allowed the parameters 
for the Indian and Pakistani data to be freely estimated across groups. Next, we constrained all 
parameters to be equal across the two datasets. Finally, we constrained all the factor loadings to 
be equal across the datasets but allowed the error variances and variances of the latent variables 
to be freely estimated across groups. In these analyses, we had to examine each dimension of 
autonomy independently because the sample size for the Pakistani data was too small to 
estimate the complete asymptotic covariance matrix (used in WLS estimation using polychoric 
and polyserial correlations) for all the variables. 

Table 5 presents the results of these two comparisons. The first column of the table presents 
the results of the models for the Pakistani data only. The results for the model for violence using the 
Pakistani data are comparable to the results using the data on India. The RMSEA is below .1, and 
the IFI is .99. The results for the model for family decisions are, in fact, better for the Pakistani data 
than the Indian data. The RMSEA indicates excellent fit at .045, and the IFI indicates the same at 
.98. In contrast, the model for community involvement does not appear to fit as well to the 
Pakistani data. The RMSEA was .102 (compared to .05 in the Indian data), and the IFI was .82 
(compared to .97 in the Indian data). Finally, the model for household economic decisions indicates 
comparable fit to the Pakistani data. In that model, the RMSEA was .058 (slightly better than the .08 
for the Indian data), and the IFI was .98 (very slightly worse than the .99 for the Indian data). 

The remaining columns in the table show the results from multiple group analyses of the 
Indian and Pakistani data together. The first of the remaining columns shows the results of 
estimating the parameters freely across the countries; the second column shows the results of 
constraining all parameters to be equal across groups; and the third column shows the results 
of constraining only the factor loadings. For the sake of brevity, we do not discuss all of these 
results. In brief, the results of chi-square difference tests (not shown in the table, but easily found 
by subtracting the unconstrained chi square from the constrained chi-square) show that a 
significant loss of fit results if parameters are forced to be equal. Although the loss of fit is 
considerably less when all the variances and covariances of the latent variables — as well as the 
error variances — are freely estimated across data sets, difference chi-square tests continue to 
show a significant loss of fit. However, the overall fit of the models, based on the RMSEAs and 
IFls, is quite good. 


Discussion and Conclusion 


In this research, we have examined in detail the measurement of women’s autonomy, an 
important theoretical construct in sociological and demographic literature on gender and 
development. In doing so, we hope to contribute to a deeper discussion on how best to 
model autonomy in future empirical research and ultimately improve our efforts to alleviate 
gender inequalities. The results of the analysis provide several interesting insights that can 
help further our understanding of the measurement of women’s autonomy in developing 
countries. In particular, the results provide evidence to support and question some of the 
models and claims in the autonomy literature to date. 
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First, we measured the robustness of four dimensions of autonomy: freedom from 
violence, participation in non-economic family decisions, community involvement, and 
participation in household economic decisions. Our measurement models for each 
dimension fit the data well, and the separate autonomy latent variables were intercorrelated 
but not perfectly so. These results lend concrete support to the existing literature arguing for 
the multidimensionality of autonomy (Balk 1994; Jejeebhoy and Sathar 2001; Mason 1986) by 
showing that autonomy items do indeed cluster into distinct and meaningful dimensions. 
Studies on autonomy should, therefore, continue to measure autonomy in its multi- 
dimensional form. With regard to violence, we found this aspect of autonomy should be 
further divided into two sub-dimensions: “feared and actual violence” and “views on the 
legitimacy of violence.” This finding suggests that in India and Pakistan, a woman's views on 
the legitimacy of violence do not reflect the same aspect of autonomy.as do her experiences 
of actual violence. Examining these two aspects separately may, therefore, yield more precise 
results in terms of the causes and effects of domestic violence against women. For example, 
a women's group may raise women’s understanding that domestic violence is unacceptable, 
but it may not provide women with the social or physical resources needed to prevent it. 

Second, we tested each dimension of autonomy against the most common method of 
measuring autonomy in the literature to date: summed scales (Hashemi, Schuler and Riley 
1996; Jejeebhoy 2000; Morgan and Niraula 1995). We found that because summed scales do 
not account for measurement error and differential weighting of the items included in the 
scales, they provide a less appropriate measure for autonomy. Instead, autonomy research 
might benefit from using models that provide more robust measures of autonomy that allow 
for differential weight on observable items within a dimension and account for measurement 
error on each item. It is clear that measuring a theoretical construct through a series of 
observable items bears a host of potential complications due to measurement error. 
Nevertheless, such work is important to pursue. Accounting for the measurement error, as 
well as the differential weights on the observable items, is a relatively low cost way of 
strengthening the important research on women’s autonomy. Given the rich data available 
on autonomy, future research could examine how the causes and effects of autonomy using 
this new measure of autonomy differ from studies using simple summed measures. 

Third, we combined all four dimensions of autonomy into a single model to determine the 
extent to which the dimensions of autonomy were related. Here, again, our results lend 
concrete support for the argument that the dimensions of autonomy are, for the most part, 
moderately related (Balk 1994). This finding indicates that the various dimensions of 
autonomy can indeed be considered part of a single underlying construct, but also that they 
have distinct contributions to autonomy. Future research on women’s autonomy should 
continue to study autonomy as a single, yet multidimensional, concept. Because each 
dimension can variously affect and be affected by different factors, interpretations on the 
causes and effects of autonomy as a whole would be facilitated by analyses that do not 
examine each dimension separately, but rather combine all dimensions into a single model. 
This single model, in turn, should not merely sum all autonomy indicators to produce a single 
measure of women’s autonomy, but rather allow for the individual contributions of each 
dimension by accounting for differential weighting and measurement error 

An important exception to the dimensions being related, however, is the perceived 
legitimacy of violence sub-dimension. Because legitimacy of violence was weakly correlated 
with the experience of violence as well as the other dimensions, we argue that it should not 
be used in measures of autonomy. This finding presents important implications for future 
research on autonomy. Views on the legitimacy of violence may, in fact, reflect education or 
social norms in a community, which in turn can be a cause or effect of autonomy, rather than 
part of autonomy itself. In other words, our findings indicate that legitimacy of violence may 


Refining the Measurement of Women’s Autonomy * 2095 


be measuring concepts that are unrelated to our understanding of autonomy as an indicator 
of control over one’s life. Studies have often combined this dimension into the single 
construct of autonomy. Future research should examine the issue of violence in greater depth 
to better understand its relationship with other dimensions of autonomy. One way to do so 
could be to include alternative observable items.on violence in future questionnaires. For 
example, answers to questions on one’s capacity to respond to or prevent violence might be 
better related to other dimensions of autonomy than are the answers to questions on actual 
violence or perceptions of the legitimacy of violence. 

We also found an important exception to the finding that while the dimensions (aside from 
the perceived legitimacy of violence sub-dimension) are part of a single underlying construct, 
each dimension has a distinct contribution to autonomy. The relationship between two 
dimensions, family decisions and household economic decisions, was very close to 1, 
suggesting that these two dimensions may be collapsed into a single latent construct or 
dimension and examined simultaneously. Again, much of the current literature examines 
these dimensions separately, thereby implying that they reflect different aspects of autonomy. 
This research, however, suggests that they reflect the same aspect. This finding implies that 
the dynamics involved in empowering women in household decision-making simultaneously 
affect a wide range of issues from reproduction and child education to family employment 
allocation. On a practical level, examining family decisions and household economic decisions 
as a single dimension can facilitate future empirical research on autonomy. More importantly, 
however, on a policy level, this finding indicates that efforts to empower women in less 
controversial household decisions, such as nursing a sick child or punishing a misbehaving 
child, may have positive effects on empowering women in more controversial household 
decisions, such as family planning or women’s control over money. Future research should 
study the connections between women’s autonomy in these various household decisions. 

Finally, we replicated the Indian analyses using data from Pakistan to examine the extent 
to which autonomy may be useful for comparative research. These results indicate that the 
model structure for autonomy is replicable across countries (based on the acceptable fit of the 
models of the Pakistani data only). On the other hand, the results are less clear in terms of 
whether the measurement of autonomy is comparable in terms of factor loadings and 
variances. These results are significant in that they suggest, contrary to our initial 
expectations, that even relatively small changes in context can affect the weights used in the 
measurement of autonomy and thus the concept of autonomy itself. _ 

Our findings raise questions on the robustness of comparative research on autonomy 
(Jejeebhoy and Sathar 2001). At the least, they show that comparative research on autonomy 
must allow for differences in measurement errors and variances of latent variables measuring 
autonomy — no matter how small is the change in context. Again, this finding suggests that 
summed scales should not be used in studying autonomy, especially cross-culturally. In the 
current globalizing economy, where women are increasingly crossing borders for work, it is 
important to understand women’s relative power across various contexts, and it iS essential that 
measures of autonomy remain flexible enough to accommodate contextual changes. At the 
same time, future research may also test to what extent the model structure, or dimensionality of 
autonomy, changes with larger differences in context. Having different models with which to 
measure autonomy can make comparing autonomy levels across contexts difficult. Therefore, 
new constructs that can better accommodate contextual changes may need to be developed to 
measure the extent of power women have over their lives. In the long run, measurement 
concerns across context, particularly among sub-populations such as immigrants, may become 
less relevant as cultural practices become independent of geographic location. 

While this study has provided a systematic study of the measurement of women’s 
autonomy, it is not without limitations. First, the items used in measuring autonomy in this 
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research are specific to this survey. That is, other surveys use different measures and may 
therefore reach other conclusions regarding the extent of the relationship between 
dimensions of autonomy. Nonetheless, we feel that greater attention needs to be paid to 
how autonomy is measured in future research. Such attention can help us continuously 
improve our survey questions so we can eventually capture the best observable indicators 
of autonomy. In addition, the findings in this study are subject to the samples in the data 
used. ‘ 

A second limitation to these analyses is that we used data for only two countries: India 
and Pakistan. A more substantial test of the comparability of measurement of autonomy 
across developing countries is needed. Part of this test can be done using the remaining three 
countries in the SWAF data set. Beyond this, however, data limitations to date make such a 
test difficult. Specifically, autonomy is often measured with different indicators not only in 
different surveys, but even within the same survey applied in different countries. In addition, 
samples and sampling techniques must be comparable across data sets of different 
countries. This limitation is thus not so much a shortcoming of this research specifically, but 
a shortcoming of any comparative work on women’s autonomy. 


Notes 
1. For more on this discussion, see Batliwala 1994; Dyson and Moore 1983: Jejeebhoy 2000. 


2. For differing views on the relationship between education and autonomy, see Oropesa 
1997; Sathar, Crook, Callum and Kazi 1988; Sathar and Kazi 1990. 


3. Development indicators were based on income, percentage of roads surfaced and other 
economic criteria. 


4. Due to the quality and depth of the survey, however, there were very few items in this 
category. Most of the items in this category dealt with political participation. 


5. “Autonomy from violence” is the ability to avoid domestic violence against oneself. The 
observable items used in this model are those used in much of the existing literature and 
are thus available in the data set used for this study. 
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It Cuts Both Ways: 
Workers, Management and the Construction of a“Community 
of Fate” on the Shop Floor in a Mexican Garment Factory 


Nancy Plankey Videla, Texas A&M University 





Abstract 

Most studies of lean production are based on surveys of managers. This article examines 
the labor process under lean production at a high-end garment factory in Central Mexico 
through ethnographic research, consisting of nine months of work at the factory, and in- 
depth interviews with 25 managers and 26 workers. I found that implementation of lean 
production is a complex organizational and social phenomenon. I argue that besides a 
focus on quality, just-in-time production and flattened hierarchies, lean production is 
based on a management-sponsored “community of fate” ideology. In this case study, the 
“community of fate” ideology constructed by managers — with its discourse of loyalty and 
sacrifice and its buttressing corporate welfare programs — convinced workers to extend their 
physical, intellectual and emotional labor to the firm. What managers failed to fully 
understand was that in workers’ eyes, the “community of fate” belief also tied the firm to 
the workers. When management reneged on this social pact, workers not only resisted 
management’s efforts to regain control over the shop floor, but also actively used the team 
system to thwart the firm’s economic viability. In the end, instead of controlling workers, 
lean production facilitated worker radicalization and mobilization. 


On March 15, 2001, at 3 p.m. sharp, we walked out on strike. It had only been three months 
since | had graduated from the month-long training at “la escuelita,” the little school. | had 
come to see how a successful lean factory worked. The Mexican garment factory had been 
reorganized five years ago around lean production techniques, such as self-managed, quality- 
at-the-source, and just-in-time production, all to assure efficiency and quality. On the shop 
floor | joined two teams. During my months at the factory, | ironed, followed the flow, and 
helped with repairs in my role of participant observer. | later interviewed workers and 
managers. 

The day of the strike, | was so nervous that | had lost track of time. Lupe, my teammate, 
tapped me on the shoulder. “It’s time to go,” she said in a calm voice. Around us, machines. 
stopped as women and men of all ages got up and walked outside to a loading dock for a 
union general assembly. The words of one strike committee member pierced the air: “I'ma 
single mother just like most of you and | am not afraid of losing my job. The company has 
fired the entire second shift, failed to restore benefits we froze last year to help them out and 
now they want to renege on wage increases. Let's declare ourselves on strike companeros y 
compafieras. We are in this dance together.” 
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Institution program, the John T. and Catherine D. MacArthur Global Studies Fellowship at the 
University of Wisconsin, Madison and an International Research Travel Grant from Texas A&M 
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We stood in the hot sun for what seemed an eternity debating whether to put up the red 
and black flag. The Cutting Department, mostly male, still working on piece-rate and earning 
a higher wage, did not want to go on strike. The other 70% of the workforce consisted of 
women who had been reorganized into teams. They yelled at the men, “You better back us up 
because you are next! Teamwork got us in this mess. And Cutting is next.” The men were 
reluctantly convinced. 

Meanwhile, the strike committee was talking to the labor authorities. Word came down 
that if we put up the flag without their witnessing it, the state would declare the strike illegal. 
Others yelled, “But without the flag we are illegal!” No one knew what to do. In addition, the 
labor authorities gave contradicting information. It was decided that workers would guard the 
facilities so that management could not sneak out products or machinery while the strike 
delegation sat at a table with company representatives and state labor authorities. In the end, 
the state arbiter convinced the delegation not to declare a strike because doing so would 
“officially and unilaterally end negotiations.” If the talks did not work, the flag could go up 
immediately. At least that is what the state arbiter said. However, after 72 hours of wrenching 
and intimidating “discussions,” the union and company signed a contract against workers’ 
wishes. Then the same arbiter who brokered the deal declared the strike illegal: either return 
to work or be fired. After the fact, we learned several local arbitration boards to break other 
strikes around the country had used this delaying tactic. 

Workers reluctantly returned to work. They protested the collusion between the state, 
company, and corrupt union in almost daily marches in the streets and meetings outside the 
factory gates. An independent union was elected, but the state and company refused to 
accept its legality. In protest, women and men workers shut down a major city avenue on 
April 4". The governor sent in the military police — with dogs, batons, and tear gas — to break 
up violently the peaceful sit-in. 

When we returned to work the next morning, over 30 military police in riot gear lined the 
company entrance. That morning my mother insisted on taking Maria, a neighbor and co- 
worker, and me to the factory. We pretended not to be afraid as we walked through the 
outside gates. Many other parents were there; husbands, too. The moment of truth arrived: 
7 a.m. Confused and humiliated, we lined up, showed our identification cards and entered the 
factory gates as the Human Resources Director stared each one of us down. Parents and 
husbands went home. 

After the failed strike attempt, workers struggled to remain united. Work slowdowns 
sabotaged the lines, but the costs of organizing were too high. Leaders, or anyone suspected 
of being one, were fired. Managers at all levels pressured workers to turn in “troublemakers.” 
In the end, it was your job or your neighbors’ job. Company spies were said to be everywhere. 
What had once been one of the highest paying and most cutting-edge factories had become 
a war zone. Several of my female teammates just stopped showing up at work. The stress on 
the shop floor and the interminable demonstrations and rallies had worn down their spirit of 
resistance and solidarity. “It's just not worth it,” they told me. 

Five months after the strike the company reorganized its production lines again. On 
August 16" management announced new rules: no more teams, no more leaders, no more 
moving around the shop floor. As it had been previously under Taylorist work organization, 
every worker would be individually responsible for his or her quota. An enhanced hierarchy 
of supervisors was to assure smooth running of the line, now traditionally organized. The only 
remaining sign of lean production was the emphasis on quality. Each worker was still 
responsible for his or her own and the previous workers’ quality, with task sheets to direct the 
flow and display each person's action on the product. Furthermore, a new layer of quality 
supervisors was set up. Quality supervisors roamed sections randomly checking quality, 
returning bundles with errors and ensuring all workers were following the correct work 
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method within the standard allotted minutes. At the same time, production supervisors 
followed the flow and dealt with absenteeism, turnover, and anything else which would affect 
the final product leaving the line. Management had learned a valuable lesson during the strike. 
They had lost — in their own words — “control of the shop floor” — under teamwork. The lesson 
learned: controlling the workforce was more important than being cutting-edge. 

When | planned my research, | never expected to participate in a strike; neither did my co- 
workers at Moctezuma.1 This was to be a study of a successful, well-established, national 
garment factory in Central Mexico which, in 1996, had reorganized its production around 
Japanese-inspired lean production methods. For five years, managers relinquished authority 
and workers participated in the new production system. To secure consent, management 
employed the strategy of constructing a “community of fate” along the lines envisioned by 
organizational theorist Robert E. Cole (1979), seeking to tie workers to the firm through 
expanded benefits and paternalistic programs. In effect, workers and management entered 
into a normative pact to make the new system work. Yet five years after implementation, the 
experiment ended in a strike and a retreat to a hybrid form of Taylorist assembly-line 
production. In this article, | build upon the concept of “community of fate,” first discussed by 
Cole (1979) and later invoked by Besser (1996), to show how. such a normative agreement 
was central to the success and unraveling of lean production at Moctezuma. 

Economic sociologists claim that to compete internationally, firms would do well to 
reorganize around lean production forms (Gereffi 1999; Gereffi and Kaplinsky 2001; Kaplinsky 
1994: Sabel 1996). In this case, | found that implementation of lean production is a complex 
organizational and social phenomenon. For lean production to work, both workers and 
managers must believe that their interests and the interests of the company are synonymous. 
| will argue that the construction of a “community of fate”, a neglected concept in the 
economic sociology literature, is central to the success of lean production in today’s volatile 
global economy. Without the belief of interdependence to bind workers and managers 
together during times of economic downturn, it is nearly impossible to sustain a functioning 
shop floor organized around consent rather than direct control. To demonstrate the 
importance of a “community of fate” belief, | will show how the labor process at Moctezuma 
was transformed through lean production and the factors that led to its demise. After five 
years of prosperous implementation, management broke the normative agreement that 
sustained the notion of a “community of fate.” It was only at this point that the workers 
refused to consent to the industrial relations regime established, using the tools learned 
through lean production to undermine the firm. 

This research is based on in-depth semi-structured interviews with 26 managers and 25 
workers in addition to participant observation carried out from October 2000 to July 2001 at 
a large national garment factory in Central Mexico. All the in-depth interviews, and many 
informal conversations, were tape recorded with subjects’ consent, thus allowing me to 
represent more accurately diverse views and experiences. Moreover, all research was overt, 
premised on openness and honesty. Permission was obtained from both managers and 
workers to enter the factory as a new worker, commencing with training and then placement 
on the shop floor. To capture more fully the advantages and pitfalls of teamwork, | participated 
in two teams, one considered the worst and the other among the best in terms of 
coordination and productivity as defined by department heads. 

Through daily interaction with workers at the bus stop, shop floor, cafeteria and outside 
the factory gates, workers accepted me as a friend despite my privileged status as a middle- 
class, highly educated Latina. At the same time, managers embraced me because of our 
shared privileges (on more than one occasion we discovered our kids went to the same 
schools and doctors) although they did not understand why | had to labor as a worker, and 
without pay, to boot. This contradictory location between workers and managers allowed me 
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to observe and to ask frank questions about the changes occurring on the shop floor and in 
the front offices. Moreover, through participant observation | was able to understand the 
situations and meanings that had been invisible to me before, such as the ways that 
production structures directly led to conflicts between workers and alignment of the team 
leader with manager. All worker interviews took place away from the factory; however, most 
managers were only available during work hours. To compensate for this disadvantage, | 
returned in 2003 to re-interview several managers who were no longer working at the factory. 
At this time, | also conducted focus group interviews with workers to discuss my research 
findings. 

The article is organized as follows. First, a brief history of the firm and its location within 
the Mexican apparel industry will be provided. Second, a discussion of lean production and 
the specific forms it takes in the garment industry will illuminate changes at Moctezuma. 
Third, a review of labor process theories follows to highlight how production organization 
affects workers. Fourth, | will compare work at Moctezuma under Taylorist, lean and hybrid 
forms of organization in order to understand how the labor process was transformed and how 
those changes affected normative bonds between workers and management. Lastly, | will 
demonstrate the ways in which workers subverted work organization by using core elements 
of lean production in their mobilization and ensuing strike. 


Organizing the Shop floor: Taylorism and Lean Production 


In the late 20" century, management theories about how factories should be organized have 
undergone great changes. For most of the twentieth century, factories have been organized 
around Taylorism, or scientific management, consisting in the separation of conception 
(intellectual labor) from execution (manual labor), the fragmentation and standardization of 
work into simple measurable operations, incentive-based payment, and close management 
supervision (Kessler-Harris 1982; Taylor [1911] 1967). The underpinning philosophy held 
workers to be naturally lazy, needing incentives and direct control to be optimally productive 
(Chandler 1977). Taylorist organization, then, relies on an antagonistic relationship between 
workers and management, where there exists no mutual obligation or binding commitment 
to the firm. Individual workers are easily replaced, cogs in a system. 

Since the 1970s, Japanese lean production emerged as an alternate form of organization 
based on teamwork, open job-classifications and cooperative rather than conflictive industrial 
relations practices. Early works praised Japanese-style lean production, a form of flexible 
specialization, for productivity and efficiency gains (Piore and Sable 1984; Womack, Jones 
and Roos 1990). As the organizational form spread to the U.S. and Europe, debates shifted 
to how distinctive and universal lean production methods really were (Cole 1979; Dohse et al. 
1985; Kenney and Florida 1993; Oliver and Wilkinson 1992). Moreover, as the darker side of 
work intensification, location in depressed areas and union discouragement surfaced in 
scholarly works, the desirability of such implementation was also questioned (Milkman 1991; 
Park and Slaughter 1988; Vallas 2003). Increasingly in the academic literature, lean production 
is seen to “cut both ways” with contradictory objectives, processes and outcomes. 
Nevertheless, in policy, business, and some academic circles, lean production continues to be 
seen as the unequivocal path to development. 

Japanese-inspired lean production is organized under three core principles: quality 
consciousness, teamwork and flexibility (Wilkinson and Oliver 1989). Together, they strive for 
the ultimate goal of kaizen, continuous improvement. Quality consciousness refers to quality- 
at-the-start, taking place at each step, not at the end of the line so that mistakes are corrected 
immediately. Assembly lines are reorganized around product families and workers into teams, 
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where rotation and further training push toward the goal of a multi-skilled workforce. Skilled 
workers can then use their ingenuity to solve problems and improve functioning on the shop 
floor. Managerial tasks of translating experience into scientific method can then be returned 
to the workers through teams. In this way, management relinquishes significant control over 
the labor process as quality, production and worker issues are dealt with by team leaders and 
team members in coordination with management. 

Finally, flexibility is built into the system by reducing inventory on the shop floor by 
pressuring suppliers to deliver parts just-in-time. Such a tight system demands quality and 
coordination to reduce waste and production time, freeing up capital for reinvestment. Client 
demands create a pull on a product stream that is followed as closely as possible. Smaller 
batch production, ability to reorganize the line and machinery, and multi-skilled workers 
respond to this goal. 

In the garment industry, lean production theories hold that reorganizing production 
processes through lean or modular assembly help lower production costs by decreasing 
work-in-process, inventory, through-put time, and rework (repair of pieces flagged for bad 
quality). As a result, productivity and quality improve, providing augmented responsiveness to 
clients in terms of turn-around time and model changes (Appelbaum et al. 2000; Abernathy et 
al.1999; Berg et al. 1996; Dunlop and Weil 1996; Bailey 1993). According to Berg and his 
colleagues (1996), under lean production workers are empowered to take control of the work 
process, having the challenge of producing a larger segment of the garment and focusing on 
issues of quality. Increased worker satisfaction arising from the trust and responsibility given 
to them, in turn, should translate into greater productivity. Furthermore, since teams are self- 
managed, middle management disappears, which lowers payroll costs and provides direct 
and constructive communication between workers and management (Tello and Greene 1996; 
Arthur 1994). 

An important method of securing increased efficiency was through Kanban markers, which 
perform a dual function. First, the markers direct JIT production by specifying which inputs are 
required for immediate orders on the shop floor. Only the immediately required pieces are 
supplied to each worker's station. Second, Kanban markers pull products through the line. The 
cards move upstream on the shop floor designating which operations are to be done and who 
is to perform them. In this way, inventory, quality and flow can be continuously monitored so 
that workers with bad quality are signaled out; again, continuous improvement at work. 

Japanese-style lean production has become so popular that many academics, policy- 
makers and practitioners have overlooked the particular ways in which lean production 
emerged in Japan. In order for workers to consent to working smarter (and many say harder 
as well), industrial relations in Japan developed in ways which connected interests between 
management, unions, workers and the state (Chalmers 1989; Cole 1979).’ Japanese 
employment relations secured consent of workers through life-time employment, seniority- 
linked wages and company unions (Cole 1979; Elger and Smith 1994). Yet as Kucera (1997) 
and Carney and O’Kelley (1990) have shown, life-time employment was offered to men in the 
big automobile companies while women made up a large portion of the part-time, contract 
and temporary workers in supplier firms. Furthermore, Japanese management enforced 
women’s life cycle patterns to move them out of work during economic downturns, 
enhancing industry flexibility and further denoting skill as a male characteristic. Therefore, a 
segmented and gendered labor market regime emerged as men were considered the 
breadwinner and women (occasional) supplemental earners (Besser 1996; Cole 1979). 

Although lean production established a less antagonistic regime than Taylorism, work 
practices that secured employment were not sufficient to manufacture consent in Japan. Life- 
time employment for men and weak organizational representation were buttressed with 
corporate welfare policies. Commenting on Japanese production models, Cole writes: 
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“Yet for an internalization of organization goals to take place, so that 
the individual identifies his personal success with organizational 
success, dependency per se is not adequate... Business firms had to 
establish a set of positive rewards to serve as incentives... 
contributing to the solution of the central life problems of the 
employees.” (1989; 250) 


The desired outcome of these management-led practices was to have workers see 
themselves as participating in a “community of fate” with managers. Thus, individual success 
is premised on company success, and vice-versa, requiring workers and managers to 
cooperate for a common goal (Besser 1996; Cole 1979; Graham 1995). The implications of 
internalizing the notion of shared vested interests in the company are that both managers and 
workers should then be willing to sacrifice for the good of the company, given that /n the end 
they believe it is in their interest as well. While Cole notes that “community of fate” is a 
management idea, it is important to highlight how the construction of this normative pact is 
used to control workers. Although rarely fully internalized, it was a convenient way to bundle 
workers’ interests with those of management in everyday practice. 

In reviewing the literature, other scholars have also encountered ideological 
manifestations akin to “community of fate.” In her study of Choc Co, Pollert finds that 
management changed their philosophy when they converted to teams. She writes: 


‘T]he agenda [became] to promote a ‘culture of business 
improvement’ (Production Service Manager) and shift from a paternalist 
‘we'll look after you, but don't think’ and ‘doing things for you’ to a 
participatory but unitarist ‘It’s a cold, hard world, we'll sink or swim 
together.’ ” (Factory General Manager) (1996: 182) 


Besser (1996) and Graham's (1995) studies in the auto factories organized around teamwork, 
found similar ideological management shifts. 

Vital to any organization are social norms that demarcate appropriate behavior; respecting 
these norms bestows legitimacy to both management and workers. Thus, respecting work 
place norms about recruitment, promotion, training, work practices compensation, and 
grievances is crucial in building trust and a “community of fate.” Hodson argues that 
managers must be good citizens at the workplace in order to develop the trust of their 
employees. Managers are good citizens when they “[abide] by norms concerning the 
treatment of employees and providing a workable technical system of production (2001: 
93)."° However, management violation of social norms can have serious consequences, 
especially under lean production practices that are premised on trust. The case studies 
discussed in Vallas (2003) and Pollert (1996) clearly demonstrate how mistrust turns into 
resistance. Moreover, empowered workers are dangerous. They can use the citizenship skills 
they learn at work to mobilize fellow workers against the firm (Hodson 2001; Vallas 2003). 


Organization of Production and Labor Process: Re-visiting Labor Process 
Theories 


Lean production theorists claim that by cutting-out middle management, allowing workers 
more autonomy and appreciating their knowledge, the relationship between managers and 
workers is converted to one of cooperation instead of antagonism (Appelbaum et al. 2000: 
Abernathy et al. 1999; Berg et al. 1996; Dunlop and Weil 1996; Bailey 1993). As Kaplinsky puts 
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it "...[LJabour is no longer seen as a cost of production which should be minimized, but as a 
resource which needs to be augmented.” (1994: 290) Labor process theorists have 
demonstrated the ways in which capital appropriates workers’ labor power through direct 
control, either by manufacturing consent or by reproducing gender and -race inequalities 
(Braverman 1974; Burawoy 1979; Knights and Willmott 1986; Lee 1998; Leidner 1993; Pierce 
1995; Salzinger 2000, 2003; Thompson 1989; Wardell, Steiger and Meiksins 1999). 

While Marxian theorists have made claims about how society is organized and feminist 
scholars have unmasked structures of gender inequality by unpacking the labor process on 
the shop floor, neither speaks directly to the labor process under lean production. 

That task has been taken up by British organizational scholars who use a Foucauldian 
framework to address the labor process under lean production and other flexible forms of 
production organization. Sewell and Wilkinson (1992), for example, peel away the layers of 
just-in-time labor processes to find that lean production techniques inscribe a control regime 
based on surveillance and discipline. Teamwork and the kanban replace management 
supervisors with more effective forms of information gathering and pressure that secure 
discipline and compliance to quality rules on the shop floor (Wilkinson and Oliver 1989). Using 
Foucault's concept of power as panopticon, Sewell and Wilkinson (1992; see Salzinger 2000 
for a feminist critique of labor process theories using this lens) note how the lean factory 
layout with little to no inventory stacked-up on the shop floor and prominently placed markers 
to show team productivity/rework statistics, physically allow for increased surveillance. 
Workers can no longer disguise the ways they use time and ingenuity to their advantage such 
as ‘making out’ (Burawoy 1979) or ‘goldbricking’ (Roy 1952). On the contrary, teamwork seeks 
to appropriate workers’ experience in the continuous improvement process of ka/zen. 

The practice of lean production at Moctezuma bears a striking resemblance to the 
negative aspects of teamwork mentioned above. However, British labor process theorists do 
not pay enough attention to Marxian notions of class struggle (Burawoy 1979), feminist claims 
of gendered structures (Lee 1998; Salzinger 2003) nor the influence of management practices 
(Hodson 2001). Workers and managers are both agents in the labor process. That is, while the 
panopticon did inscribe a pervasive sense of control among workers, they were not 
powerless. Workers | interviewed were aware they were being watched, the majority 
experienced work as a form of exploitation. Nonetheless, they consented to participate in this 
system because of the normative and economic bonds forged with the company through the 
management constructed “community of fate” belief. In the following section, | will discuss 
how labor processes at Moctezuma differed under Taylorist and lean production organization. 
Under the latter, the ways in which a “community of fate” belief developed and unraveled will 
be central to understanding how the teamwork structure facilitated mobilization, class-based 
solidarity, as well as the empowerment of women qua women. The resultant hybrid form of 
Taylorist production and its consequence for workers will then be examined, evoking 
questions for further research. 


Firm Strategies in a Globalized Economy: The Case of Moctezuma 


The history of Moctezuma parallels the history of Mexico's apparel industry, clearly reflecting 
twists and turns in state policy and global environment. The origins of Moctezuma date to 
1951 when an Italian enterpreneur set up a textile mill in Central Mexico. For the purposes of 
this study, the original mill will be designated as Italia Inc. Although capital, management, and 
designs were decidedly from Italy, it was legally formed as a Mexican enterprise to take 
advantage of subsidies and protections provided by the Mexican state for national industry. 
While initially Italia Inc. only produced cashmere wool cloth, in 1964 the company opened a 
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new division to use the cloth to fashion cashmere suits. After a protracted struggle over 
worker rights and an independent union at the company, in 1972 this new division became its 
own company, Moctezuma. 

A few words on unions in Mexico are in order. Unions in Mexico: have historically 
functioned as corporatist entities, under the control of the ruling party through labor umbrella 
organizations affiliated with the government. The vast majority of unions belong to the Labor 
Congress (CT)*, through which favors and pressures between state and labor leaders have 
been continuously negotiated. The strength of this corporatist system has been buttressed by 
automatic and obligatory affiliation of union members to the — until recently — ruling party, the 
PRI (Institutional Revolutionary Party). In 1990 this “forced mass affiliation,” as LaBotz calls it, 
was overturned (1992:41). Nevertheless the structure remains, as enormous pressure is 
wielded on individuals to belong to the party. Besides the “official unions” belonging to the CT, 
there are two other types of unions in Mexico: employer-promoted and independent. 
Employer-promoted, or “white unions,” can be found mostly in large and medium-sized firms 
in the export sector, where the presence of a union satisfies legal requirements while 
ensuring cooperation with company goals (de la Garza 1998). Independent unions, though 
small in number, have sporadically emerged as important local labor movements calling for 
union democratization (Collier and Collier 1991; LaBotz 1992). 

While the election of Vicente Fox in 2000 broke the 70-year uninterrupted rule of the PRI, the 
corporatist structure of unions — and the rejection of independent unions by the state — largely 
remains (Bensusan et al. 1998; Bensusan 2004). In addition to the corporatist nature of labor 
unionism in Mexico, the Constitution enshrines a structure of mediation that favors the state and 
capital. Although the Mexican Constitution and the Federal Labor Code are considered to be 
among the most favorable for workers, it is the responsibility of Local Boards of Conciliation and 
Arbitration to enforce the law. Representatives of employers, workers, and the state make up 
this tripartite Board that adjudicates the legality of unions, strikes and resolves labor disputes. 
The workers’ representative, however, is the “official” union, providing little to no voice for 
authentic worker concerns (Bensusan et al. 1998; LaBotz 1992). 

The undemocratic and pro-capital nature of the Local Arbitration and Conciliation Boards 
was evident in the 1972 strike at Moctezuma. During the 1960s and 1970s, the government 
of the Central state in question heavily promoted industrialization by establishing an industrial 
corridor through tax and infrastructural subsidies. Although national and international textile 
industries already existed in the area, their ranks were augmented with transnational textile, 
auto, pharmaceutical and chemical industries. With the rapid increase of factories came the 
growing number of workers. A wave of social movement unionism spread through the region 
from 1970-1974, fighting to replace government unions with independent ones like the FAT 
(Frente Auténtico del Trabajo).° The women at Moctezuma, at that time still Italia Inc., were 
an important part of this movement (Martinez Cruz 2002). 

The first day of June 1972, 110 temporary workers were fired from Italia Inc. Having 
already been influenced by the ongoing movement for independent unionism, the workers — 
through the elected CTM (Confederaci6n de Trabajadores Mexicanos) corporatist union — 
Staged a city-wide protest. By June 17, the fired workers had been reinstated as full-time 
employees. However, on August 11 of the same year the workers called another strike after 
the company violated the collective bargaining contract. Without legal merit, the Local 
Arbitration and Conciliation Board declared the strike illegal; this time the workers took over 
the Board offices. After a month of marches, demonstrations and countrywide disruptions 
due to the surging independent unionism movement, the company gave in to the demands 
for daycare and reinstatement of all workers. During this period the women at Italia Inc. 
replaced the corporatist union with an independent one belong to the COR (Confederacién 
Obrera Revolucionaria). 
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On May Day 1974, workers from several factories which had won independent unions 
called for a boycott of the traditional parade by unions, meant to show appreciation to the 
government. Over 10,000 workers, including the women of Italia Inc., deviated from the 
parade route and refused to pay homage to the government. A campaign of repression, 
coordinated by the government and entrepreneurial sectors, swept the region, clamping 
down on the wave of independent union movements (Basurto 1989; Martinez Cruz 2002). 
After the labor unrest was put down, the garment division of Italia Inc. became its own 
company, Moctezuma. The leaders of the independent union elected in 1972 was forced, 
behind workers backs, to realign with the corporatist CTM, becoming a management- 
dominated white union. Besides the return to corrupt unionism, not much else changed 
inside the factory until the mid 1980s to 1990s. Women workers continued to produce suits 
from Italia Inc.’s cloth, with the textile mill and garment factory side-by-side. Beyond the 
factory gates, however, the import-substitition development model which protected national 
firms was giving way to export oriented free market policies. 

When the Mexican economy began to open its doors to foreign investment and push 
exports in the 1980s, the Italian management of Italia Inc. and Moctezuma took advantage of 
new regulations. High levels of investments poured into the companies, specifically for 
technology at the textile mill (in 1984) and computers for the Design Department at the 
garment factory (in 1990). In 1984, for the first time since Moctezuma’s inception, products 
were exported to the U.S. and Europe. Given this, it should come as no surprise that the 
companies had been able to survive the shift from import substitution to export-led 
industrialization. Both companies had always maintained a direct connection with Italian 
capital and designers and had made a smooth transition with the liberalization of the 
economy to become a leader in production for the national and export markets. 

In 1993 Italia Inc. and Moctezuma were put on the auction bloc. According to several high 
level managers, the Italian owner of the industrial group died and the board decided to sell 
all overseas investments. Two high profile Mexican industrialists become majority owners of 
the two firms, which were then incorporated as a single industrial group, which | will call 
Mexico Inc. Although NAFTA did not become law until the first day of 1994, industrialists in all 
three participating countries were already positioning themselves to take advantage of the 
new trading rules. Therefore, it is reasonable to think that an experienced and successful 
exporting firm taken over in 1993 would have been a prize acquisition. 

Taken over just as NAFTA went into effect, the new Mexican majority owners wasted no 
time in making their intentions clear: turning the firms into cutting-edge world-class 
producers. One of the first steps taken in 1994 was to move toward ISO (International 
Organization for Standarization) certification, which ensures consistent international quality 
levels. Both firms — which retained their names, management, and workers — retooled 
production processes to incorporate quality controls throughout the process. Obtaining ISO 
certification provided a stamp of approval which improved the possibility of landing and 
retaining high-end clients. Interested in sponsoring regional development, state government 
training programs and subsidies were made available for certification. At the same time, 
capital was invested in the garment factory to computerize some key sewing operations. New 
national ownership proved ambitious and visionary. One year after purchasing the textile and 
garment firms, the new industrial group set up a series of store outlets for their own national 
brands inside Mexican high-end department stores. The goal was to conquer not only the 
export market but the national one as well. 

In order to upgrade industrially, a plan was needed: reorganization of production around 
lean techniques and verticalization. On a state-entrepreneurial tour of innovative firms in 
Europe organized by the Mexican state, one of the majority owners of Moctezuma visited 
European firms where he witnessed Japanese-style lean production at work. Accounts point 
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to this being the spark which started the dramatic reorganization of production at 
Moctezuma. In 1995 the industrial group invested heavily in the services of a high profile 
international consulting agency, which also proposed Japanese-style lean production. 
Management provided me with a copy of communication and planning manuals that 
document the transition to lean production. In these pages it is clear that the international 
consulting agency, which | am calling American Consultants, spearheaded the process, 
inculcating the rules of lean production to a team of managers. The team of managers 
together with the consultants, then planned and implemented the changes on the shop floor. 

Reorganization around lean techniques at Moctezuma had four main elements: self- 
managed work teams, elimination of mid-level supervisors, increased information sharing 
regarding production with workers, and focus on quality. Another crucial element was the 
presence of a compliant union. (See Plankey Videla 2005 for an.examination of why 
Moctezuma reorganized around lean techniques). 


Transforming Work at Moctezuma 


Moctezuma, like most garment industries throughout the world, had previously been 
organized under Taylorist principles through the progressive bundle system (PBS). While | did 
not witness this form of organization, | have reconstructed its outlines from interviews and 
secondary sources. At Moctezuma this system consisted of sewing lines organized so that an 
operator performed one task at their own pace and then passed bundles of work to the next 
operator. Ideally, there would be a buffer of 30 garments between operations to deal with 
differences in speed, absenteeism and turnover. Garment workers aimed at individual 
proficiency and speed at one simple task that was repeated over and over again. Hence, piles 
of bundles accumulated between operators, allowing the assembly line to flow as each 
worked at her own pace. Of utmost importance in this system was speed, as each task was 
measured in standard allocated minutes setting quotas for daily production. Piece-rate 
payment was based on reaching and surpassing the quotas set by industrial engineers. Thus, 
work pace was individually regulated as women could exploit themselves to earn a higher 
wage (Collins 2003), 

An important element of the progressive bundle system at Moctezuma (and Taylorist 
production in general) resided in controlling workers and work through several layers of 
supervisors. With the assistance of a senior industrial engineer and multiple interviews with 
workers, | mapped the structure of the Jackets Department, revealing four layers of 
supervisors under Taylorism at Moctezuma. At the top of the organizational hierarchy stood 
the plant manager, who oversaw workers directly by roaming the shop floor and indirectly 
through department managers. In Jackets, the department manager was responsible for five 
area supervisors, who in turn, watched over a line supervisor. In total, under PBS the Jackets 
Department had eleven supervisors. 

Another mechanism for control in Mexico, the company-promoted union or “white” union 
was also present in the company (LaBotz 1992). The union had two main responsibilities, 
recruiting workers and solving personal worker problems. In terms of recruiting, management 
forwarded their desired profile; the union put out radio announcements. Final decisions, 
however, resided with the Human Resource Department. Women of all ages and levels of 
experience were hired. Until 1996, men were only allowed in the Cutting Department, which 
managers justified by requiring higher stature due to equipment. 

The other role played by the union, responding to workers’ personal problems, 
strengthened paternalistic ties of loyalty. According to workers, the union was more 
concerned with collecting money for a funeral, lobbying for an unjustly jailed family member, 


On the Shop Floor in a Mexican Garment Factory * 2109 


or settling a love triangle gone awry than dealing with issues like seniority, work rules or pay 
discrepancies. Although technically under the purview of the union, management dealt with 
work issues. 

When Moctezuma management decided to become a lean garment factory in 1996, they 
were in large measure responding to pressures from clients to produce at higher quality levels 
and through JIT organization (Bonacich and Appelbaum 2000; Nebel 2003; see Plankey Videla 
2006 for an examination of how teamwork was gendered at Moctezuma). Lean production, as 
mentioned previously, is centered around producing high-quality goods with a multi-skilled 
workforce that reduces waste and job redundancy. The layout of the shop floor at Moctezuma 
was changed to reflect such changes in philosophy. Instead of individual work stations on 
assembly lines, workers were organized into teams, most often in a clearly denoted space, 
and trained to operate several machines or perform several operations. Kanban markers 
pulled products through the line as inputs arrived through JIT arrangements. Without 
unnecessary warehousing of inputs or buffers of work on the shop floor, only immediate work 
orders and workers were visible making it easier to pinpoint difficulties. 

However, to convince workers to accept an increase in work responsibilities and alteration 
in work routines, the company had to validate the change in both ideological and material 
ways. At Moctezuma, this was accomplished through the idea of a “community of fate,” 
fostered through corporate welfare programs which demonstrated the company cared for the 
workers as human beings, not just employees. The ideology of a “community of fate” was 
cemented with an onsite clinic staffed by a nutritionist, psychiatrist, dentist and family 
physician who offered free care not only for the worker but also her family. The doctors were 
especially proud of the yearly dental programs, diabetes screening and women's health 
campaigns. Special workshops and one-on-one counseling were also available for drug 
addiction, family violence and birth control. In the context of high unemployment, under- 
employment and few state safety nets, such policies were more than generous; they 
resolved many immediate problems for workers. Moreover the company sponsored activities 
to deepen social norms of dependency and attachment such as athletic teams, cultural 
events, summer craft workshops for worker's kids and the most popular, primary and high- 
school equivalency classes. 

In effect, the company replaced and greatly enhanced the paternalistic programs 
previously provided by the union. The underlying message of Moctezuma’s management 
specifically (and Japanese-style lean production more generally) was that the adverserial role 
of the union had been superseded by coordination and trust between management and 
workers. As the firm transitioned from Taylorist to lean production techniques, the union 
continued to play a facilitating role. Although no longer the main channel of communication, 
unioin officials remained important interlocutors for workers and were directly involved in 
hiring and solving worker's personal problems. In terms of hiring practices, women were still 
the favored worker. Perhaps the biggest change in worker profile under lean production was 
that older women with garment work experience were preferred as their experience, 
knowledge and ingenuity, which could be harnessed by teams, was officially recognized. 

Another important way to construct a “community of fate” at Moctezuma included 
flattened hierarchies on the shop floor. With the introduction of lean production, the number 
of supervisors decreased; their authority transformed. Line supervisors were replaced with a 
diminished number of “advisors,” who, differently from the old supervisory model, required 
permission from teams to enter into a teams’ work area.° The goal of eliminating the middle 
supervisory level was to cut costs and demonstrate confidence in workers’ capabilities. The 
number of supervisors in the Jackets Department was cut from eleven to three (department 
manager and two advisors), with team leaders playing an intermediary role between 
management and workers. Team leaders, who rotated every three months according to 
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seniority, met with managers every morning to receive the previous day's production and 
quality report and plan out the day's work. It was then the team leaders who communicated 
this information to the teams. As self-managed teams, it was up to the teams to organize 
themselves to reach production and quality imperatives. Thus, teams decided how to cover 
for sick or absent workers, determined who received permits for sickness and family 
emergencies, and made sure production and quality did not falter. Workers were constantly 
reminded that self-managed teams were proof of management's trust. 

Although symbolic in nature, other practices helped construct the “community of fate” 
belief. For example, both managers and workers ate at the same cafeteria (though rarely at 
the same table) and mingled at company celebrations. Christmas and the Day of the Virgin 
of Guadalupe became occasions on which both managers and workers brought their families 
and sat together without distinction. The jovial “dance with the advisor” seemed especially 
effective at breaking down status and gender hierarchies. 

Other changes made at Moctezuma further emphasized the importance of quality and the 
trust the company placed in the workers. Under the new lean system, each operator checked 
their own quality and that of the workers before them, acknowledging the worker's skill and 
capability. Moreover, with the new-placed accent on quality rather than quantity, payment was 
no longer based on piece-rate but rather on a base wage with bonuses for skill, seniority and 
leadership.’ Incentives for producing above the weekly quota also enhanced wages 
significantly, sometimes surpassing half of a week's wage. In effect, work loads and 
responsibilities increased, but so did take-home pay and benefits. 


Transforming the Work Experience 


The new system of lean production first appeared to work successtully. Quality performance 
soon increased, while production remained level.® Workers took advantage of the opportunity 
to learn new skills, achieving the goal of a multiskilled workforce. In terms of wages and 
benefits, Moctezuma was considered one of the highest paying garment factories and one of 
the best places to work in the area. Why, then, did workers break out on strike five years later 
in March of 2001? | will argue that the organization of production significantly changed the 
labor process and the way women and men experienced work. In the end, such changes 
undermined the ideology of “community of fate,” without which workers were no longer 
willing to participate in the new system. 

Essential to understanding worker resistance is the way many women and men 
experienced teams and promotions. Instead of being understood as empowerment, 
teamwork became code for pervasive control. Under the PBS, workers regulated their own 
work pace, allowing either for self-exploitation or a leisurely rate. However, under lean 
production every worker was responsible for every other workers’ performance regarding 
quality and quantity. Therefore, instead of a few supervisors, everyone became an unwitting 
worker-supervisor. Moreover, since incentives were based first on the team, and then the 
entire shop floor surpassing pre-set quotas, pressures to increase the work pace were 
continual sources of discord. Markers of team performance surfaced all over the shop floor as 
white boards were hung from the ceiling over each team with an inventory of output 
tabulated every two hours. Slow workers were reprimanded in team meetings, often being 
ostracized from group activities. 

Reorganization of production also had an impact on the internal labor market. Promotions 
had previoulsy been allocated based on experience and performance. Most of the line 
supervisors under PBS at Moctezuma were older women who had risen through the ranks. 
Under lean production, technical knowledge was rewarded. Newly graduated industrial 


On the Shop Floor in a Mexican Garment Factory * 2111 


engineers — all but one male — took over the positions of advisors and managers, effectively 
ending promotion opportunities for workers, especially women. 

Lean production was not whole-heartedly accepted by most workers, given the increased 
control by teams and loss of promotion opportunities. However, teamwork was not, as the 
union leader in the introductory story believed, the sole cause of the strike and ultimately, the 
failure of the firm. The negative experiences of teamwork were necessary but not sufficient 
conditions for worker mobilization. | argue that the “community of fate” ideology kept the 
shop floor running. One particular incident makes this point abundantly clear. 

On the morning of Feb. 22, 2001, workers arrived at the factory to discover that the 
previous night, management had announced it was eliminating the second shift. Half of the 
workers in the night shift would be fired; the rest integrated into the first shift. Panic and anger 
spread through the shop floor. Soon team meetings could be seen all across the shop floor as 
they decided whether or not to go on a wildcat strike in protest. In the end, workers decided 
to delay the decision until they had more information. Underlying the heated conversations 
was an understanding that Moctezuma was facing an economic crisis, and that cooperation 
rather than antagonism should be the first strategy to weather the storm.9 

The fact that the workers so quickly decided to protect the company in spite of their 
resentment towards management's decision to fire the second shift is evidence of the 
“community of fate” belief. Shielding the company from a wildcat strike was seen as in each 
worker's personal interest since their future was closely tied to that of the company. Further 
evidence is that half an hour later, the team | worked with was asked to work harder to 
complete an urgent order. They did. Although angered by the unilateral action taken by 
management, which violated crucial social norms of job security and legitimate use of power, 
my team had consented to work harder. Nonetheless, the ideology of “community of fate” 
had been weakened. From this point forward workers scrutinized management actions and 
words, seeking to uncover the level of the company’s commitment to the workers. 

In March 2001, management's actions betrayed workers’ trust. March 15 was the deadline 
for the bi-annual ratification of the contract. The previous year management had asked 
workers to forgo some fringe-benefits (such as gifts of detergents, soaps and Christmas 
baskets) in order for the company to make use of the capital for needed investments. These 
investments, it was claimed, would enhance the company’s market share, devolving 
immediate benefits to workers. The union, after a member vote, agreed to freeze benefits for 
a year. Yet the year was up — and management reneged on its promises. The sense of betrayal 
on the shop floor was palpable. The company had lost all legitimacy in the eyes of the 
workers; the “community of fate” belief constructed through enhanced corporate 
paternalism, high wages and discourse of the company being a family had been definitively 
fractured. It was at this point that many felt action was required. 

At the union general assembly women took the lead calling for a strike. Since 
management had reneged on the benefits and they refused to raise wages, they could not be 
trusted. Debate centered on the issue of single mothers: most of the women on the shop 
floor were believed to fit into this category and they would suffer the most if the factory went 
on strike. Several men stood up to “defend” single mothers against the financial catastrophe 
of a strike. Some women agreed with them. But surprisingly, a larger number of single 
mothers stood up demanding a strike. Their sense of exploitation and betrayal 
overwhelmingly convinced the packed hall. 

While at first glance the rapid concurrance in favor of the strike seems anomolous, it is not. 
Rick Fantasia’s (1988) work on the U.S. labor movement and close examination of strikes at 
three sites, demonstrates how consensus rapidly forms in an atmosphere of conflict with 
management. A culture of solidarity which fosters class-based bonds and consciousness 
develops through the process of antagonistic struggle for worker rights. Often the language 
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of moral economy is evoked, claiming past work arrangments as rights to be defended at all 
costs. The same occurred at Moctezuma. 

When it was evident that Moctezuma’s management had irrevocably broken the social 
pact — the constructed “community of fate” — workers’ anger coalesced into organized 
resistance. The ability to be organized as self-managed teams, earn bonuses, and refuse entry 
to advisors into team areas were codified as rights which could not be taken away. When 
single mothers voiced agreement with this language of rights, the struggle became framed 
as one between power-hungry managers against vulnerable single mothers, who stood up for 
their traditional role of mothers (and given their class background, providers). At Moctezuma 
the culture of solidarity took on the rallying call of protecting single mothers. Ironically, it was 
in the name of traditional motherhood that women’s resistance was justified, and in some 
cases led to emancipatory journeys. On the other hand, the company was publicly chastised 
for abandoning their role of provider and protector. 

In response to management's breaking the social pact with workers, the latter used new 
work techniques learned under lean production to resist. Under lean production, the 
workforce was divided into teams, each with a leader. Meetings could be called by leaders 
at any time of day, and leaders were free to move around the shop floor since they were 
responsible for coordinating with other departments and team leaders. Furthermore, leaders 
gathered daily in departmental meetings and weekly in the union office, providing 
opportunities for them to become acquainted and share opinions. Ironically, such a structure 
of communication enhanced the worker's ability to mobilize on the shop floor. 

According to interview subjects, there have been seven to nine work stoppages from 1996 
to 2001, when teams were operational. Under the PBS there were no more than one or two 
work stoppages. Only the last work stoppages under teamwork occurred at a time when the 
firm experienced economic difficulties. Thus, the main difference between these two periods 
was the work organization. Interview data are reinforced by my own experiences on the shop 
floor. As mentioned earlier, on March 22 it was announced that the second shift was going to 
be eradicated. Team leaders called meetings all over the shop floor. While each team 
discussed what actions to follow, team leaders walked around gathering and sharing 
information. In ten minutes, our team had decided, taking into account what we had learned of 
the other teams’ deliberations, to wait and see what happened next. A network of effective 
communication between teams, as leaders facilitated discussions within teams, was apparent. 

The team structure also helped workers deal with the free-rider problem during 
mobilizations. After the strike, management and workers waged a daily battle in the streets 
and in the media. Workers had decided that to combat management's wage and benefits 
cutbacks, a new executive committee had to be elected. The problem, once workers chose 
their new union representatives, was that just like in 1972, neither the company nor the Local 
Arbitration and Conciliation Board would accept their choice. For months after the union 
election, workers would still gather outside the factory gates after the first shift to march to 
the Board's or the governor's office. In order to assure that workers attended marches, team 
leaders picked up company identifications from their team members, to be returned at the 
end of the event. If at the end the leader still had your ID, he or she knew you stepped out. 
Lists were kept of those who did not support the movement, discouraging free riders. 

In May 2001, 47 leaders were fired. In response, workers decided to work at 50 percent 
Capacity. The slowdown lasted two weeks — but it could not have lasted that long without the 
support of team leaders who kept an eye on their teams’ production levels, pressured those 
who did not want to go along and defended others against management backlash. While it 
was possible for a team leader not to go along with a decision the union membership had 
made, it was not probable. Teams could and did remove team leaders. The first team | worked 
with removed the newly rotated leader who was on the side of management. The team asked 
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for her removal, but given the tense situation on the shop floor, management refused. Hence, 
she was harassed and insulted to such a degree by her teammates that she could not 
function as team leader. She was fired. Regulations set forth by the team system worked 
against management and strengthened the workers’ efforts to be autonomous. 

Another unintended consequence of team structure was to facilitate solidarity among 
team members during mobilizations. During the strike, 12-hour shifts were organized by 
teams to guard all entrances to the factory. At this time, it was my first week with a new team 
in Jackets. We were responsible for guarding the door to the Human Resource office. Those 
hours spent together brought everyone closer, even though this team was already a relatively 
close-knit group. However, the other team | had worked with consisted of several factions that 
did not speak to each other. After several months of marches, demonstrations, and other 
mobilizations, they had become “compaferas” or companions. For the first time since they 
became a team in 1996, they gathered after work to celebrate, on this occasion, Mother's 
Day. As we sat laughing and eating, discussion turned to strategy for another union election. 
Clearly, this group of women that began as enemies had forged an identity as fellow workers 
that they had not exhibited only a few months earlier. Not only did they support each other but 
they also had acquired a culture of solidarity through the mobilizations. While teamwork did 
not foster such camaraderie or consciousness, the structure of teams facilitated 
mobilizations. Since teams organized actions during the mobilizations, there was a structure 
available that encouraged alliances based on worker issues instead of individual interests. 

Finally, several women commented that the contention of shop floor production methods 
profoundly affected their self-perception. Originally, women were sought as cheap, docile 
workers who would not contest management decisions. Many workers saw themselves in 
this way as well. However, working in teams provided an experience of leadership and 
solidarity that transcended factory walls. Women, who made up over 70% of the workforce 
at Moctezuma, were the first and staunchest supporters of the strike. Participation in the 
mobilizations empowered them to challenge unequal relations at home as well as at work. For 
many, simply participating in the mobilizations meant going against their parents or husbands’ 
wishes. Moreover, this newly found strength allowed many to see a possibility in changed 
gender relations. One case is particularly illustrative. After being fired, Maria, a young woman 
who had been elected to the strike committee became the first female delivery driver in town 
because as she put it, ‘now | know | can do anything.’ Unwittingly, the team system had 
unleashed a wave of personal transformation and resistance. 

Management also noticed the way team structure was subverted by workers. In August 
2001, the factory closed for a few days as the shop floor was reorganized around new rules. 
First, teams were eliminated. Without teams, there were no team leaders who could freely 
roam around the plant talking to other team leaders. Without teams, there were no more 
meetings excluding management from worker decisions. In effect, the new organization, a 
hybrid between iean production and Taylorist progressive bundle system, retained elements 
beneficial to company goals and discarded features that allowed worker autonomy. 
Comparing it to the previous lean production organization, the shop floor under hybrid 
Taylorism combined assembly lines with modular layout in order to best respond to 
production requirements. Some areas remained the same while others changed in ways that 
made linear assembly possible. Production was still pulled with Kanban cards and using JIT 
inventory, essential components of lean production that speed up the work pace and require 
closer communication and coordination between operators and with suppliers. However, 
issues of supervision and hiring practices starkly reverted to Taylorist principles. 

Under hybrid Taylorism at Moctezuma, direct control returned reinforced. A parallel system 
of quality supervisors was added to the recently augmented production management staff. 
Thus, instead of the three supervisors in place in the Jackets Department, there were SIX 
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production and three quality supervisors. Moreover, management set up quality checkpoints 
along the line, spending their days in the midst of workers instead of in the offices on the 
shop floor perimeter. Supervisory control was physical, visible and antagonistic. Interestingly, 
the only area where management presence was reduced was at the top level. Previously 
there had been one general production manager and one quality manager who oversaw the 
shop floor; now only one production manager called the shots. 

Even the friendly company union changed. No longer were there union offices in the 
parking lot building, nor were workers chosen to be delegates. Management had recruited a 
representative of a national prepared food and beverages union. Not coincidently, this union 
was present at many large transnational corporations in Mexico, minimally fulfilling the legal 
dictate of union representation. A woman with no knowledge in the garment industry came 
to town from nearby Mexico City for one day, every two weeks. She sat in the Human 
Resource office waiting to see if any workers wanted to talk to her. Not many did. 

Most telling were the changes in hiring practices. Whereas Moctezuma had previously 
hired women (and some men) of all ages, preferably with experience and high school 
education, they now actively and exclusively sought young women between the ages of 18- 
25, with no previous work experience, and low levels of education. Every week new groups 
of workers — usually those with seniority or those especially active during the strike — were 
fired. Young women, working outside the home for the first time, quickly replaced them. 
Comments made by management point to them perceiving young women as more malleable 
and docile. Furthermore, past experiences led them to believe they would be a transitory work 
force as young women often leave when they marry and have small children; a transitory 
workforce is less likely to organize. These changes in hiring practices signaled a desire to 
highlight compliance and docility over experience and quality, crucial elements of the original 
formulation of lean production (See Table 1). 


Conclusion 


| have argued that the concept of “community of fate” (Cole 1979) crystallizes why workers at 
Moctezuma consented to work harder and smarter under lean production. In Japan, where 
these production methods were pioneered, consent was built on the practices of life-time 
employment, seniority-based wages, company unions, and corporate welfare policies. At 
Moctezuma, stable and relatively high wages replaced life-time employment.” Seniority and 
skill underpinned wage arrangements and the union worked only in the confines of the firm. 
Likewise, management at Moctezuma made every effort to prove its commitment to workers 
by providing services beyond the legal requirement, which responded to particular needs of 
the working population and their families. These practices developed as social norms 
governing the interactions between management and workers. 

However, the labor process under lean production proved to both benefit and constrain 
workers. Jobs did pay well, but the work environment deteriorated as the pace quickened and 
the perception of surveillance augmented. Moreover, it was clear to women that there were 
no options for promotion. While before they could see themselves working at this factory 
their entire career, the combination of the work environment and lack of promotion 
opportunities made management's manipulation of loyalty transparent. Yet workers remained 
good citizens, producing at the high quality standards and production levels requested by 
management. In 2001, in response to market downturns, management broke the norms of 
shared trust. Had management continued being good citizens, as they had done for five 
years, it is likely workers would have responded in kind. Instead, recognizing the break in the 
management sponsored “community of fate,” workers used the tools of lean production to 
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undermine the firm. That is, they used practices such as coordination and teamwork to 
mobilize resistance. 

Lean production provided a structure of teams and team leaders that could more easily set 
in motion worker discussion and decision-making regarding social movement actions. With 
management's authority circumscribed, and team leaders able to move freely around the shop 
floor, coordination was swiftly achieved. Team structure also facilitated the empowerment of 
many women as they took their turns at being.team leaders and later mobilized to fight the 
company, the Local Arbitration and Conciliation Board and the local police. 

In the end, this model factory which had successfully competed internationally based on 
high quality did not look much different from any other subcontracting assembly plant. 
Harkening back to Taylorist principles of control, Moctezuma no longer hired experienced 
women of all ages. Instead, management reverted to the “docile,” “nimble fingered” young 
inexperienced women made famous in export processing zones. 

When managers reorganized the factory around hybrid Taylorism, which incorporated the 
control of Taylorism and organizational innovations of lean production, workers lost out. From 
workers’ point of view, it was not much different from Lipietz’ (1982) “bloody Taylorism” or 
Collins’ (2003) “hyper-Taylorism.” However, in the process of resisting management's 
changes, workers’ created a culture of solidarity that. propelled women and men to value 
themselves as workers and individuals. For women, especially, lean production provided an 
experience of leadership and acknowledgement of their skill and ingenuity that empowered 
many to challenge injustice and gender norms that called for submissiveness. 

In the case of Moctezuma, the “community of fate” ideology that underpinned lean 
production did cut both ways. The “community of fate” ideology constructed by managers — 
with its discourse of loyalty and sacrifice and its buttressing corporate welfare programs — 
convinced workers to extend their physical, intellectual and emotional labor to the firm. What 
managers failed to understand fully was that in workers’ eyes, the “community of fate” belief 
also tied the firm to the workers. When management reneged on this social pact, workers not 
only resisted management's efforts to regain control over the shop floor, but also actively 
used teamwork to thwart the firm’s economic viability. Instead of controlling workers, in the 
end lean production facilitated worker radicalization and mobilization. 

It would be incorrect to conclude from this one case of failed industrial upgrading that only 
sweatshops are viable in the Third World. On the contrary, the case of Moctezuma does not 
prove that lean production or other high-road options are not possible. Rather, this case 
demonstrates that lean firms are complex organizational settings. By extending Cole's (1979) 
concept of “community of fate”, | have demonstrated that greater attention must be given to 
the social relations accompanying lean production implementation and the roles that 
managers play in its success. In these times of economic volatility, not only workers but also 
managers need to work smarter, harder, and be good citizens at the workplace. 


Notes 


1. The name of the company, as well as that of workers and managers, has been changed to 
protect their anonymity. Permission was obtained from management, union and workers 
to conduct research. 


2. While there is variation in Japanese management practices and the implementation of 
lean production in Japan, scholars have pointed to the three elements discussed (quality 
consciousness, teamwork and flexibility) as being pillars of Japanese lean production 
industrial relations (Cole 1979; Oliver and Wilkinson 1992). 
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3. Worker citizenship behaviors are those that “improve productivity and cohesion in the 
workplace, which are above and beyond organizational requirements.” (Hodson 2001:45-46) 


4. The CT serves as an umbrella organization for three confederations of workers, CTM 
(Confederation of Mexican Workers), CROM (Regional Confederation of Mexican 
Workers), and CROC (Revolutionary Confederation of Workers and Peasants). Of these 
three the CTM accounts for the majority of unions. See Bensusan 1998, Collier and Collier 
1991, and LaBotz 1992 for further details. 


5. The FAT, with its roots in Catholic social teaching, brought together workers, students and 
church groups. They act more like a movement than a union, building coalitions and 
focusing on issues of justice and democracy (Basurto 1989; LaBotz 1992). 


6. In practice, advisors roamed the shop floor solving problems. They did not ask for 
permission but only stayed away if explicitly told by an entire team they were not 
welcome. Since this was rare, this provision did not greatly affect their authority on the 
shop floor. 


7. \thank Jane Collins for pointing out that this can also be considered a modified form of 
piece-rate. 


8. Given the learning curve of working in teams, the fact that production remained stable 
points to its initial success. 


9. The labor market conditions also must be taken into account. Scholars have shown that 
workers are more likely to align their interests with that of capital in times of economic 
uncertainties (Hodson 2001; Vallas 2003). Until the U.S. recession in early 2001, however, 
the labor market opportunities had not changed significantly. If anything, the signing of 
NAFTA and the resulting shift of jobs to Mexico strengthened the hand of labor. 


10. The state (like Japan after WWII) had abundant labor and a history of contentious relations 
between capital and workers. A tenuous class compromise was reached in the 70s. 
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Abstract 

Students living in inner city and rural areas of the United States exhibit lower educational 
achievement and a higher likelihood of dropping out of high school than do their suburban 
counterparts. Educational research and policy has tended to neglect these inequalities or, at 
best, focus on one type but not the other. In this article, we integrate literatures on spatial 
stratification and educational outcomes, and offer a framework in which resources 
influential for achievement/attainment are viewed as embedded within, and varying 
across, inner city, rural and suburban places. We draw from the National Educational 
Longitudinal Survey and the Common Core of Data, and employ hierarchical linear and 
hierarchical logistic modeling techniques to test our arguments. Results reveal inner city 
and rural disadvantages in both family and school resources. These resource inequalities 
translate into important educational investments at both family and school levels, and help 
explain deficits in attainment and standardized achievement. We conclude by discussing 
the implications of our approach and findings for analyses of educational stratification 
specifically and spatial patterning of inequality more generally. 


Educational success, measured by achievement on standardized tests, has become the 
central focus of U.S. educational politics and particularly the No Child Left Behind Act. Indeed, 
much of the current political rhetoric revolves around evaluating the competency of teachers 
or schools, introducing “competition” and “choice” to current public schooling options, and 
increasing mandatory student testing (Renzulli and Roscigno 2005). The assumption is that 
such changes within public education will somehow enhance student achievement. What 
remains overlooked, however, is the fact that “poorly performing” schools tend to serve the 
most impoverished populations in the United States and are located in the most 
disadvantaged rural and urban areas of the country (College Entrance Examination Board 
1995; U.S. Dept. of Education 1992, 1997; Lichter, Cornwell and Eggebeen 1993). 

In this article, we develop a conceptual model to explain local advantages and 
disadvantages and their role in generating, and perhaps reproducing, inequality. Prior 
educational research has laid the foundation by highlighting two key institutional spheres, the 
family and the school, that influence educational performance (e.g., Alexander, Entwisle and 
Thompson 1988; Roscigno 1998; Teachman 1987). We build upon this literature and embed 
its emphases within a theoretical account of rural, urban and suburban places and the 
educational opportunities they afford. Moreover, we differentiate between the resources 
available to students and investment decisions made by parents and schools within the 
constraints of resource availability. This distinction is theoretically important as it makes 
explicit our assumptions regarding institutional (i.e., family/school) decision-making and, 
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perhaps most central, its constraint within the boundaries of local context and resource 
availability. (See also Steelman and Powell 1991.) Drawing from a nationally representative 
sample of U.S. high school students, analyses examine resource availability, investment 
inequalities and implications for achievement and attainment among urban, rural and 
suburban adolescents. 


Education & Resource Inequalities 


Inner city and rural areas differ substantially from suburban places in the availability of 
resources, owing largely to the nature of the local economy — a fact denoted by a wide body 
of scholarship on labor market transition, economic development and spatial stratification. 
Urban inner city areas, once the bastion of U.S. manufacturing, have witnessed 
decentralization and capital flight since the 1950s. Frequently, what remains is a poor and 
often minority population increasingly dependent on low-wage and relatively unstable service 
sector jobs (Brooks-Gunn, Duncan and Aber 1996; Wilson 1987; Kasarda 1987). Rural areas 
throughout the country too have been shaken by industrial restructuring, especially in 
agriculture, which has increased rates of unemployment, dependence on part-time service 
sector work, and family economic vulnerability (Duncan 1996; Lichter 1989; Lobao 1990; 
Meyer and Lobao 1997; Tomaskovic-Devey 1987). Suburban neighborhoods, in contrast to 
the aforementioned negative scenarios, tend to be populated by educated, two-parent 
families and have lower rates of poverty. It is arguably easier to educate children in such a 
context given the resources that both schools and families bring to bear on the educational 
process and on potentially influential educational investments (Roscigno and Ainsworth- 
Darnell 1999; Steelman and Powell 1993; Teachman 1987). 

Familial resources in particular are shaped by local conditions and the stratification that 
they produce. Indeed, family socioeconomic status and the economic well-being of children 
are depressed in inner city and rural areas — a condition that has only intensified in the past 
three decades (Hoppe 1991; Lichter and Eggebeen 1992; Wilson 1978, 1987; Tickamyer and 
Duncan 1990). This has important implications for educational success. As educational 
research has repeatedly shown, socioeconomic status of a student’s household is 
consistently influential for both achievement and attainment (Alexander, Entwisle and 
Thompson 1987; Guo 1998; Mehan 1992; Teachman 1987). This may be due in part to the 
influence of economic well-being on cognitive development early in the educational trajectory 
(Brooks-Gunn, Klebanov and Duncan 1996; Guo 1988; Moore and Snyder 1991). Clearly, this 
relationship is also a function of the amount of time and money parents have to invest in their 
children’s cultural skills and schooling. Middle- and upper-class parents can hire tutors and 
make time to meet with teachers; they are more likely to utilize “proper” English in the 
household; and, they tend to structure their children’s leisure time in educationally meaningful 
ways (Lareau 1989; 2002). ; 

Family structure is also patterned by local economic opportunity, most obviously through 
family economic distress and unemployment (Anderson 1990; Meyer and Lobao 1997: 
McLaughlin, Gardner and Lichter 1999). Though rural families are more likely to have two 
parents, rural trends in non-marital childbearing and family dissolution now mirror those of 
inner city areas (Lichter and Eggebeen 1992). This is problematic given that children in single- 
parent or step-parent families do not do as well in school on average (Downey 1995b; Krein 
and Beller 1988; McLanahan and Sandefur 1994). Proposed mechanisms include potential 
household turmoil (Hetherington, Cox and Cox 1978; Sandefur, McLanahan and Wojtkiewicz 
1992) and further limitations on resources (Thompson, Entwisle and Alexander 1988: Zill 
1996). 
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Just as local economic opportunity influences family socioeconomic status and structure 
through the availability and quality of jobs (Garrett, Ng’andu and Ferron 1994; Lichter 1989; 
Kasarda 1987), so too does it shape the availability of school resources through the 
concentration of poverty and the generation of educational revenue through local property 
taxes (Roscigno 1995; Wenglinsky 1997; Wise and Gendler 1989). Simply, lower property 
values join with a relatively high tax burden and a lack of corporate tax revenues to produce 
a smaller economic base for educational spending (DeYoung 1987; Reeder 1988; Wilson 
1978; Logan and Molotch 1987). Indeed, many city school districts across the country are 
struggling to stay financially afloat in the face of shifting and declining tax bases. Rural school 
districts remain even more vulnerable, it seems, given smaller and poorer populations. This 
perhaps explains why rural locales are leading the charge in terms of school funding reform 
and litigation. 


Consequences for Educational Investments 


It is important to conceptually disentangle the availability of family and school resources 
across inner city, rural and suburban places from the educational investment of those 
resources. Not doing so risks overlooking the influence of prior history, resource availability 
and the mediating processes through which resources influence educational outcomes. 
Indeed, the distinction we draw between resources and investments introduces some agency 
and reflexivity at the family and school level into what are arguably over-determined 
conceptualizations of resource effects in past research. Families with resources of income 
and parental education may choose not to invest them in their children’s education. Variance 
in resource investment across similarly situated families, particularly when tied to 
geographical residence, may represent differences in cultural road maps (Tomaskovic-Devey 
and Tomaskovic-Devey 1988). That is, people who live in communities with historically limited 
access to education or modest economic returns to educational investments may simply 
invest less in education. This may be a function of persistent resource disadvantages, but 
may also be shaped by community expectations arising from local job structures and 
historical arrangements. 

Family educational investments will be constrained most straightforwardly where depressed 
income, poorly educated parents and family disruption are more commonplace (Roscigno and 
Ainsworth-Darnell 1999). There will, however, be a discretionary dimension to this decision- 
making (Teachman 1987; Steelman and Powell 1991). There are several family-level discretionary 
investments with consequences for educational success. First, parents can invest and transmit 
to their children cultural capital; that is, “high brow” cultural attributes, typically held in high 
regard by teachers and hence conducive to higher achievement/attainment (Bourdieu 1977; 
DiMaggio 1982; Lamont and Lareau 1988). Such signals are often operationalized as 
involvement in cultural practices such as attending museums or taking extracurricular classes in 
subjects such as dance, art and music (Aschaffenburg and Maas 1997; DeGraaf 1986; 
DiMaggio 1982; Ganzeboom, DeGraaf and Robert 1990). Household educational items (e.g., 
books, computers, newspapers, etc.) likewise represent a potential parental investment with 
consequences for student preparedness (Downey 1995a; Mercy and Steelman 1982; Roscigno 
and Ainsworth-Darnell 1999; Teachman 1987). Levels of parental educational involvement, likely 
shaped by local opportunity and its more direct impact on labor market experiences, also will 
hold relevance for childrens’ educational performance (Kohn, Naoi, Schoenbach, Schooler and 
Slomezynski 1990; Lareau 1989).’ 

While certainly constrained by resources, investment decisions are partially independent and 
influential for educational outcomes in and of themselves (Roscigno and Ainsworth-Darnell 
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1999). That is, while investments are correlated with social class resources and family structural 
patterns, they are not empirically reducible to them. In order for families to invest their resources 
of time or money into educating their children, they must believe that there is a connection 
between the investment and their children’s future. (For example, see Buchmann 2002.) It may 
be the case that community context influences these investment choices. Specifically, rural and 
inner city parents may make less in the way of educational investments because people in poor 
places have fewer resources. However, there may also be spatial differences in the perceived 
benefits of such investments given the economic contexts in which families reside. 

The distinction between resources and investments may be useful for explaining school-level 
processes as well. Schools’ investments will certainly be patterned by the availability of 
resources, including lower tax revenues in inner-city and rural locales. Educators and school 
boards, however, may alSo invest resources in a manner consistent with perceived needs of the 
local population and local labor markets. Here, investment choices are a function of both 
resource availability and local political processes regarding the distribution of public funds. For 
example, the rural “brain drain,” or movement of educated people out of limited or declining 
labor market locales, discourages such communities from investing resources in education, 
particularly college-oriented education (McGranahan 1980; McGranahan and Ghelfi 1991; 
Swaim and Teixeira 1991). The logic here is that the beneficiaries of such spending would not be 
the rural places themselves, but rather the non-rural locales to which the educated segments 
of rural populations relocate. Similarly, schools in inner city areas might divert funds from direct 
instruction to social services and behavioral control as the schools endeavor to do much more 
than educate. Teacher encouragement of student achievement (as a potential interactional and 
time investment in a student body) will likely be depressed in rural and inner city places due to 
the community context in which the teacher works (e.g., poorer locale, segregated school, etc.) 
as well as the class background of students with whom the teacher interacts. 


Local Opportunity & Institutional Resources/Investments 


Our most general theoretical argument is that families and schools, as distinct although often 
overlapping institutional spheres, are embedded within and shaped by places — places that vary 
significantly in opportunity and, consequently, resources. Here we are recognizing both the 
spatial patterning of opportunity and the ways in which local opportunity permeates or mitigates 
inequality through more familiar and proximate institutional (i.e., family and school) channels. 

Clearly, resources and investments matter in each of these spheres. Yet, grouping 
institutional resources and investments together would obscure mediating processes through 
which resources come to influence educational outcomes. Prior work has done little to make 
the resource-investment connection explicit (for an exception, see Powell and Steelman 
1991). Our distinction between resources and investments brings discretionary dynamics and 
their interplay with local opportunity more explicitly into the conceptualization. 

Specific to our analyses, we expect that institutional resources at both family and school 
levels will be depressed in inner city and rural places — an expectation grounded in research 
on spatial stratification and labor market opportunity (Lichter 1989: Tickamyer and Duncan 
1990; Tomaskovic-Devey 1991; Wilson 1987). Where family/school resources are constrained, 
educational investments important for achievement/attainment should be limited as well. 
Discretionary investment at family and school levels, however, may also reflect cultural and 
political orientations tied to the local labor market opportunities of rural and inner city areas 
in the United States. Rather than exogenous or specific to certain populations as suggested 
in more traditional cultural studies, such differences are conceived of here as emergent 
phenomena, manifested in relation to local political and economic conditions. 
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Data and Measurement 


We draw from three waves of the National Education Longitudinal Survey (NELS) and the 
Common Core of Data (CCD). NELS is a large, nationally representative data set with 
student, parent, teacher and principal components. During the first wave in 1988, the 
National Center for Education Statistics drew random samples of about 25 8" graders in 
each of about 1,000 middle schools. Students were then traced in 1990 and 1992, with 
high follow-up response rates (National Center for Education Statistics 1992). These data 
provide a unique combination of measures of inner city and rural location, family/school 
resources and investments, achievement/attainment outcomes, as well as appropriate 
controls for individual, family and school characteristics. 

The CCD, from which school/district data are drawn, is the primary database dealing 
with public elementary and secondary education used by the National Center for 
Education Statistics. It is a comprehensive, annual (1986-1987 through 1991-1992), 
national statistical database of all public elementary and secondary schools and school 
districts across the United States with comparable data for all districts and schools. It is 
from this source that we draw important indicators pertaining to school racial composition 
and educational spending. Due to the exclusion of private school students from the CCD, 
our analyses focus on patterns of achievement/attainment among the U.S. public school 
population only. 

We take advantage of the longitudinal nature of the data by predicting 
achievement/attainment at a later time relative to family background and school attributes. 
This bolsters confidence pertaining to causal ordering. Because family attributes precede 
school attendance, they are measured at Time 1 (8" grade) and school attributes are 
measured at Time 2 (10" grade). Achievement is measured in 10" grade to prevent the 
loss of dropouts — a group that is arguably lower in the achievement distribution — while 
dropout status is measured in the 12" grade. Rather than lose cases or artificially reduce 
variation through general mean substitution, regression imputation with random error 
components was used to replace missing items for explanatory measures (Jinn and 
Sedransk 1989).2Definitions, means and standard deviations for achievement/attainment 
and key explanatory variables are reported in Table 1. 


Inner City and Rurality 


We measure both inner city and rural location dichotomously using information about the 
community provided by the principal of a student's school. This has a somewhat more 
subjective element than the other available indicator, which is based on census 
designation of urbanicity and rurality (see NCES 1992) and restricts the inner city and rural 
sub-samples to fewer students and schools.’ We believe, however, that this measure, 
drawn from the principal report, takes into account not only the geographic location of the 
school but also the character of the community and makeup of the student body (see also 
Roscigno and Crowley 2001). Localities specifically described by the principle as “rural” 
or “farming” were coded 1 for the rural measure. Urban localities described as large or 
medium-sized inner city metropolitan locales were coded 1 on our inner city indicator. The 
referent includes suburban and urban fringe locales. Approximately 20.4 percent of the 
students in our sample reside in a rural locality, and 6.2 percent reside in an urban inner 
city area. The remainder live in suburban/urban fringe locales. 
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Achievement/Attainment Outcomes 


Achievement is measured by standardized math and reading performance, derived from tests 
constructed by the Educational Testing Service. Following prior analyses (e.g., Roscigno 
1998), which draw from the same data and reveal similar family and school effects regardless | 
of achievement outcome, we use a composite score of math/reading achievement. Despite 
general limitations related to examining test scores, there is a strong benefit to using these. 
They are standardized across school and locality, thus increasing the generalizability of 
findings having to do with local and national patterns. The composite achievement score has 
a mean of 51.03 and a standard deviation of 10.11. Figure 1 offers a visual representation of 
achievement variation across inner city, rural and suburban adolescents. While the spread of 
high to low achievement is nearly equivalent, the means for rural and inner city adolescents 
are generally lower and the distributions are somewhat less normally distributed. Analyses 
of achievement include 16,903 students across 1,185 schools. 


Figure 1. Distribution of Achievement Among Suburban, Rural and Inner City Subsamples 
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Attainment, derived from the third wave of NELS, is measured by student dropout status. 
Dropouts include students who dropped out by the 12" grade and never returned to school. 
Approximately 5.7 percent of our sample is comprised of dropouts, and the sample size for 
analyses of dropout status is 17,345 students across 1,288 schools. The dropout rates for 
inner city and rural adolescents are 8.4 and 6.1 percent, respectively, compared to 5 percent 
for suburban adolescents. Racial comparisons likewise reveal expected patterns, with Black, 
Hispanic and Native American dropout rates at 8.2, 9.4 and 13 percent, respectively, 
compared to 2.9 and 4 percent for Asian and white students. 


Family Resource and Investment Measures 


We measure family SES with two standard measures, family income and parental education. 
Family income is derived from a parental response and falls into a 15-category ordinal scale 
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ranging from 1 (none) to 15 ($200,000 or more). This variable has been recoded to reflect 
midpoints and the natural log is used in the hierarchical analyses. Parental education is an 
ordinal scale (1-7) of highest level of education attained. Where one biological parent is 
present in the household, that parent's education is used. Where both are present, an average 
is used. Family structure is measured with two binomial measures, single parent and 
parent/step-parent (referent = traditional, two-parent family). Siblings are measured as the 
number of siblings residing in the student's household. Non-traditional parental configurations 
and more siblings are, hypothesized to reduce, on average, the material and emotional 
resources available to children (Downey 1995a). 

Educational investment at the family level is operationalized with measures of household 
educational items, cultural capital and parental involvement. Household educational items is 
a scale indicator (0-11) of the number of items in a student's household conducive to learning, 
and is detailed in Table 1. The alpha for this scale is .8. Cultural capital is also measured 
ordinally (0-8) and reflects whether or not the student has participated in cultural trips (i.e., 
museums) and cultural classes (i.e., art, music, dance, language). Measurement is consistent 
with previous analyses of cultural capital and its influence on educational success, 
emphasizing societally valued, high status and “high brow” European cultural attributes and 
/practices as they are played out at the household level (e.g., Aschaffenburg and Maas 1997; 
\DeGraaf 1986; DiMaggio 1982; Ganzeboom, DeGraaf and Robert 1990). The alpha for 
combined components is .7. Parental involvement is a standardized scale indicator of parent 
involvement and interactions with their child’s school, and with the child about academic 
course work and performance (alpha = .8). 


School Resources and Investments 


Our operationalization of school resources includes educational spending per pupil and the 
tace and social class composition of the school. Educational spending is measured as the 
current level of spending per pupil from all (federal, state and local) sources. Some prior 
research has shown that higher per-pupil expenditures leads to higher student educational 
achievement and attainment. Expenditure per pupil reflects a resource pool that can be 
differentially invested by district and school administrators into direct instruction, building 
maintenance, school security and other necessities (Condron and Roscigno 2003; Wenglinsky 
1997). 

Consistent with past and current research (e.g., Coleman 1966) we measure school class 
composition as the percentage of the total student body in the school receiving free or 
reduced-price lunch. This indicator was obtained from the principal (school) component of 
NELS. Schools with a high proportion of poor children in their classrooms are conceptualized 
to have a more difficult educational task. The racial composition of schools is likewise 
straightforward. The Common Core of Data reports racial breakdowns and, from these, we 
computed the percent nonwhite in the student body. Racial residential segregation is one of 
the primary mechanisms through which white advantage and non-white disadvantage have 
been constructed (Bankston and Caldas 1996; Jargowski 1997; Massey and Denton 1993; 
Renzulli and Evans 2005). Public schools with few minorities will face fewer resource 
inequalities stemming from this history of discrimination. Compared to suburban locales, 
urban inner cities have more poor and minority children and, thus have higher allocations of 
federal Title 1 money. This may elevate inner cities’ per-pupil expenditure relative to suburban 
locales. Despite potential alleviation of funding disadvantages through the allocation of 
federal money, rural locales will likely have lower spending per pupil than suburban places as 
a function of their poorer local tax bases. 
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Spatial variation across rural, inner city and suburban schools in class/race composition 
and spending per pupil likely translate into differential school investments. As such, we 
include three of arguably the most important investments schools can make with implications 
for achievement/attainment: smaller class sizes, teacher encouragement and a broader, more 
challenging curriculum. Class size is captured with the student-teacher ratio. It is measured 
straightforwardly as the number of students in the school, divided by the number of full-time, 
accredited teachers (Bidwell and Kasarda 1975; Fitzpatrick and Yoels 1992). We should expect 
lower student-teacher ratios in rural places by virtue of lower population densities rather than 
heightened resources, although this pattern may be mitigated somewhat by lower spending 
and its consequences for the (in)ability to hire new teachers and/or attract them to rural 
locales. The availability of advanced placement classes is our second investment measure, 
capturing the character of the curriculum and its potential effects on student performance. 
Teacher encouragement reflects a principal report of the extent to which teachers at the 
school press and encourage students to excel academically. We see this variable as tapping 
the time and emotional support that teachers invest in their students’ educational success. 


Controls 


Both the achievement and attainment models control for student race and gender status. 
These controls capture potential race- and gender-specific micro-political processes and 
biases that may be occurring in the classroom and family and, in the case of race, spatial 
concentration effects. Along with these more common controls, we include an indicator of 
whether or not the student has moved in the past four years. This selectivity control affords 
some confidence that the main effects are not biased by movement in and out of rural and 
central city areas. This control also likely captures potential disruption and the breaking of 
social capital ties for a student, both of which tend to have negative consequences for 
achievement/attainment (Ingersoll, Scamman and Eckerling 1989; Pribesh and Downey 1999; 
Reynolds 1991). Finally, we control for total school enrollment with other school-level 
predictors to at least partially account for potential school organization and density effects. 


Analytic Strategy and Results 


The argument that inner-city and rural deficits in achievement and attainment are a function of 
family/school resources and investments suggests that multilevel analysis techniques are 
appropriate. We use hierarchical linear and hierarchical logistic modeling (Bryk and 
Raudenbush 1992: Garner and Raudenbush 1991) to estimate multi-level effects on general 
achievement/attainment. OLS and simple logistic regression may produce biased slopes and 
standard errors, due to correlated errors and group-specific error variances (DiPrete and 
Forristal 1994; Kaufman 1995). Preliminary analyses, which explored random effects models 
as well as the possibility of interactions across levels, revealed no significant differences or 
major effects above and beyond those reported here.° Thus, in the analyses that follow, all 
regression coefficients with the exception of the intercept have been “fixed” or constrained 
to be constant. To aid in the interpretation of family and school-level effects, all explanatory 
variables have been centered on their means. For further discussion of mean centering and 
constraining effects within schools or localities, see especially Bryk and Raudenbush (1992), 
Garner and Raudenbush (1991), and Entwisle, Alexander and Olsen (1994). 

Given the sampling strategy used by the National Center for Education Statistics, there is 
likely only one school represented per locale in most cases and only one child per household 
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(NCES 1992). As such, we are constrained to using a two-level model. Locality designations 
and school attributes are measured at the school level (Level 2), while family attributes, 
achievement/attainment outcomes and individual controls pertaining to gender, race and 
whether the student has moved are measured at Level 1. The percent of individual baseline 
and spatial variation explained is estimated, where appropriate. These statistics are reported 
at the bottom of the tables. Although our primary analytic focus in on explaining rural, inner- 
city, suburban differences in educational outcomes, these summary statistics also tell us how 
well our models do in explaining all observed variation. ] 

We begin our analyses by first examining spatial disparities in family/school resources and 
investments. Family and school resource disparities for inner city, rural and suburban 
adolescents are examined using standard mean comparisons and t-tests. For family 
investments, we employ hierarchical linear modeling and model these investments as a joint 
function of family resources and place. For school investments, we use aggregate OLS 
analyses, again modeling investments as a function of resource and type of place. The first 
equation introduces dummy indicators for inner city and rural locales, thus capturing the 
average disparity relative to suburban/urban fringe locales. The second equation adds 
reSOUrces as a potential source of both family and school investments. Declines in coefficient 
magnitude for inner city and rural variables reflect the degree to which deficits in educational 
investment across localities are a function of spatial variations in available resources. 

The second portion of our analyses focuses on the consequences of place, resources and 
investments for achievement and the likelihood of dropping out of high school. First, inner city 
and rural deficits in these educational outcomes are reported. We then introduce family and 
school resources simultaneously in equation 2. This modeling strategy addresses the néed 
for multilevel analyses as well as concerns having to do with spurious school effect findings 
and inadequate model specification (Hanuschek 1996). Declines in inner city and rural 
coefficients allow us to disentangle the extent to which rural or inner city educational deficits 
are a function of resource inequalities. Consistent with our theoretical modeling and causal 
logic, family and school investments are then introduced in equation 3. 


Resource Inequalities and Implications for Educational Investments 


Table 2 reports means comparisons of family and school resources across our sample of 
inner city, rural and suburban adolescents. Not surprisingly, given previous research on spatial 
stratification, inner city and rural families lag behind those residing in suburban locales in 
income. The income deficit for inner city students, reported here as the natural log, is 
approximately $14,000 per year. For rural families, this gap is approximately $10,000. The 
parental education deficit is .45 and .69 for inner city and rural families, respectively, reflecting 
significantly less college experience among parents.’ Inner city families are much more likely 
to be single-parent households and have slightly more children compared to their 
counterparts in suburban locales. Rural adolescents are less likely than suburban/urban ftihge 
adolescents to be members of single-parent families, but are more likely to have siblings. 
Spatial disparities in school resources, captured by class and race compositional measures 
and spending per pupil, are obvious. Inner-city and rural students are much more likely to attend 
schools with poor student bodies. While inner-city students are also more likely to attend high 
minority concentration schools, rural adolescents are not. We expect that this rural pattern, 
pertaining to racial composition in particular, may vary significantly by regional racial composition 
and history. In Appalachia and certain portions of the Midwestern United States, for example; 
one is likely to find a high concentration of poor white students, while in the black-belt South the 
simultaneous concentration of non-white and poor students is more commonplace. 
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Table 2: Means (Standard Deviations) for Family and School Resources Across Inner City, Rural 
and Suburban/Urban Fringe Locales 





Suburban/Urban 
Inner City Rural Fringe 
Mean (S.D. Mean (S.D. Mean (S.D. 

Family Resources 
Income: 9.865 (1.558)* 10.045 (1.091)* 10.400 (1.059) 
Parental Education 2.997  (1.656)* 2.732 (1.427)* 3.422 (1.666) 
Single Parent wid (na)* 145 (na)* 164 = (na) 
Parent/Step-Parent 125 (na) 124 (na) AAT (hal 
Siblings 2.461 (1.696)* 2.326 (1.588)* 2.165 (1.503) 


School Resources 

Percent Receiving Free Lunch 35.137 (26.939)* 27.233 (22.727)* 15.624 (18.550) 
Percent Non-White 69.907 (28.943)* 16.924 (26.340)* 29.298 (29.217) 
Per Pupil Expenditure (in thous.) 5.625 (1.457), 4.404 (1.360)* 5.082 (1.585) 


Note: * denotes significant difference from suburban/urban fringe mean under the p < .01 level. 


Particularly noteworthy are the disparities in per pupil expenditures. Inner-city schools tend 
to provide, on average, higher expenditure per pupil. Federal funds in particular are targeted 
at high-poverty schools, and these federal transfers are incorporated in this measure. It is also 
the case that our spending measure is insensitive to how such money is allocated (i.e., 
instruction, construction, security). Indeed, we suspect that a lower proportion of available 
funds is actually allocated to instruction in inner city locales — something we hope future 
research will explore when appropriate data become available. Consistent with predictions, 
students living in rural locales are allocated approximately $600 less per year in educational 
expenditures than their suburban counterparts. This difference is very likely a function of 
depressed property taxes in rural localities and difficulties generating additional revenue 
through corporate taxation or school bond votes. 

The comparisons reported thus far suggest that the most problematic feature of institutional 
resource disadvantage, with potential implications for inner city achievement/attainment 
deficits, stems from disparities in income and parental education at the family level, and social 
class composition and racial segregation at the school level. For rural students, inequalities in 
family social class background are likewise apparent, as are disparities in school social class 
composition and expenditures per pupil. These disparities are likely important for achievement 
and attainment, in part, because they shape meaningful investments at both family and school 
levels. We now turn to analyses of family and school investments in Tables 3 and 4. 

Equation 1 of Table 3 reports the average gaps in household educational items, cultural 
capital and parental involvement across inner city and rural adolescents relative to their 
more suburban counterparts. Inner city and rural deficits in family educational investments 
are strong and statistically significant. On average, inner city adolescents are at a .80-, .24- 
and .03-unit disadvantage in terms of household educational items, cultural capital and 
parental involvement, respectively. For rural adolescents, these deficits are smaller when it 
comes to household educational items but larger in terms of cultural capital and parental 


involvement. 
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Equation 2 of this table highlights the extent to which these inequalities in educational 
investments are being driven by the inequalities in family resources. Note the significant decline 
in inner-city and rural coefficients for each type of investment after family resources are introduced. 
The coefficient for the inner city gap in educational items decreases by .37, while the disparity 
pertaining to cultural capital becomes non-significant. Rural deficits in these potentially influential 
investments diminish as well, by 59, 41 and 62 percent, respectively. These findings are consistent 
with those few analyses that explicitly tie educational investment decisions at the family level to the 
availability of family resources (DiMaggio 1982; Roscigno and Ainsworth-Darnell 1999; Teachman 
1987), but extend them by specifying the spatially varying nature of these inequalities. 

The effects of family resources, most notably income, parental education and number of 
siblings, on family investments are strong and generally in line with our expectations. Parental 
income and education, both of which were shown earlier to be lower in inner city and rural 
locales, have a strong positive effect on all three of our investment outcomes. The influence of 
non-traditional parental structures is strong and negative for educational items and parental 
involvement, although weak or non-significant in the case of cultural capital. Number of siblings 
appears to dilute all three forms of family educational investments. (See also Downey 1995a.) 

Approximately 45, 45 and 30 percent of the total spatial variation in household educational 
items, cultural capital and parental involvement, respectively, are accounted for in these models. 
lt appears that significant portions of family investment differences are tied to inner-city, rural 
and suburban inequalities in the availability of resources. Only in the purchase of household 
educational items is the inner city coefficient still significant. Although this may reflect 
differences in cultural expectations, we suspect that differences in available income in inner-city 
areas are largely responsible. Not only is the cost of living likely to be higher but also earnings 
are likely to be more unstable and wealth accumulation lower. For this reason, our single 
indicator of one year's family income is probably a weaker measure of financial resources in 
inner city neighborhoods. If there were general cultural differences between suburban and inner- 
city households they should be apparent in all indicators, not just the one that requires money. 

For all three family investment indicators, rural areas have a significant residual 
disadvantage relative to suburban locales. Approximately half of the rural disadvantage reflects 
household resources differences, but about half remains after adding household resources to 
the model. Some of this may reflect measurement error in family resources, but the 
consistency and magnitude of these gaps seem to suggest something unique in rural places. 

Table 4 reports parallel analyses for school investments. In general, inner city schools have 
higher student-teacher ratios while rural schools have lower student-teacher ratios (p , .001), 
even after controlling for school size differences. Rural and inner city schools are both 
disadvantaged insomuch as they are less likely to offer advanced placement classes, thus 
reducing curricular opportunities available to children, and exhibit lower levels of teacher 
encouragement. Rural schools are the most disadvantaged in each of these regards. 

School resources are added in the second models. The rural advantage in student-teacher ratio 
becomes non-significant. The patterns reported earlier in Table 2 and the significant effect of 
school enrollment in this model suggests to us that the rural-suburban differential in student- 
teacher ratio is primarily a function of population density. Inner city and rural deficits in availability of 
advanced placement classes diminish by .02 (or 10 percent) and .27 (or 52 percent), respectively, 
with the addition of school resource indicators. There are similar decreases having to do with 
teacher encouragement, suggesting quite clearly that school investment disadvantages in rural 
and inner city schools are partially a function of the availability of school resources. 

School resource effects generally correspond to our expectations. Educational 
expenditure decreases student-teacher ratios and increases the availability of a more diverse, 
rigorous curriculum. Class composition of the school has no impact on class size, but it is 
negatively related to the availability of AP classes and teacher encouragement. Racial 
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composition is related to higher student-teacher ratios — a pattern which is most likely a 
function of non-white concentration in the high density, urban centers of the Northeast and 
North Central United States. (See Massey and Denton 1993.) 

Table 4 presents aggregate OLS results for school-level investments; the R? tells us how much 
of the spatial variation in school investments is explained by school resources. At .28, .11 and .04 for 
student-teacher ratio, AP Classes and teacher encouragement, respectively, we have explained 
much less of the school spatial investment disparity compared to family investments. This suggests 
a great deal of local variability in educational investments, which we only poorly understand. 
Nevertheless, resource differences partially explain spatial variance in these investments. 

Findings presented in Tables 2, 3 and 4 are largely supportive of the conclusion that both 
educational resources and the investment of those resources at both family and school levels 
vary significantly between rural, inner city and suburban places. Family income and parental 
education are lower in inner city and rural locales. Inner city and rural schools are 
characterized by parallel resource disadvantages, specifically higher concentrations of poor 
students. In the case of inner city schools, this is coupled with high minority concentration. 
Rural students, in contrast, receive lower educational expenditures. 

Resource disparities at both family and school levels translate into inequalities in 
potentially influential investments, most notably household educational items, cultural capital, 
parental involvement, teacher encouragement and the availability of advanced placement 
classes. Inner city areas seem distinct from suburban ones largely in terms of their resources, 
but not as much in terms of their investments, Tables 3 and 4 suggest that, net of resource 
disparities, there are only two significant investment differences between inner city and 
suburban areas — household items and the availability of AP classes. Rural schools and 
families, on the other hand, have significantly lower investments than suburban areas on five 
of the six measures, even after we account for resource disparities. Thus, while both inner city 
and rural areas are burdened by poverty and its correlates, rural areas also seem to place a 
lower value on educational investments given comparable resources. The remaining question 
is, do institutional resource inequalities and their respective associations with educational 
investments uncovered thus far actually nelp us explain achievement/attainment deficits? 


Inner City & Rural Achievement/Attainment Deficits 


Tables 5 and 6 report hierarchical linear and hierarchical logistic modeling estimates of 
math/reading achievement and student dropout status on location, select controls and 
family/school resources and investments. The first equation introduces inner city and rural 
indicators along with individual-level controls, thus capturing the average gap in achievement 
(Table 5) and in the likelihood of dropping out of high school (Table 6). Average math/reading 
achievement is approximately 1.92 and 2.71 standardized points lower in inner city and rural 
localities (p < .001). The average likelihood of dropping out of high school is .25 log-odds, or 
approximately 28 percent higher, in inner city areas (p < .05).° In rural areas, the likelinood of 
dropping out is .17 log-odds, or 18 percent higher (p < .05). Notable among controls are the 
strong negative effects of non-white racial status on both outcomes (with the exception of 
Asian), the positive impact of gender on achievement, and the detrimental effect of moving. 

Equation 2 introduces family and school resources. Declines in the inner city and rural 
coefficients with this addition are striking. The inner city deficit in achievement becomes non- 
significant, while that for rural locales declines in magnitude by 1.56 points or 58 percent. The 
higher average likelihood of dropping out of high school is completely accounted for in both 
cases. In general, institutional resource differentials play a significant role in patterning 
achievement and attainment differences. 
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Table 5: Hierarchical Linear Modeling Estimates of Student Achievement and Deficits on 
Family/School Resources, Investments and Select Controls, 1990 


ee eee 
Math/Reading Achievement 


(1) (2) (3) 
Inner City tot = (Sarl 118  (.461) 173 (.436) 
Rural 2.100" “(aro)” 1.146 (.329)** -.779 — (.309)** 
Family 
Resources: 
Income OO tel» oot. Ula) 
Parental Education 1.154 (.047)*** .934  (.049)*** 
Single Parent -.546  (.204)** -.440  (.204)* 
Parent/Step-Parent -.302 (.228) -.194  (.227) 
Siblings -.343  (.046)*** -.320  (.045)*** 
Investments: 
Household Educational Items 248 (.040)*** 
Cultural Capital 519  (.041)** 
Parental Involvement -.030  (.174) 
School 
Resources: 
Percent Free Lunch -.055  (.007)*** -.043  (.006)*** 
Percent Non-White =016° CLO0a)"™ -.016  (.005)*** 
Per Pupil Expenditure (in thous.) 00" (.075)"" 000 ~=—(.000) 
Investments: 
Student-Teacher Ratio (Ln) -2.424 (.485)*** 
Number of AP Courses 088  (.019)*** 
Teacher Encouragement 947 (.163)*** 
Controls 
Race: 
Asian 1.809 #323)" 1.687 (.316)*** 1792 {315i 
African American 6.115 (.286)** -4.346 (.286)*** = -4.209 = (.283)*** 
Hispanic -4.634 (.270)** $2037 Tooi2TAye* yes 16 hea (2209) 
Native American 6.216 (.770)*™* -4.823  (.748)*** = -4.757 = (.742)*** 
Sex (1 = Female) 824 (.145)*** 978 (.141)*** 188 (.142)*** 
Moved in Past 4 Yrs. -1,155. (.193)"™* -.800  (.188)*** -.743  (.187)*** 
School Enrollment -.001  (.000)** -.000 (.000) 
Intercept 51.118 51.066 51.024 
Variance Components: 
Between Place 16.918 8.312 6.741 
Percent Explained 50.869 60.155 
Baseline Variation 19.07 11.977 71.169 
Percent Explained 4.502 5.574 


Note: Metric Coefficients (Standard Errors) 
+tp<.10 *p<.05 **p<.01 ***p<.001 (one-tailed tests) 
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Table 6: Hierarchical Logistic Modeling Estimates of Student Dropout Status and Completion 
Deficits on Family/School Resources, Investments and Select Controls, 1992 


Dropout Status 


(1) (2) (3) 
Inner City 247 AAT. -.013  (.134) -.044 — (.137) 
Rural 167 = (.088)* -.060  (.099) -.103  (.099) 
Family 
Resources: 
Income = 111. (026) "_ . .-.0/0.- 1,029)" 
Parental Education -.240  (.025)***  -.485  (.026)*** 
Single Parent 434 (.088)*** 394 (.087)*** 
Parent/Stepparent 397 —_(.098)*** ol6” (098) * 
Siblings yhOu me Oo) 120.25 019) 2 
Investments: 
Household Educational Items -.066  (.017)*** 
Cultural Capital “075° >(:022)"" 
Parental Involvement “101 - 1.070), 
School 
Resources: 
Percent Free Lunch 007 _-(.002)*** 005% 2002)" 
Percent Non-White 002 © (.001)+ 003 = (.002)* 
Per Pupil Expend. (thous.) pat ZUeealet jis -.000  (.000)*** 
Investments: 
Student-Teacher Ratio (Ln) -.142  (.175) 
Number of AP Courses -.014 — (.009) 
Teacher Encouragement -052 = (.052) 
Controls 
Race: 
‘Asian -.286  (.131)* -.317 — (.149)* -.362  (.153)™ 
African American RSet Ul). Olt, (2119) -.019 = (.118) 
Hispanic (obmet NGL) a 000 lide) 044 (.112) 
Native American 4000 cucli2aT)taceterecDAL oy -254)c7 640  (.257)™ 
Sex (1 = Female) -.004  (.060) -.055 = (.063) -.043  (.064) 
Moved in Past 4 Years B33 PE 0b0 1217-085)" 200  (.085)** 
School Enrollment 001 (.000)*** 000  (.000)*** 
Intercept -2,863 -3.082 -3.106 


Note: Log-Odd Coefficients (Standard Errors) 
ps MOONE POS pol ep <001 (one-tailed tests) 


2138 © Social Forces Volume 84, Number 4 ¢ June 2006 


Resource effects are also in the directions one would expect and with a great deal of 
consistency across outcome. Supplementary analyses (not shown) introduced familial and 
school resources independently, and revealed quite similar albeit slightly larger effects on 
both counts. While family resources provided about twice the explanatory power relative to 
school resources, institutional resources within both arenas remain influential whether 
introduced independently or together. 

Family class background, captured by income and parental education, has a strong positive 
influence on achievement and decreases the likelihood of dropping out. The influence of family 
structure (with the exception of the influence of parent/step-parent on achievement) is also 
strong and consistent across outcomes. Non-traditional family forms and number of siblings 
exert a negative influence on achievement and increase the likelihood of dropping out. 

Notable are the persistent and strong effects of school resources, even after accounting 
for family resources. The concentration of poor students and non-white students depresses 
achievement and magnifies the average likelihood of dropping out. The influence of per-pupil 
expenditure is significant and in the expected directions. Students residing in inner city and 
rural areas are disadvantaged on many of the influential family and school resource 
dimensions, so it should come as no surprise that the model explains a substantial portion 
of the rural and inner city gaps in achievement and attainment. 

We also expect that resources influence achievement/attainment partially through 
investments that families and schools make. Tables 3 and 4 detailed associations between 
resources and investments. We now follow through on our causal logic by introducing family 
and school investments in equation 3 of Tables 5 and 6. Note the further decline in the rural 
achievement deficit after family/school investments are added. Consistent with expectations 
and causal associations detailed earlier, it appears that a portion of family resource effects, 
most notably those for income and parental education, are mediated through household 
investments in educational items, cultural capital and parental involvement (compare 
equations 2 and 3). Household educational items and cultural capital have strong 
consequences for achievement and the likelihood of dropping out of high school. Parental 
involvement, however, has no direct impact on achievement, but does reduce the likelihood 
of dropping out. 

School investments matter as well for achievement. Their impact on attainment, however, 
is less clear. Number of AP courses offered, lower student-teacher ratios and teacher 
encouragement each result in higher average levels of achievement, and these investments 
mediate some of the class and race compositional influence and almost the entire influence 
of monetary expenditure. Some mediation similarly appears to be occurring in the case of 
dropout status, although none of the school level investments reach significance. This 
suggests that the schoollevel investments we have included may be geared toward students 
on the upper end of the educational achievement distribution, and are less relevant to those 
considering dropping out. If this is the case, then future research might consider explicitly 
extending operationalization of investments to ones more meaningful for vulnerable 
segments of the student population. Despite this limitation, it is notable that the combined 
influence of resources and investments explains significant portions of inner city and rural 
deficits in achievement/attainment. The rural achievement deficit remains significant in the 
final models, but it is quite small at only 8 percent of a standard deviation. 


Conclusions 


Educational research, while effectively highlighting the importance of family and school- 
level processes, has tended to pay much less attention to spatial inequalities in 
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achievement and attainment. Current educational policy has also neglected the role of 
place, focusing instead on introducing competition to public education, increasing 
mandatory standardized testing, and identifying what have been deemed “failing” schools. 
Such policy foci and aims are problematic given significant variation in educational 
opportunity across geographic space and its consequences for educational resource and 
investments. 

This article reflects an effort to contribute to the sociological literature and to public 
dialogue pertaining to educational success and vulnerability in three important ways. First, we 
develop.a model that explains U.S. spatial variation in educational success with particular 
emphasis on disparities between inner city, rural and suburban places. Second, we make the 
theoretically important distinction between educational resources and investments, how they 
may be related, and why they are not be perfectly correlated. Finally, we empirically model 
rural and inner deficits in achievement and attainment, and delineate the role of resources and 
investment inequalities at both family and school levels. 

The analyses presented are limited to public school students, thus warranting some 
caution. The patterns revealed are nevertheless noteworthy. Families and schools in 
America’s inner cities and rural places simply lack many of the resources that promote 
educational achievement and attainment. Families in these locales have lower family income, 
less parental education and more siblings per household. Inner city schools have high 
concentrations of poor and non-white students. Rural schools have high concentrations of 
poor students and lower per-pupil expenditures. These resource shortfalls explain most inner 
city and rural deficits in achievement and attainment. Although some of the relationship 
between resources and educational outcomes is direct, it is also partially mediated through 
investments made by families and schools. 

We interpret spatial variations in education investments that remain after controlling for 
resource inequalities as potentially indicative of divergent educational expectations. We also 
interpret remaining direct effects of geographic location as possibly capturing peer group 
cultural differences in educational aspirations and value, something that we do not model. 
Interestingly, though, and despite repeated characterization of inner city teenagers as 
developing an antipathy toward education (Anderson 1990; Fordham and Ogbu 1986; 
McLeod 1995), we find no clear evidence of subcultural patterns or of lower achievement or 
higher dropout rates after we control for educational resources. (See also Ainsworth-Darnell 
and Downey 1998; Cook and Ludwig 1997; Hochschild 1995; Tyson, Darity and Castellino 
2005.) Instead, inner city investments of both families and schools are equivalent to suburban 
investments after resource inequalities are taken into account. Rural areas, on the other hand, 
display some evidence of lower investments in education for similar levels of resources. We 
conceptualize these differences in expectations and investments, to the extent that they 
actually exist, to be embedded in community histories and especially stratification histories. 
Communities develop different cultural “tool boxes” (Swidler 1986) or “roadmaps” 
(Tomaskovic-Devey and Tomaskovic-Devey 1988) precisely because they have faced different 
practical problems. 

Recognizing the embedded nature of families and schools and the consequences for 
resource and investment disparities across place offers a more dynamic and fluid picture than 
do overly individualistic frameworks of educational processes and outcomes. Such views are 
limited and should be supplemented with an understanding that the dynamics of stratification 
are ultimately manifested at a more local level, and mediated through proximate institutions. 
Indeed, an integrative and multi-level conceptual frame, such as the one presented here, 
would be beneficial for most analyses of stratification, for theory pertaining to the institutional 
foundations of inequality, and for understanding the sociological significance of spatial 
inequality for individuals and populations. 
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Notes 


1. The race and class compositions of schools vary tremendously across rural, inner city and 
suburban locals and are often used as proxies for school resources. The history of many 
inner city neighborhoods is clearly one of historical residential, and thus educational, 
segregation (Massey and Denton 1993; Orfield and Yun 1999). When we think about the 
problems of inner city neighborhoods we: are often talking about poverty, social 
disorganization and crime — the result of historical city-suburban development including 
racist housing policies (Wilson 1986; Jargowski 1997). Similarly, the South, the rural black- 
belt South in particular, has a disgraceful history of segregated public schools, white-only 
academies following desegregation, and profound economic disadvantages in minority 
neighborhoods and schools (Walters and James 1995; Falk and Rankin 1992; Tomaskovic- 
Devey and Roscigno 1996). 


2. Although cultural capital and household educational items might be conceived as 
resources from which students can draw, their use is shaped by class background and 
social status (Roscigno and Ainsworth-Darnell 1999). Families with few resources can 
decide to invest heavily in education, thus partially compensating for class disadvantage. 


3. Prior to imputation, we ran analyses only on cases with valid data all measures. Aside from 
minor differences in coefficient magnitudes, statistically significant findings were almost 
identical to those presented here. 


4. The census-defined rural sample, for example, includes 5,589 students, while the more 
restrictive principal report yields a sample of 3,597. The two measures are correlated at .5. 


5. Random effects models revealed no substantive differences whatsoever. Our analyses 
of cross level interactions, specifically between locale type and family and school level 
investments, only revealed two mildly significant interactions: rural * cultural capital and 
inner city * household educational items. Both suggest that the returns to these 
investments are only a little higher in these locales, respectively. Because these effects 
were small, however, such results are not reported. They are, nevertheless, are available 
from the authors by request. 


6. Residual effects of inner city or rural location on achievement may signify cultural 
expectations about the importance of education. Because parental involvement and 
teacher encouragement are in the model, these residual effects also may capture 
differences in peer culture associated with rural, inner city or suburban residence. Of 
course, such residual effects could also reflect incomplete measurement of resources 
and investments. The lack of residual effects for inner city areas suggests that our 
measures of resources and investments are fairly effective. 


7. Avalue of 2 on this scale is equivalent to a high school degree or GED equivalent, a3 
reflects a high school degree with some college experience, while a 4 equates to a four- 
year college degree. 


8. This percentage is based on conversion of log-odds into odds ratios. The findings in this 
table are available in odds ratio format from the authors. 
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Social Movement Tactics, Organizational Change and the 
Spread of African-American Studies 
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Abstract 

Social movement research suggests that protest is effective because it de-legitimizes existing 
policies and imposes costs on power holders. I test this hypothesis with data on African- 
American student protest and the creation of departments of African-American Studies. I 
find that non-disruptive protest, such as rallies and demonstrations, has a positive effect on 
the probability that a university will create a Department of African-American Studies. 
Disruptive protest, such as sit-ins and vandalism, has no significant effect. I argue that 
non-disruptive protests are successful because they allow sympathetic administrators to act 
on behalf of political challengers, while disruptive protests deprive administrators of the 
legitimacy needed to enact change. I also find limited evidence of intra-university mimicry 
as a factor in the creation of African-American Studies programs. 


Beginning in 1968, African-American students and their allies staged marches, 
demonstrations and sit-ins so that university administrators would create academic units 
dedicated to African-American culture and history (Bunzel 1968; Bailey 2001; Orrick 1970; 
Crouchett 1971; Glasker 2002; Myles 1971). For the next 30 years, 120 universities offered 
African-American Studies degrees and 100 more offered degree concentrations (College 
Board 2002). As of 2005, approximately 9 percent of all institutions of higher education in the 
United States have academic units offering African-American Studies degrees. Although black 
student activists demanded African-American Studies programs at many campuses, not all of 
these campuses established programs, even those with substantial black student populations 
and campus unrest. This outcome suggests that activists were not equally effective in 
pressing their demands. 

This study explores the effects of movement tactics on organizational change. The African- 
American Studies movement raises important questions about social movements and their 
impact on organizations because a university's response to the African-American Studies 
movement should depend on how student activists assert their demands. Although social 
movement researchers often discuss the link between tactics and outcomes (e.g., Andrews 
2001; Gamson 1990 [1975]; McAdam and Su 2002) and how movements target organizations 
(e.g., Binder 2000, 2002; Davis, McAdam, Scott and Zald 2004; Lounsbury 2001; Lounsbury, 
Ventresca and Hirsch 2003), scholarship infrequently addresses the differing tactical effects 
on organizations. This study contributes to the literature by examining the difference between 
disruptive tactics, such as vandalism and violence, and non-disruptive tactics, such as rallies, 
on the probability that administrators will make concessions to protestors. 
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The subsequent sections review the literature and present hypotheses. Using longitudinal 
data on black student protests and universities, | estimate a discrete time logit model of 
African-American Studies program creation where African-American Studies program creation 
depends on protest events, national trends in African-American Studies program adoption 
and control variables, such as university size and student demography. The concluding 
section offers an explanation for the difference between disruptive and non-disruptive tactics, 
which is based on case studies of the black student movement. 


Protest and Organizational Change 


Social movements targeting organizations employ a variety of tactics. Some movements use 
disruptive tactics that prevent the target organization from completing its tasks. Sitins, a 
movement tactic used by labor groups in the early 20" century, forced industrial plants to 
cease operations until their demands were met. Other social movements choose less direct 
methods. For example, the early 20" century women’s suffrage movement tried to influence 
state legislatures through the use of marches, rallies and publicity. Rallies were an attempt 
to change public opinion so that state legislatures and other political bureaucracies would 
approve and enforce women’s right to vote. Suffragists did not try to disrupt legislators’ daily 
routines through sit-ins or riots. 

The variety of social movement tactics raises an important question. Which tactics 
encourage organizational change? Is it better for a movement to directly interfere with an 
organization? Or, is it better for a movement to appeal to public opinion and hope that 
managers will institute new policies and enact change? This study addresses this question 
with an empirical analysis of the African-American Studies movement, which used both kinds 
of tactics to persuade university administrators that they should establish departments of 
African-American Studies. 

Although social movement researchers have not directly addressed whether disruptive 
movement tactics spur organizational change, a voluminous literature on movements 
suggests the mechanisms linking movement actions to other types of social change. Gamson 
argued ([1975] 1990) that violent social movements are more likely to achieve their goals than 
nonviolent movements. Analyzing data on American social movements in the 19" and 20" 
centuries, Gamson argued that movements employing strikes, violence and other disruptive 
techniques are more able to draw attention to their goals, impose costs on political 
incumbents, and ultimately achieve their goals than movements using non-disruptive 
techniques. Since Gamson’s work, other scholars have reanalyzed his data (Goldstone 1980) 
and tested his hypothesis with other data (Frey, Dietz and Kalof 1992; Mirowsky and Ross 
1981; Ragin 1987; Stedley and Foley 1979). Much evidence does show that disruptive tactics 
do correlate with a movement's goal attainment (e.g., Cress and Snow 2000). It should also 
be noted that some scholars argue that violent or disruptive protest damages a social 
movement's reputation and has negative effects on goal attainment (Schumaker 1975, 1978). 
Giugni (1999) believes that the evidence is mixed on this issue. Much of the debate hinges on 
definitions and measures of protest and its outcomes. The appendix to Gamson (1990 [1975]) 
reprints the exchanges among Gamson, Goldstone and others. 

Political scientists argue that riots and contentious gatherings tactics allow movement 
participants to extract concessions from organizational participants in exchange for ending 
protest, which has been called the social control hypothesis. (Durman 1973; Isaac and Kelly 
1981; Piven and Cloward 1977, 1992; Welch 1975) According to these researchers, state 
actors — legislators, law enforcement officials, etc. — have a strong interest in preventing 
public challenges to their authority. If movement leaders succeed in mobilizing a large number 
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of persons and publicly challenging the state, then state actors might perceive their authority 
to be eroding. Social control theories imply that state actors mollify disruptive groups through 
policy change or the establishment of institutions catering to movement actors. Scholars have 
tested this hypothesis in various contexts by estimating the effect of the number of protest 
events on state budgets. For example, Fording (2001) found that black protest within a state 
results in expansions of the Aid to Families with Dependent Children program. Andrews 
(2001) arrived at a similar conclusion regarding the Civil Rights movements and the Great 
Society programs of the mid-1960s. 

This research on violence and other disruptive tactics suggests that they impose serious 
costs on an organization. Firm owners lose income when striking laborers stage sit-ins and 
work stoppages. Violent tactics destroy property and reduce a firm's wealth. Also, an 
organization's leaders face erosion of their authority if disruptions go unchecked. The 
corresponding hypothesis is: 


Hypothesis 7: Disruptive tactics, é.g., sit-ins, riots, vandalism, riots and 
other tactics aimed at preventing an organization from achieving its 
goals, will have a positive effect on the probability that an organization 
will change in response to the movement. 


Social movements also employ non-disruptive tactics focusing on an organization's 
legitimacy and public image. The movement indirectly changes an organization by showing 
how behavior contradicts stated ideologies, policies and social mores. Examples abound: 
anti-apartheid movements in the 1980s pressured universities and other organizations to 
sever ties with firms that invested in South Africa; the anti-sweatshop movement encourages 
garment manufacturers to assume responsibility for working conditions by generating 
negative publicity for manufacturers; and the women’s suffrage movement employed mass 
rallies and marches to persuade the public that denying women’s right to vote was 
inconsistent with democratic principles. Movement tactics such as rallies also seek to directly 
change public opinion. For example, environmental movements have tried to persuade the 
public that logging practices are illegitimate because they are damaging to the environment. 
Activists hope that public opinion change might lead to regulation of industry or changes in 
firm behavior. 

Della Porta and Diani (1999:173) identify two additional motivations behind non-disruptive 
tactics. First, non-disruptive tactics are often motivated by a belief that power holders will 
change their behavior if the movement demonstrates that many people agree with the 
movement's demands. Large rallies and demonstrations are supposed to show mass support 
for the movement. This sort of behavior is an appeal to democratic values. Non-disruptive 
events are also characterized by symbolically charged appeals to abstract moral principles. 

_Hunger strikes are the most sensational example. Campus teach-ins are a less extreme 
example where students and teachers show moral rectitude by conducting classes on their 
movement's cause. This leads to: 


Hypothesis 2: Non-disruptive tactics, e.g., mass demonstrations, 
rallies, hunger strikes and other tactics aimed at challenging an 
organization's legitimacy, will have a positive effect on the probability 
that an organization will change in response to the movement. 


Neo-institutional theory suggests that protest might have important indirect 
consequences. A central argument is that organizations seek legitimacy by copying other 
organizations (DiMaggio and Powell 1983; Meyer and Rowan 1977; Scott 2000). When 
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managers do not know what to do, they will look to their peers. A movement challenges 
existing practices and creates uncertainty for an organization. Neo-institutional theory 
indicates that successful mobilizations create uncertainty for organizations. Organizations that 
concede to a movement are then copied by others because early adopters confer legitimacy 
to the movement's demands. Such mimicry is a vital part of the story of how movements 
change industries. Organizations might copy similar organizations (Haveman 1993), others in 
the same geographical region (Hedstrom 1994; Knoke 1982; Myers 1997), or they might 
simply follow national trends (Strang and Soule1998): 


Hypothesis 3: The number of organizations changing in response to 
a movement, nationally or within a region or peer group in a given 
time period, has a positive effect on the probability that other 
organizations will adopt that form in subsequent time periods. 


A related issue is how protest interacts with mimetic processes or structural variables that 
are known to correlate with organizational change. Protest might magnify mimetic effects by 
adding urgency to the desire to find legitimacy by copying other organizations. A manager 
confronted with demonstrators might be more inclined to change behavior if his or her peers 
have conceded to the movement. Similarly, resources and structural variables, such as size, 
might have a larger effect when there is protest. For example, an interdisciplinary academic 
program, such as a Department of African-American Studies, might be easily established in a 
university with many academic programs. Professors already working for the university might 
be more inclined to promote a new academic unit if they know there is strong demand for it as 
indicated by protests for the program. The following hypothesis summarizes this argument: 


Hypothesis 4: The interaction of the number of protest events and 
mimetic variables or structural variables, such as size, will have a 
positive effect on the probability that an organization will adopt an 
organizational form promoted by a movement. 


Context: The Movement for African-American Studies 


This study uses data on the movement for African-American Studies because students 
employed many different tactics to get departments of African-American Studies established 
at their universities. Tactics included sit-ins, rallies and occasional violence and vandalism 
(Myles 1971; Orrick 1970). Black student activists took their cue from the broader Civil Rights 
and Black Power movements, which used “direct action” to achieve their goals. Historians 
often view “direct action” as a departure from an earlier style of black politics emphasizing 
court action and government lobbying (Meier, Rudwick and Broderick 1971). 

Scholars trace the beginnings of “direct action” to the Congress for Racial Equality (CORE). 
According to its statement of purpose (CORE 1971 [1942]: 239), the organization was a 
coalition of local groups whose purpose was to eliminate the color line through “direct, non- 
violent action.” CORE conducted the first lunch counter sit-in when CORE member James 
Farmer led 25 people to a Chicago coffee house where he had previously been refused 
service (Farmer 1965: 60-62). By the late 1950s, sit-ins and other forms of “direct action” had 
become popular within the Civil Rights movement. By the 1960s, “direct action” was at the 
forefront for the struggle for African-American freedom. “Direct action” was soon followed by 
a resurgence of black nationalism, which some scholars think is correlated with American 
participation in foreign wars such as the Vietnam War (Henderson 2000). 
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The turn towards direct action coincided with black intellectuals’ disenchantment with 
American colleges and universities (Carr 1998). Ever since Carter Woodson (1933) wrote that 
American colleges failed to serve African-Americans, black intellectuals urged that 
educational institutions dedicate themselves to developing and transmitting knowledge about 
the African-American community. In the early 20" century, black intellectuals made significant 
moves in this direction. W.E.B. DuBois’ research center at Atlanta University, founded in1910, 
is an example of a research institution dedicated to the African-American community. Carter 
Woodson founded the Association for Negro History in 1927, another example of an 
intellectual organization dedicated to the study of the African-American community. 

“Direct action” and the desire for autonomous African-American intellectual institutions 
converged when black students mobilized for departments of African-American Studies. 
Bobby Seale and Huey Newton founded the Soul Brothers Association (predecessor of the 
Black Panthers founded in 1966) and demanded “Afro-American history” courses at Oakland's 
Lane College (Seale 1991 [1970]). In 1968, black students allied with the Black Panthers 
organized a strike at San Francisco State College that resulted in the creation of the first 
Department of African-American Studies (Karenga 1982: 21-24; Orrick 1970; Rojas 2007). The 
San Francisco State College strike was successful because the students had already created 
a functioning African-American Studies curriculum as part of an experimental college and 
students were well versed in the Civil Rights movement's tactics. Soon afterwards, black 
students at other campuses successfully mobilized for African-American Studies, and 
activists and students. developed proposals for departments, programs, courses and even all- 
black colleges within predominantly white universities (Van Deburg 1992). 

African-American Studies programs are one of the black student movement's most visible 
and enduring achievements. Approximately 120 universities have instructional units offering 
African-American Studies degrees. Another 100 campuses offer minor degrees and non- 
degree certifications. African-American Studies programs fulfill multiple roles in universities as 
“institutionalized expressions of racial difference” (Drake 1979) and support networks for 
students and faculty (Aldridge and Young 2000; Hine 1990; Stewart 2000). The movement for 
African-American Studies also resulted in more courses, programs and new curricula 
(Aldridge 2000). Some programs, such as Temple University’s Department, have become 
centers of afrocentricity, which is “a mode of thought and action in which the centrality of 
African interests, values, and perspectives predominate.” (Asante 2003: 2) African-American 
Studies is often seen as treating “the Black or African-American experience as an integral part 
of the broader society.” (Anderson 1990: 2) 

This approach to African-American Studies reflects a belief that there should be black conscious 
institutions, a view held by many in the African-American community today. For example, Dawson 
(2001: 330) finds that 62.2 percent of adult African-Americans would support “black male 
academies.” McCormick and Franklin (2000: 327) find that a majority of Million Man March 
participants think that resources should be pooled in black-owned banks and businesses (327). 

African-American Studies programs are the legacy of a moment when “direct action” 
defined black politics, before the current turn to electoral politics (Tate 1993). The growth of 
African-American Studies programs provides evidence about the effects of student 
mobilization, the institutionalization of racial consciousness and the development of a 
politically aware academic discipline. 


Data and Model Estimation 


| test the hypotheses with longitudinal data on university size, endowment, ethnic 
enrollments, awarded degrees, campus unrest and founding dates of degree-granting 
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African-American Studies programs. The cases in the data set are universities. Data were 
collected for all American institutions of higher education that offer four-year degrees from 
1968 to 1998. There were 1,423 colleges and universities that provided enough data for 
analysis. 


Dependent Variable 


| determined the founding dates of African-American Studies program in the following 
manner. First, | created a list of African-American Studies programs by reading all editions of 
The Index of College Majors (College Board 1977-1998). | collected founding dates of 
programs from historical accounts of African-American Studies programs (such as Downs 
1999; Huggins 1985; Myles 1970; Orrick 1970), reference books (Mitchell’s Multicultural 
Guide to Education, Mitchell 1996), online college catalogs, and departmental websites and 
brochures. If a program's founding date was not available in these public sources, | contacted 
the current chair by telephone or e-mail. | was able to acquire the founding dates of all degree- 
granting programs except one. 


Independent Variable 


Campus unrest data were collected from the New York Times and the Los Angeles Times. 
Both newspapers had good coverage of the Northeast, the Midwest and the West, while the 
New York Times had good coverage of the South. Every report of black student campus 
unrest was coded for the date (day, month and year), the kind of event (sit-in, 
march/demonstration, riot, interpersonal violence, etc.), and the college where the event 
occurred. | used multiple search methods to identify relevant articles and search engines to 
identify articles about campus unrest that involved black students. | used multiple 
combinations of key words such as “black students” and “African-American students” and 
various key words such as “riots,” “sit-ins” and “marches.” | also scanned through these two 
newspapers on microfilm and then verified the results of the search by consulting yearly 
indexes such as the Reader's Guide. 

| define disruptive black student campus protest events to be any instance of protest 
carried out by black students that is an attempt to prevent the university from performing its 
instructional, research or administrative duties. | also include in this category any attempt to 
directly impose monetary costs on noncompliant universities. The disruptive protest category 
includes sit-ins/building occupations, violence among students, classroom disruptions, 
heckling, work stoppages/strikes and property damage. Non-disruptive forms of black student 
protest are any form of protest carried out by black students that does not interfere with the 
university's daily activities. This second category includes rallies, “happenings,” protest 
theatre, hunger strikes and nonviolent demonstrations. Teach-ins that do not coincide with 
building occupations are counted as non-disruptive. 


Control Variables 


Scholars have found that structural and institutional features of an organization predict 
change. In this study, | control for an organization's internal structure and resources, client 
demography and institutional type. The organizational literature (Aldrich and Auster 1986: 
Downs 1967; Merton 1957; Tsouderos 1955) predicts that organizational size correlates with 
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the probability of change. This study employs the number of clients — students — as a measure 
of organizational size. Another literature shows that organizations are more likely to adopt 
innovations (Hage 1999; Zammutto and O'Connor 1992) if they have a fine division of labor. 
That is, organizations with specialized workers are more able to assimilate change than 
organizations without specialized skills and knowledge. In the present study, | measure the 
division of labor in an organization by the number of different degrees awarded by a university. 
Organizational scholars also argue that unused resources facilitate change (Cohen, March and 
Olsen 1972; Paulsen 1990). This study uses university endowments as a measure of spare 
resources. Other researchers note that change is more likely when a constituency will benefit 
from change (Katzenstein 1999; Scully and Creed 1999). The analysis includes the proportion 
of students who are black because they are the constituency most likely to participate in an 
African-American Studies program. To control for the possibility that public universities and 
historically black institutions might be more amenable to black students, | include dummy 
variables for these two characteristics. Data about student enrollments, black enrollment and 
other university characteristics are obtained from surveys such as the Higher Education 
General Information Survey. Table 1 describes each variable and its source. 


Table 1: Variables and Sources 


Years 
Variable Source Measured 
Organizational Variables 
Total Enrollment HEGIS/IPEDS (U.S. Department of Education) 1968-1969 
Enrollment by Race U.S. Civil Rights Survey 1968, 1970, 
(Department of Health, Education and Welfare) he 1974, 
HEGIS/IPEDS 1976-1998 
Curricular Diversity/ HEGIS/IPEDS 1968-1998 
Degrees Awarded 
Per Capita Endowment HEGIS/IPEDS 1968-1998 
Public HEGIS/IPEDS 1968-1998 
Historically Black College = HEGIS/IPEDS 1968-1998 
Carnegie Classification HEGIS/IPEDS 1968-1998 


Campus Unrest 
Annual Black Student New York Times, Los Angeles Times 1968-1998 
Protests on Campuses 


African-American Studies Programs Creations 


Existence of a Program Index of College Majors 1968-1998 
(College Board) 
Founding Date Historical accounts, reference guides, survey 


conducted by author 
a en 
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Qualitative Data 


In the discussion of my quantitative results, | use information from case studies | have done 
of African-American Studies programs. The data for the case studies come from university 
archives such as the University of Illinois at Chicago, the University of Chicago, San Francisco 
State University; public libraries such as the Vivian Harsh Collection at the Carter Woodson 
Library in Chicago and the Schaumburg Collection. of the New York Public Library in Harlem; 
the federal government's collection of materials regarding protests at the Lyndon Johnson 
Presidential Library in Austin, Texas; and personal collections, interviews and published 
university histories. Rojas (2007) describes these materials in more detail. 

| tested the hypotheses by estimating a discrete time logit model, because the creation 
of an academic program happens on a yearly basis and is not a continuous time process. The 
data used in the model estimates are pooled cross sectional data. Cases are discarded the 
year after an African-American Studies program was created. Table 2 presents summary 
statistics and the variable correlation matrix. Diagram 1 presents the cumulative hazard of 
adopting an African-American Studies program. 


Figure 1. Nelson-Aelen Cumulative Hazard Function for the Creation of African-American 
Studies Programs 


Nelson-Aalen Cumulative Hazard Estimate 
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Results 


The first hypothesis is that black students who disrupt the university's activities will increase 
the probability that their university will establish a degree-granting African-American Studies 
program. Social movement research suggests that disruptive acts encourage university 
administrators to placate protestors. Similarly, theory suggests that non-disruptive tactics 
might change public opinion and therefore influence university administrators. Table 3 shows 
that there is a significant correlation between black student activism and the creation of a 
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African-American Studies program. The results of Model 1, which include control variables, 
show that there is a statistically significant effect for both types of protest. The logit model 
estimates the effects of the independent variables on the log-odds that a university will create 
an African-American Studies program. Therefore, the multiplicative effect of a single non- 
disruptive protest conducted by black students on the odds that a university creates an African- 
American Studies program is e?** = 10.50. That is, a single non-disruptive protest event 
multiplies the odds of organizational change by 10. The effect of disruptive protests is also 
large, B = 2.278 (e**”* = 9.76). A single disruptive protest event multiplies the odds ninefold. 
Both protest effects decrease when other variables are included in the analysis. In Model 
4, the inclusion of control variables decreases the non-disruptive protest effect to 1.398 —a 
decrease of 40 percent. The effect of disruptive protests decreases from 2.278 to .815—a 
decrease of 64 percent. This is not surprising, given that researchers have found that features of 
the university such as size and research status predict campus unrest (Soule 1997; Scott and El- 
Assal 1969). Therefore, the inclusion of control variables should decrease the protest effects. 

There is an important difference between disruptive and non-disruptive events. In models 3 and 
4, which include student demographic data, enrollment data and institutional descriptor variables 
such as “historically black college,” disruptive protest has no significant effect, but non-disruptive 
protest is significant at the a = .01 level. The difference between disruptive and non-disruptive 
protest suggests that not all protests are the same. Disruptive protests have no impact on 
structural change once the features of the student body and university are included in the analysis. 

Neo-institutional theory suggested that the number of universities in a year that create an 
African-American Studies program will have a positive effect on the probability that other 
universities will create their own program. Table 3 provides evidence that there is a mimicry 
effect, but the effect only occurs within peer groups. That is, an African-American Studies 
program is likely to be created in a university when other similar schools create programs. 
Conversations with African-American Studies instructors corroborate this finding. In recent 
interviews with professors at research universities, | was told about the “Harvard effect,” the 
tendency of university administrators to support African-American Studies after Harvard 
University invested money in its Department of African-American Studies. 

The mimicry effect is notable but small when compared to the estimated protest effect. In 
Model 4, a single incidence of non-disruptive protest multiplies program creation odds by 
4.075. That is, a single rally or demonstration quadruples the odds of an African-American 
Studies program creation. In contrast, a single program creation in a university's Carnegie 
category multiplies the odds by e"*' = 1.198, or about 20 percent. 

| also suggest that the effect of any kind of protest might increase when other universities 
create African-American Studies programs. Therefore, one would expect interactions 
between protests and the number of other universities that have created programs to have 
significant effects on future African-American Studies program creation. Table 4 presents 
interaction effects. Model 5 includes protest, mimetic variables and their interactions. Model 
6 includes protest, structural variables and their interactions. The parameter estimates are 
revealing. The protest-mimicry interactions (Model 5) are not significant. That is, universities 
are no more responsive to their peers when there is campus protest. In Model 6, the only 
variables that are significant are log-enrollments, curricular diversity, and the interaction 
between protest and curricular diversity. The interaction effect is negative. According to 
Model 6, the universities most likely to adopt African-American Studies programs are large 
and structurally differentiated. The coincidence of protest with curricular diversity decreases 
program creation in Model 6, which suggest that perhaps the largest schools respond 
negatively to protests. It is possible that administrators at schools with an extremely large 
number of academic programs might try to institute African-American Studies within existing 
programs as a quick response to protests. Future research can address this conjecture. 
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Table 3: Discrete Time Logistic Model of African-American Studies Program Creation Year T-1 





Variable 1 2 3 4 


Non-disruptive Protests 20020 1.582** ozo 1.398** 
(.500) (.523) (.516) (.523) 
Disruptive Protests Lelor 13300" .900 815 
(.469) (.530) (.563) (.553) 
All Program Creations 014 021 023 
(.017) (.018) (.018) 
Program Creations in Carnegie Group .248*** ube 180re- 
(.020) (.023) (.027) 
Program Creations in Region 043 045 .037 
(.056) (.060) (.061) 
Log-enrollments (85492 1.045*** 
(.186) (.217) 
% Of Black Students 309 -.019 
(.181) (1.430) 
Per Capita Endowments .001 .001 
(.001) (.001) 
Curricular Diversity 124* CS 
(.059) (.065) 
Public University -.492 
C2 it) 
Research University -.210 
(.423) 
Masters College .006 
(.392) 
Historically Black College 1.299 
(1.172) 
Constant -5.834°* -6.519*™ -Ij2eo | = 16.708" 
(.094) (.174) (1.454) (1.875) 
Log-likelinood -777.704 447.444 -408.866 -406.061 
Degrees of Freedom 2 5 9 13 
iMG 44.50 278.07 352.20 357.81 
R 027 237 301 305 
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Table 4: Estimates of Protest Interaction Effects 


Variable 
# of Non-Disruptive Protests 


# of Disruptive Protests 

All Program Creations, Year T-1 

Program Creations in Carnegie Category, Year T-1 
Program Creations in Region, Year T-1 

All Program Creations x 

All Protests 

Program Creations in Carnegie Category x 
All Protests 

Program Creations in Region x 

All Protests 

Log-enrollments 

% of Black Students 

Per Capita Endowments 

Curricular Diversity 

Log-enrollments x All Protest 

% Black x Protest 


Endowments x Protest 


Curricular Diversity x Protest 


Constant 


Log-likelihood 
Degrees of Freedom 


R’ 


*p<.05 *p<.0l **p<.001 





5 6 

2.749%* 1.504 
(.829) (4.921) 

2.328" AA0 
(878) (5.313) 
012 


(.017) 

012** 

(.017) 

056 

(.057) 

030 

(.135) 

-.042 

(.087) 

-.498 

(.588) 
356** 
(134) 
-036 
(.644) 
001 
(.001) 
345*** 
(.053) 
813 
(594) 
4.396 
(11.627) 
033 
(.022) 
-.606** 
(.193) 

-6.559"* -12.020"** 

174 891 

-454.034  -666.292 

8 10 

282.630 249.080 

237 158 
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The structural features of universities have mixed effects. Some, like log-enrollments, have 
predictable positive effects. In Model 4, the multiplicative effect of log-enrollments on the 
odds of program creation is 1.046. The odds of program creation are nearly tripled (e' = 
2.86) when log-enrollments increase by one. Similarly, curricular diversity has positive effects 
on program creation. For each additional program of study, the odds of program creation are 
multiplied by e™ = 1.143. A university with fifteen distinct courses of study in the humanities 
and social sciences multiplies the odds by e8** = e'®*"* = 7.463. Larger, intellectually diverse 
institutions are the ones most likely to create new, interdisciplinary academic programs. 
Interestingly, after control variables are included, university endowments are not a significant 
predictor of African-American Studies program creation. 

Institutional control variables, such as public ownership, do not have significant effects 
on African-American Studies program creation. This is expected because the structural 
variables describe different institutional types. For example, public universities tend to offer 
more courses of study and enroll more students. Similarly, historically black colleges, by 
definition, tend to have mostly black student populations. 

Surprisingly, the proportion of students who are black does not have a significant effect on 
African-American Studies program creation. | found this to be the case in Model 3, which 
does not control for the type of institution. Black enrollment does not have a significant effect 
when the historically black college variable is omitted from the analysis. 

The model estimates support the following statements about the limited diffusion of 
African-American Studies programs. (1) Protest has a positive effect on program creation. (2) 
Only non-disruptive protest has a significant effect when control variables are included in the 
analysis. (3) There are very specific mimicry effects. Universities do not follow national trends; 
they belong to peer groups, as defined by the Carnegie classification. In the analyses, the 
creation of an African-American Studies program only increases the likelihood that other 
universities create a program when both universities belong to the same Carnegie group. This 
is consistent with the prior research on diffusion processes that | cited above. Mimicry was 
limited; only 120 of the approximately 1,400 institutions in the analysis offered African- 
American Studies. The mimicry effects are consistent with Soule’s (1997) research showing 
that protest tactics diffused within similar schools. (4) Two structural features of universities 
stand out as significant predictors of change: size and internal differentiation as measured 
by the number of majors offered by the university. (5) The ethnic composition of the student 
body is not a statistically significant predictor of African-American Studies program creation. 
(6) There is no support for the hypothesis that protest has interactions with any of the other 
variables in the analysis, except for curricular diversity. (7) The model that captures the most 
variance includes protest, mimicry variables and control variables. (8) Protest effects are larger 
than mimicry effects and most control variable effects in the estimated models. 


- Discussion and Conclusion 


Through analyses of data from American institutions of higher education, | found that only 
non-disruptive protest encourages the creation of African-American Studies programs. 
Diagram 2 illustrates the relationship between protest and organizational change found in the 
data on African-American Studies programs. | also found that the number of African-American 
Studies programs created within a Carnegie category has an effect on the creation of other 
African-American Studies programs. The empirical analysis depicts the spread of African- 
American Studies as a limited diffusion process driven by protest, made possible by a 
university's size and internal complexity, and propagated by mimetic diffusion within 
organizational peer groups. 
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While much has been learned from prior research on social movements and organizations, 
the effect of protest, mimicry and the internal structure of organizations have rarely been 


Figure 2. Effect of Protest Tactics on Organizational Change 


Non-disruptive Protest 


Probability of Organizational Change 


No Protest Disruptive Protest 





Types of Protest 


simultaneously assessed. For example, Tolbert and Zucker's (1983) study of municipal reform 
did not include measurements of protest activities, although they did estimate the effects of 
organizational size. Davis and Thompson's (1994: 158-165) examination of shareholder politics 
did not discuss specific movement tactics. Lounsbury's (2001) study of recycling programs 
among universities did not include measures of protest. This study expands on prior research 
by estimating the effects of measured political activity, organizational structure and mimicry. 

The findings of this study suggest that controversial organizational change is driven 
primarily by protest, although other factors are important, and that the protest tactics examined 
in this study are not sufficient to initiate widespread adoption of a movement's demands. 
Despite all the efforts on behalf of African-American Studies, programs exist today in only 9 
percent of institutions of higher education that award at least a bachelor’s degree. Most 
programs were founded in the years 1968-1973, with only a handful of programs created in 
other years. The protest effect is larger than the mimicry effect, which strongly suggests that 
African-American Studies never became a “taken-for-granted" feature of the American 
universities. If this did happen, protests would have been followed by the wide scale adoption 
of African-American Studies, as universities strived to be legitimate by copying their peers. It 
is likely that an interdisciplinary field focused on an American minority was inconsistent with 
the political culture that legitimizes the university. Unlike the Civil Rights movement in general, 
the African-American Studies movement failed to change the discourse within higher 
education so that “illegitimate” academic practices could be fully institutionalized. 

The findings raise important questions about how bureaucracies respond to protest. First, 
why does disruptive protest not have significant effects when structural variables and 
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institutional-type variables are included in the logistic regression model? Social movement 
theories suggest that both disruptive and passive protest should be effective, although for 
different reasons. The results presented in Table 3 point to a subtler story, in which some 
kinds of protest matter and others do not. 

It is possible that any activity depriving sympathetic bureaucratic insiders of their ability to 
advocate on behalf of a social movement inhibits organizational change. In her study of 
curricular reform movements, Amy Binder (2000, 2002) argues that bureaucratic insiders — not 
elected officials — are in the best position to enact reforms and structural change in 
organizations. Not only do insiders have knowledge of the daily workings of a bureaucracy, 
they have a good understanding of an organization's culture and wider political context. 
Insiders know which arguments will work and which will be rejected by power holders. 

In a separate analysis of black student protest and administrative response at various 
California State Colleges (Rojas 2007), | found that disruptive protests were often followed 
by state intervention, which created turmoil among college administrators (Orrick 1970). The 
State Board of Trustees would fire administrators they felt were not doing enough to maintain 
order. At least three times in the late 1960s, the California legislature passed statutes that 
redefined the power and responsibilities of college presidents. During the Third World Strike 
in 1968-1969, the San Francisco State College Board of Trustees interfered with the daily 
workings of the university. Trustees, legislative leaders and Governor Ronald Reagan thought 
that college administrators were not enforcing disciplinary standards on campus or that 
students presented a credible threat to shut down the campus that had to be countered. 

State intervention reduced the discretion of sympathetic administrators. Disruptive 
student protest forced administrators to worry about disciplinary policies, police actions and 
political intervention rather than curricular reform. In my examination of internal reports from 
the California State College system in the late 1960s, | found multiple discussions of 
disciplinary procedures and the administration's attempts to negotiate with the state 
legislature and Governor Reagan, but few discussions of curricular reform. This obsession 
with discipline and public image was justified. At San Francisco State College in the 1967- 
1968 academic year alone, two college presidents were fired by the board of trustees 
because they were thought to be incapable of imposing order on their campuses. 

Protest undermines the ability of administrators to project an image of authority, and it is 
authority which gives administrators opportunities to convert initially illegitimate movement 
demands into legitimate organizational routines (Elsbach and Sutton 1992). Administrators in 
public universities must obtain the support of political actors who are interested in projecting 
a “law and order” image. Similarly, private school administrators want to reassure donors and 
other constituents that they are in control of the campus. 

The empirical analysis raises a second question: why isn't there a statistically significant 
effect for black student enrollment? My hypothesis is that protest in favor of Black Studies and 
related issues is strongly correlated with the presence of well organized student groups. On 
the campuses | studied, black student protests were organized by former Student Nonviolent 
Coordinating Committee activists and Black Panthers. “Black Student Unions” were created. 
Students would hold “rap sessions” at student union meetings where they would discuss 
grievances and develop ideas for university reforms. Older students with experience in the Civil 
Rights movement would teach younger students effective protest tactics. Without the formal 
structure of the Black Student Union, it would have been less likely for black students to 
develop the sense of solidarity needed for collective action. There would not have been a 
forum in which movement strategies could be discussed, planned, practiced and developed. 

Therefore, it is not the sheer numbers of the potential constituency of the African-American 
Studies program but the extent to which a mobilized constituency demands this change. This 
theory explains the lack of an effect of the black student population on African-American Studies 
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program creation. The theory also explains the wide variety of campuses where African- 
American Studies were implemented. For example, some schools, such as Cornell University 
(Downs 1999) and San Francisco State College (Orrick 1970), had relatively small black student 
populations, sometimes less than 10 percent. Other colleges, such as the historically black 
Howard University (Myles 1970), also experienced protests for African-American Studies. 
Schools with black student protest and successful African-American Studies programs have in 
common black student unions, SNCC chapters and-other campus organizations. 

The presence of black student unions and former civil rights activists on campuses with 
departments of African-American Studies suggests that political constituencies within organizations 
may matter most when they are integrated into a broader protest movement. Movement activists 
with political experience know how to effectively reframe the organization's activities and create new 
understandings of what the organization should do. Formal organization brings together interested 
individuals and makes communication and coordination possible. Therefore, it is not the 
constituency that matters, but whether the constituency is organized and mobilized. 

This study shows that protest is the crucial factor behind the creation of African-American 
Studies programs. It also demonstrates that the type of protest matters: only non-disruptive 
protest has an effect. Also, protest is not needed everywhere because mimicry occurs. This 
study showed that the spread of African-American Studies programs involves both political 
and mimetic processes. Future research can address new questions raised by this study. 
Replications of this study can determine if the differing protest effects exist in other 
situations. It is possible that the dynamics of the African-American Studies movement are 
unique. Examinations of other academic reform movements might find that there is little 
difference between tactics. Other research should investigate the interactions between 
demonstrators and administrators as the later try to retain their authority. This study suggests 
that movement scholars interested in outcomes should pay attention not only to mobilization 
processes and movement strategies, but also to the effects of movement tactics on the 
cultural and political underpinnings of a targeted organization. 
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Abstract 

We investigate how the distribution of AIDS cases in the United States increased 
homophobia over time. Using pooled GSS surveys, 1973-1998, with state-reported AIDS 
cases, we estimate models for variation in homophobic attitudes as a function of the time- 
specific proportion of AIDS cases in a given state. AIDS incidence had a negative impact on 
civil rights attitudes during the period 1981-1998, but demonstrated an impact only on 
morality attitudes from 1986 to 1991. 


Introduction 


From early 1979 through the end of 1982, the number of diagnosed AIDS cases in the United 
States rose from 0 to 900. By the end of 1983, there were 3,000 cases. In July of 1985, when 
it became public knowledge that Rock Hudson, a movie star known for classic good looks 
and macho roles, was afflicted with the disease, the total number of U.S. AIDS cases, 
including more than 6,000 persons who had already died, exceeded 12,000 (Shilts 1987). 

Other than its sheer virulence, there are several characteristics of AIDS that make it a 
unique disease. First, in its earliest manifestations it affected two stigmatized sub-populations 
that were marginal to mainstream America: injection drug users and male homosexuals. 
Neither of these groups drew much coverage in the mainstream press and a disease that 
affected them alone did not draw much media attention (Kinsella 1989). As a result, for many 
months, most Americans were totally unaware of the disease. Second, from an 
epidemiological perspective, the spread of AIDS was comparatively slow. Unlike flu or 
cholera, which can inundate large areas in a short period of time, a long latency period ensued 
— years in some cases — between infection with the HIV virus and outwardly discernable 
symptoms. Finally, unlike many other diseases, AIDS manifested itself in several different 
ways, ranging from Kaposi's sarcoma to various opportunistic infections, all of which 
eventually were recognized to be the result of a compromised immune system. Thus, AIDS 
moved very slowly into the collective consciousness of the nation and into the forefront of 
medical research. 

As awareness of the disease increased, but commensurate knowledge of its origins and 
prevention did not, the public became extremely fearful of persons infected with the HIV/AIDS 
virus (Ron and Rogers 1989). By mid-1983 medical personnel and police officers in San 
Francisco were routinely wearing rubber gloves and facemasks when dealing with persons 
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who might be infected. School systems around the country worried about how to handle 
children who might be infected. Right wing attacks on gays began to appear with increasing 
frequency in the press and elsewhere (Shilts 1987). This antigay harassment included 
references to AIDS, clearly linking it to anti-gay violence (Herek 1997). 

Gay men, particularly in New York and San Francisco, where enormous strides had been 
made in the area of civil rights and general acceptance, began to seriously worry about a 
major backlash. The AIDS epidemic, particularly in its early stages became synonymous with 
male homosexuality, and the association was such that the media labeled it the “gay plague.” 
In this paper, we ask, did the onset of the HIV/AIDS epidemic cause a backlash against gay 
men? Because of the relative slowness with which the epidemic moved, we are afforded a 
rare opportunity to study how attitudes change in the face of an unprecedented threat to 
public health, one which was overlaid with strong sexual connotations. 

We use data from the 1973 through 1998 General Social Survey (GSS) in conjunction with 
time-specific state-level AIDS incidence data to answer this question. We estimate multilevel 
models to investigate the extent to which attitudes at the national level changed over time in 
response to the spread of the AIDS epidemic net of cohort replacement, general period and 
compositional effects." 


Background 


AIDS evokes stigma, and thus, negative attitudes, in three ways: (1) it is contracted through 
voluntary and disapproved behaviors — behaviors that are already stigmatized (e.g., 
homosexual sex) (2) it is contagious and puts others at risk, and (3) is incurable, meaning fatal 
at worst and chronic at best (Callen 1990; Herek 1999; Rosenberg 1989). Herek (1999) defines 
AlDS-related stigma as prejudice against people believed to have it, expressed through 
ostracism, discrimination and legislation that deprives persons with AIDS of basic human 
rights. Thus, the association of AIDS with a stigmatized group has an impact on attitudes 
towards persons with AIDS and the persons most likely to contract AIDS — gay men. 

The potential backlash, mentioned earlier, resulted from this association of HIV/AIDS with 
gay men. Based on the combined stigma of homosexuality and HIV/AIDS we would expect 
attitudes towards homosexuals, in general, to become more negative. Instead, Yang (2000) 
found a tremendous increase between 1992 and 2000 in support for civil rights issues 
including gay adoption, gays in the military and inclusion of gays in anti-discrimination 
legislation, but he also found that gays were still the least liked of all social groups in 2000. 
Likewise, a recent study by Loftus (2001) analyzed trends in attitudes about both civil rights 
and morality of homosexuals from the mid 1970s to mid 1990s. She found that American 
attitudes towards homosexuals were more tolerant where civil rights were concerned than 
when morality was the issue. Both sets of results suggest the need to model separately 
attitudes towards civil rights and morality regarding homosexuals. 

Did AlDS-related stigma have the same impact on attitudes toward homosexuals over 
time? At the onset of AIDS, the early period, the primary victims were gay men and injection 
drug users, and there was little information in the media concerning the disease. The majority 
of the public was aware of AIDS, but most were misinformed as to its transmission. As time 
passed, AIDS diffused into the more mainstream population; there was extensive media 
coverage, active official interest in AIDS education, and better information regarding the 
transmission of AIDS (Rogers, et al. 1993; Singer et al. 1987; Kinsella 1989; Gagnon and Nardi 
1997). By the 1990s, AIDS had become institutionalized, and incorporated into mainstream 
society. Gagnon and Nardi argue, “...it [AIDS] has changed from an acute disease, ill 
understood and without a specialized and responsive medical and social infrastructure, to a 
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‘chronic’ disease with a range of stable social practices at the individual and personal network 
levels and from local communities to the highest levels of the federal government.” (1997: 3) 
Herek (1997) asked a variation of our research question: did the AIDS epidemic cause 
prejudice and hostility or was ita mechanism through which Americans could enact their pre- 
existing prejudices? Herek concluded that the AIDS epidemic did not cause negative 
attitudes, but simply focused pre-existing homophobia. However, Herek did not decompose 
trends in the survey items he used. Thus, the effect of AIDS may be confounded by other 
opposing trends such as cohort effects, period effects and/or compositional effects. 

Research on attitude change is often limited to assessing change through cohort, age and/or 
period effects because the data used are cross sectional time series data (Stouffer 1955; Davis 
1975; 1979; Smith 1978; 1980). Cohort effects are the common historical experiences of a group 
born in the same time period, and cohort replacement effects are due to the replacement of older 
cohorts by younger, better-educated cohorts over time. Period effects are social changes that 
occur during specific historical periods that affect all cohorts (Firebaugh 1997). Thus in order to 
unpack trends in attitudes towards homosexuals to determine the impact of HIV/AIDS, we must 
separate out cohort replacement effects, general period effects and compositional effects. 

Prior research on attitudes toward homosexuality shows that there are indeed 
compositional effects in attitudes towards homosexuals. Males are less tolerant of 
homosexuality than females, but blacks and whites do not hold significantly different attitudes 
(Glenn and Weaver 1979; Herek 1988; Loftus 2001; Smith 1985; 1992). Smith (1992) found 
that residents of large metropolitan areas are more tolerant than residents of rural areas. 
Individuals from a fundamentalist Protestant religion, other Protestants and Catholics have 
been found to be less tolerant than Jews, agnostics and atheists, and those with conservative 
political identity tend to be less tolerant than those with liberal political leanings. Furthermore, 
single and divorced individuals tend to be more tolerant than their married or widowed 
counterparts (Herek 1988; Loftus 2001; Smith 1992). Therefore, we will control for 
compositional and demographic effects in our analysis. 

In sum, we place our analysis squarely within the cohort replacement-period effect 
framework, decomposing cohort effects, period effects and the effect of AIDS over the 
course of the AIDS epidemic. We distinguish moralistic attitudes from civil rights attitudes, 
and we control for demographic differences in our models. We are interested in examining 
how attitudes towards homosexuals changed at the onset of the AIDS epidemics, and over 
the course of the epidemic. How did attitudes change as knowledge of AIDS transmission 
improved, and as the AIDS epidemic spread to non-stigmatized groups? By treating state 
level data on AIDS prevalence as a macro-level variable in a multilevel model, we are able to 
investigate how attitudes changed as a function of events at a somewhat more local level, 
that is, how attitudes changed as the epidemic spread from the urban and coastal areas 
across the country. 


Data and Methods 


We draw on two sources of national data — the General Social Survey (GSS), which asks 
questions regarding the morality of homosexuality and civil rights of homosexuals, and 
Centers for Disease Control (CDC) data on the number of AIDS cases by state from 1981 to 
the present. The GSS is a geographically stratified probability sample of U.S. adults. Data 
from the GSS are from the 1973 through 1998 surveys, however, the GSS was not carried 
out in some years (1972, 1975, 1978, 1983 and 1986). The total number of cases available 
for this analysis is in excess of 22,000. 
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Dependent Variables 


The GSS contains three items that speak to the issue of tolerance for the rights of male 
homosexuals: should an admitted homosexual be allowed to (1) deliver public speeches, (2) 
teach in a college or university, or (3) have his book favoring homosexuality be removed from 
the public library. The last item, having a book removed from the library, was reverse coded 
before creating the scale. We combined these items into a single scale by summing them 
(Cronbach's alpha = .83). The scale ranges from zero for someone who answered “no” to all 
three items to three for someone who answered, “yes” to all three; thus higher scores 
indicated more tolerance for the civil rights of homosexuals. For about 3 percent of the 
respondents, who answered only two of the three items, we did a simple imputation of the 
third based on the mean of the remaining two. An additional 16 percent who answered none 
or only one of the items were eliminated. Across all years, 22 percent of individuals on 
average said “no” and 51 percent said “yes” to all three civil rights attitudes. 

The GSS also asks a question on the morality of the homosexual sex act. It asks: “What about 
sexual relations between two adults of the same sex — do you think it is always wrong, almost 
always wrong, wrong only sometimes or not wrong at all?” The responses range from 0, “always 
wrong’ to 3, “not wrong at all,” which is consistent with the coding of the civil rights measure. 
Across all years, 71 percent of individuals on average said sexual relations were “always wrong” 
and 17 percent said sexual relations were “always right” to the morality attitude question.” 

A benefit of using these items is that they have been replicated across many waves of 
data. The civil rights questions are somewhat outdated and perhaps some of the steep 
decline in civil rights attitudes might reflect this. For example, Yang (2000) showed increasing 
support for gays in the military, gay adoption and gay marriage throughout the 1990s. These 
items, however, do not show the high levels of tolerance that the civil rights items 
demonstrate. Our analysis might be improved if these somewhat more timely items 
regarding attitudes towards homosexuals were used; however, trend analysis would be 
limited using such items, and we are interested in long-term trend analysis. The morality 
attitude question does not specify the sex of the homosexual. AIDS differentially affects male 
and female homosexuals, and thus its impact on attitudes towards homosexuals might be 
confounded in some way. We feel the benefits of these items far outweigh their limitations. 


Independent Variables 


AIDS 

The CDC publishes a yearly report on the status of HIV/AIDS in the United States and its 
territories. Since 1986, the number of AIDS cases has been identified by state for the current 
and preceding year.* Prior to 1986, the CDC reported cumulative state-specific AIDS statistics 
only for some states. These cumulative totals had to be broken out and allocated by year for 
those few states the CDC specified, and the national cumulative totals needed to be allocated 
ona yearly basis to the remaining states. We used two different allocation methods for the 
four-year period, and found that the results did not vary based on the allocation method used. 
A detailed explanation of our allocation method is available on request. The level of AIDS ina 
given state was measured as a proportion of total cases in the United States for each year, 
rather than as a rate of incidence per 100,000 in each state.4 In addition, as the GSS was 
administered in the spring and AIDS data were distributed at the end of the year, we assumed 
a time delay of the impact of AIDS on attitudes, and therefore used AIDS incidence from the 
prior year. The average proportion of the total AIDS cases in each of 45 states from 1981 until 
1999 was .056 with a standard deviation of .088. 
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Time 

Recall that we have data from 1973 to 1998. Earlier, we discussed how the impact of AIDS on 
Americans has changed over time. Based on this, we think it is useful to recognize four 
distinct periods with respect to the spread of AIDS, media coverage of AIDS, official 
responses to AIDS and public knowledge of the AIDS epidemic. The first period is pre-AIDS, 
which is the time under study up until 1981 when the disease soon to be known as AIDS was 
recognized. The second period, from 1981 to 1986, we label the “early period.” The third 
period, which we label “diffusion period,” extends from 1986 to 1991. The fourth period, from 
1992 on, which we use as our reference category, we labeled the “incorporation period.” We 
expect AIDS to have a strong negative impact on attitudes in the “early period,” a smaller, 
negative effect in the “diffusion period,” and no effect in the “incorporation period.” 


Cohorts 

We created seven birth cohorts based on 10-year intervals. An eighth birth cohort, the 
youngest, is based on a 12-year birth cohort, because the last birth cohort does not age into 
the study until 1993, and it is not practical to use it separately. The exact years included in all 
birth cohorts can be found in Table 1. The oldest cohort is the reference category. 


Demographics 

We include a number of demographic and compositional variables in our models, based on prior 
_research. Sex, race, marital status, religiosity, political leanings and urbanicity are shown to have 
an impact on attitudes towards homosexuals and are included as controls in the models. 


Analysis 


The basic logic of the analysis will be to model individual level responses to the survey items 
as a function of individual characteristics and the (time-varying) prevalence of AIDS within each 
state. We use ordered logistic regression in which we assume that the categories are ordered, 
but the distances between them are not necessarily equal. To handle the multilevel nature of 
our analysis, we used HLM v5.05 (Raudenbush et al. 2000) restricted maximum likelihood 
procedures for ordinal outcomes. HLM allows for two-level, nonlinear person-specific random 
intercept models, but does not provide population averaged results. We will report the latter 
based on a formula found in Diggle et al. (1994). The model can be expressed as: 


ij 


ier 
ij 





log =a,+B,X, +4, 


where, each pj; corresponds to a cumulative dichotomization of tolerance toward homosexuality 
where respondent /is in the" category or lower, By Xj is a vector of explanatory variables and 
coefficients that predict the probability of being in a lower or less tolerant attitudinal category 
and Us represents the random effect associated with states.° A negative coefficient can be 
interpreted as decreasing the probability of being in the lower category (least tolerant) while a 
positive coefficient increases the probability of being in the lower category. 


Results 


Table 1 shows the logged coefficients, standard errors and odds ratios for models 
predicting attitudes towards the civil rights of homosexuals and morality of homosexuality. 
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Both models control for sex, race, marital status, urbanicity, religion and political 
preferences although the results are not shown in Table 1. Different models were needed 
to handle the differences in average attitudes towards morality and civil rights attitudes. 
For the morality attitude but not the civil rights attitude, the effect of AIDS incidence 
interacted with time. For both outcomes, cohort membership did not interact with either 
time or AIDS incidence. We will first discuss the results for changes in attitudes towards 
the civil rights of homosexuals and then the changes in attitudes towards the morality of 


homosexual behavior. 


Table 1: Population Averaged Results for Models of Morality and Civil Rights Attitudes* 


Civil Rights of Morality of 
Homosexuals Homosexuality 
Std. Odds Std. Odds 
Estimate Error Ratio Estimate Error Ratio 
Intercept -588"* = 132 1534279 192 
Threshold 2 132 ane018 ott) 82010 
Threshold 3 RES orn 01S B46"* M022 
AIDS Incidence 139% 275 ~=—-2.094 -.105 .460 .900 
TIME PERIOD 
Pre-AIDS (1973-1980) 378 =—.054 ~—s: 1.460 .079 .063 1.083 
Early (1981-1985) aod napeO52 wiht424 .362** 069 1.437 
Diffusing (1986-1991) 258 > «049 -1:295 512-0) ..060 14 
Incorporation (Post-1991) 
Reference 1.000 1.000 
INTERACTIONS 
AIDS Incidence (Early) .196 461 di2tZ, 
AIDS Incidence (Diffusing) 1.278* 466 3.590 
COHORTS (Birth Years) 
1 ref. (1894-1903) 1.000 1.000 
2 (1904-1913) = 533" ~=.085 587 -.515* 194 598 
S (1914-1923) =/982" e100 394 -.697** = 185 498 
4 (1924-1933)  -1.140**  .093 320 LOT Ee e152 362 
5 (1934-1943)  -1.397** 085 .092 “Wwid0™weet52 320 
6 (1944-1953)  -1.705** .089 182 -1.431* 164 239 
7 (1954-1963)  -1.625** .081 197 -1.364" = .153 256 
8 (1964-1975)  -1.633** 093 195 21510" aired 65 .208 
Level 2 Variance Oss .159* 


* All models control for sex, race, marital status, religious affiliation, political identity and 


urbanicity of residence. 
7 pi. 05s pi<=-000 
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Civil Rights 


Contrary to our expectations that AIDS would have an impact only in the early period and the 
diffusing period, AIDS incidence has a constant and negative effect on attitudes towards the 
civil rights of homosexuals across a// time points. The effect of AIDS is to significantly 
increase the probability of being intolerant, with a 1 percent increase in AIDS resulting in a 
doubling of the odds of being intolerant (O.R. = 2). This constant negative impact is masked 
by a general period effect, however, that shows increasing tolerance in attitudes over time. 
Compared to attitudes in the incorporation period (post 1991), individuals are 46 percent more 
likely to be intolerant in the pre-AIDS period, 42 percent more likely to be intolerant in the early 
period, and 30 percent more likely to be intolerant in the diffusing period. Figure 1 shows the 
estimated probabilities of being in the least tolerant civil rights category over time and AIDS 
incidence for Cohort 5. As this figure shows, there is virtually no change in the estimated 
probability of being in the least tolerant category of civil rights attitudes prior to 1987 due to 
the opposing effects of AIDS incidence and time. After 1987, the general period effect of time 
begins to outweigh the effect of AIDS incidence, and we see a decline in the probability of 
holding intolerant attitudes towards homosexuals’ civil rights. 

There is also a cohort replacement effect in operation that countered the negative impact 
of AIDS incidence and increased tolerance in attitudes towards the civil rights of homosexuals. 
In Table 1, each successively younger cohort is more tolerant in their attitudes towards the civil 
rights of homosexuals. Cohort 2, the second oldest cohort, is 41 percent less likely to be 
intolerant compared to the oldest cohort and Cohort 8, the youngest cohort, is 80 percent less 
likely to be intolerant compared to the oldest cohort. Thus, as aging cohorts leave the sample 
and younger cohorts enter, a liberalizing cohort replacement effect also counteracts the 
constant and negative effect of AIDS on civil rights attitudes towards homosexuals. 


Morality Attitudes 


AIDS incidence interacts with time in the model predicting attitudes towards the morality of 
homosexuality. Recall that the incorporation period (post 1991) is the reference category; 
therefore, the main effect of AIDS incidence is the effect of AIDS at that time. There is no 
significant effect of AIDS incidence on morality attitudes in the incorporation period. There 
can be no effect of AIDS incidence prior to the onset of AIDS, so we model interactions of 
AIDS with time only for the early period and the diffusing period. Contrary to our expectations, 
there is no significant impact of AIDS incidence on attitudes in the early period. There is a 
significant effect of AIDS incidence in the diffusing period, however, with a 1 percent increase 
in AIDS incidence in the diffusion period tripling the odds of being intolerant (O.R. = 3.59) 
compared to respondents in the incorporation period. 

The general period effect also differs for attitudes towards morality compared to civil rights 
attitudes. For each successive period, individuals living in states with no AIDS incidence 
become increasingly intolerant in their attitudes towards the morality of homosexuality 
compared to individuals in the incorporation period. Compared to attitudes in the incorporation 
period (post 1991), individuals were 8 percent more likely to be intolerant, but not significantly, 
in the pre-AIDS period, 44 percent more likely to be intolerant in the early period, and 77 
percent more likely to be intolerant in the diffusing period. Figure 1 shows effects of AIDS 
incidence and general period effects on the estimated probability of being in the least tolerant 
category of morality attitudes for Cohort 5. The overall pattern of change over time for morality 
attitudes is to become increasingly intolerant in each successive time period until the 
incorporation period when there is a decrease to the pre-AIDS level of intolerance. 
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There is a cohort replacement effect that counters both the negative impact of AIDS and 
the negative general period effect of time, similar to the cohort effect for the civil rights 
attitude. Table 1 shows that each successively younger cohort is more tolerant in their 
attitudes towards the morality of homosexual relations. Cohort 2, the second oldest cohort, is 
40 percent less likely to be intolerant compared to the oldest cohort and Cohort 8, the 
youngest cohort, is 79 percent less likely to be intolerant compared to the oldest cohort. 


Demographic Controls 


Although not presented in Table 1, we included sex, race, marital status, religiosity, political 
leanings and urbanicity in our models. Our findings confirm earlier research findings on the 
associations between demographics and attitudes towards homosexuals. Males are more 
intolerant than females. Widowed and married persons are more intolerant than never married, 
separated or divorced persons. Whites and blacks are more tolerant than other races. Liberals 
are more tolerant than conservatives. Jews and atheists are more tolerant than other religious 
groups. Residents of urban areas are more tolerant than residents of rural areas. 


Discussion and Conclusion 


Gays and lesbians have made extensive gains in civil rights and general acceptance since the 
late 1960s, particularly in New York and California. In 1981, a new disease, later called AIDS, 
contracted primarily by two stigmatized subpopulations, homosexual men and intravenous drug 
users, evolved. Media attention, after announcing the discovery of the disease and labeling it the 
“gay plague,” was virtually nonexistent until 1986 (Kinsella 1989). The timing of AIDS, the lack 
of knowledge and extensive fear over its transmission led gay men to fear the occurrence of a 
public backlash. We asked whether the AIDS epidemic had a negative effect on American 
attitudes towards homosexuality. We expected that the spread of AIDS over time would cause 
attitudes to become more negative during the early period, just after the onset of AIDS, and the 
diffusing period, from 1986 to 1991, when members of the non-stigmatized population were 
increasingly contracting the disease. We did not expect AIDS to have an impact after society had 
adjusted to and incorporated the disease into everyday life (post 1991). We further supposed 
that attitudes could differ for the civil rights of homosexuals and the morality of the homosexual 
sex act because Americans are likely to support the civil rights of stigmatized groups while 
condemning the stigmatized behavior. We decomposed cohort effects and general period 
effects from AIDS period effects using a sample GSS data along with state specific AIDS 
incidence from the CDC for a period of 25 years to answer these questions. 

We found that gay men were right to be concerned about a backlash following the onset 
of the AIDS epidemic. Net of compositional effects, general period effects and cohort 
replacement effects, the disease significantly increased intolerance towards the civil rights of 
homosexuals across all time points, and it increased intolerance towards the morality of 
homosexuality during the diffusing period, 1986-1991. 

Consistent with our expectations, intolerance towards the morality of homosexuality 
increased in the diffusion period, a period of increased agitation by religious and political factions 
against AIDS prevention education campaigns. Poirier (1988) suggested that the religious right 
took advantage of the AIDS epidemic to renew a campaign against sexual permissiveness in the 
late 1980s, and that this backlash particularly targeted homosexuality. “The discourse against 
AIDS has become increasingly a moralistic condemnation of homosexuality, empowered by the 
doctrinal and biblical interpretations of sex and nature that are ancient in origin and, in Catholic 


2176 © Social Forces Volume 84, Number 4 * June 2006 


and fundamentalist churches, still extremely articulate,” Poirier stated (1988, 462). Therefore, the 
effect of AIDS on morality attitudes was particularly strong during the diffusion period. 

In the early period, AIDS incidence did not have a significant impact on morality attitudes 
over and above the increasingly intolerant general period effect of time. Individuals were 
becoming increasingly intolerant, though it was not tied directly to state-specific AIDS 
incidence. That the effect of AIDS was not significant during the early period could be due, 
in part, to low levels of reported AIDS cases in states with the greatest levels of intolerance, 
such as the Bible Belt in the South. 

What explains the constant negative effect of AIDS on civil rights attitudes? One possible 
explanation is that the AIDS epidemic made gay men much more visible in our society than 
they were previously. Because of the gay rights movement and the AIDS epidemic, gays have 
become more vocal and in some respects more active in their pursuits of many civil rights 
that previously were not available to them, such as partner benefits, protection in 
employment, opportunities to serve in the military and even marriage and child adoption 
rights. There could be members of the American public, not captured by the demographic 
and composition controls included in these models, whose attitudes were affected adversely 
by AIDS and who continue to be negative toward the increasing civil rights demands of gay 
men and women. In cohort vs. period effect models, age cannot be included as a predictor 
variable due to the relationships between age, cohort membership and time. AIDS may have 
had a negative impact on older individuals within each cohort. This would be a profitable area 
for future research, aS would research updating both the civil rights attitudes and morality 
measures to more current issues. 


Notes 


1. Figures on AIDS cases cited here and elsewhere in this paper come from various Centers 
for Disease Control (CDC) publications and from Shilts (1987). The remainder of this 
introduction draws heavily for both fact and inspiration from Shilts’ superb book. 


2. HIV/AIDS primarily affected gay men and not lesbians. One set of measures from the GSS 
focus on attitudes towards homosexual ma/es, but the morality question is ambiguous 
regarding the sex of the homosexual having relations. Given the invisibility of lesbians in 
general and lesbian sex in particular, this ambiguous question is most likely perceived as 
asking about male homosexual behavior. 


3. We would have preferred to use AIDS cases contracted through homosexual 
transmission, but the CDC does not break that information out by state. The majority of 
AIDS cases through 1988 were ae homosexual transmission either singly or as part 
of multiple risks. 


4. This is how the CDC reported AIDS cases from 1981 through 1987. Public understanding 
of the distribution of AIDS was based on each state's percentage of the total. If we were 
to create epidemiological incidence rates, we would be creating measures that do not 
accurately reflect the common understanding of the contraction and spread of AIDS 
across the United States. If we made rates, we would find that D.C. was by far the most 
stricken metropolitan area, not New York or San Francisco. 


5. Only 45 of the 50 states plus D.C. are represented in the sample across all years. We have 
chosen to model this variation as a random effect. 
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The Ties That Bind and Those That Don’t: 
Toward Reconciling Group Threat and Contact Theories of 
Prejudice 
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Abstract 

Does interracial/interethnic propinquity breed hostility or harmony? Group threat and 
contact theories generally answer hostility and harmony, respectively. I propose that a 
historically and culturally rooted racial/ethnic hierarchy differentially shapes whites’ present- 
day threat of, contact with, and ultimately, prejudice towards blacks, Hispanics and Asians. 
Because Hispanics and Asians have ascended in this hierarchy, they arouse less threat and 
have more comfortable interactions with whites. Results from multilevel models of 2000 
General Social Survey and Census data indicate that the real presence of blacks — not 
Hispanics or Asians — living near whites heightens whites’ prejudice. Moreover, whites who 
know Hispanics and Asians are less prejudiced towards them, but whites need to both know 
and feel close to blacks to experience reduced prejudice. Implications are discussed. 


If the problem of the 20" century was that of the color line (DuBois 1903), then the problem of 
the 21° century is that of increasingly numerous color lines. According to the U.S. Census 
(2004a, 2004b), Hispanics now number 40 million in the United States, surpassing blacks as 
the largest racial/ethnic group in the country. Closely following Hispanics and blacks are 
Asians, with a population of 12 million, which is more than a 50 percent increase since 1990. 

These trends raise an important question about current and future racial/ethnic relations in 
the United States. Does interracial/interethnic propinquity breed hostility or harmony? Since 
Park's (1950) famous race relations cycle, race/ethnicity scholars have wrestled with this 
question. Moreover, debates over propinquity and prejudice are central to other areas of 
sociology, as evidenced by research on intergroup contact and attitudes toward the homeless 
(Lee, Farrell and Link 2004), mentally ill (Desforges et al. 1991), elderly (Caspi 1984), 
homosexuals (Herek and Capitanio 1996), and people with AIDS and disabilities (Makas 1993; 
Werth and Lord 1992). Furthermore, this question has important implications for 
desegregation and affirmative action policies (Krysan 2000). 

Two theories of prejudice provide different answers to this question. Group threat theory 
suggests that the presence of a large minority population living near whites arouses 
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economic and/or political threat among whites, thus resulting in prejudice (Blalock 1967). In 
contrast, contact theory proposes that the presence of a large minority population living near 
whites may lead to more frequent interactions between minorities and whites (Sigelman, 
Welch, Combs and Bledsoe 1996). Depending on its quality among other things, this contact 
is associated with reduced prejudice (Allport [1954] 1979). 

Despite their contributions, group threat and contact theories have been criticized. One 
criticism of both theories is their almost complete focus on black-white race relations, 
assuming that threat and contact operate similarly across all racial/ethnic groups. Additionally, 
empirical studies largely test these theories in isolation from one another. Criticisms of 
research in the group threat tradition include its heavy reliance on measures of racial/ethnic 
composition as indicators of threat, which neglects economic and political measures that 
Blalock (1967) assumed were important. This research also uses measures of racial/ethnic 
composition at broad levels of analysis (Oliver and Mendelberg 2000). Moreover, this research 
does not measure or only indirectly measures perceptions of threat and the size of minority 
populations, which assume importance in contemporary variants of group threat theory (Bobo 
and Hutchings 1996; Glaser 2003). Contact theory is criticized because its numerous 
conditions of contact may make it unfalsifiable, and it is somewhat ambiguous regarding the 
quality of contact needed to reduce prejudice. The theory is also potentially prone to self- 
selection bias and reverse causality: contact may reduce prejudice, but prejudiced people 
may avoid contact (Pettigrew 1998). 

This paper attempts to theoretically and empirically address criticisms of group threat and 
contact theories, especially their focus on black-white race relations and assumption that threat 
and contact operate similarly across all racial/ethnic groups. Building on my previous work 
(Dixon and Rosenbaum 2004), | propose that a historically and culturally rooted racial/ethnic 
hierarchy differentially shapes whites’ present-day threat of, contact with, and ultimately, 
prejudice towards blacks, Hispanics and Asians. Because Hispanics and Asians have been 
“whitened” to some extent and have ascended in this hierarchy, they represent less of a threat 
and have more comfortable interactions with whites than blacks. | hypothesize that the real and 
perceived presence of blacks — not Hispanics or Asians — living near whites arouses prejudice. 
Moreover, whites who know Hispanics and Asians will be less prejudiced towards them, but 
whites need to know and fee/ close to blacks to experience reduced prejudice. 

Using descriptive, correlative, regression and multilevel modeling analyses on 2000 
General Social Survey (GSS) data linked to Census 2000 metropolitan area/county-level data, 
| establish the validity of these data and test these hypotheses. While the use of any cross- 
sectional data cannot unequivocally prove causality, these data improve upon prior research 
because they capture processes assumed to come causally prior to contact and measure 
different forms of contact. These data include multiple measures of “objective” threat at the 
metropolitan/county level, whites’ perceptions of the proportion of racial/ethnic minorities in 
their communities, and indicators of whether whites know or know and feel close to a 
member of these minority groups. 


Background 


Prejudice has traditionally been defined as “an antipathy based upon a faulty and inflexible 
generalization.” (Allport [1954] 1979: 9) Blumer (1958) adds that prejudice is historically and 
culturally rooted and consists of feelings that a minority group is inferior, different, alien and 
threatening to one’s own racial/ethnic group. Others argue that “symbolic racism” is a new 
form of anti-black prejudice expressed through values and attitudes, including opposition to 
busing and affirmative action policies (Kinder and Sears 1981). The focus of this paper Is on 


Reconciling Group Threat and Contact Theories + 2181 


prejudice, which departs from symbolic racism’s emphasis on anti-b/ack sentiment and policy 
attitudes. 

By more traditional definitions, prejudice against various minority groups has declined in 
the United States. In 1977, a national survey reported that 27 percent of whites believe the 
cause of black-white inequality is that blacks “have less in-born ability to learn.” Only 10 
percent believed that in 1996 (Schuman et al. 1997). Some suggest that this change merely 
reflects a change in cultural norms — that it is no longer appropriate to express such feelings 
(Sears 1988). 

Although most Americans no longer believe or admit to believing in the biological 
superiority of their racial group, they believe in stereotypes of racial/ethnic minorities as 
possessing inferior characteristics. In 1990 and 1994, 42 percent of Americans judged blacks 
to be “lazy.” (Gilens 1999) In 2000, 35 percent of whites surveyed judged blacks to be lazy; 
23 percent said the same for Hispanics; and 11 percent applied that label to Asians. (See 
Appendix A.) Despite historical fluctuations, blacks have generally faced and continue to face 
the most prejudice, followed by Hispanics and then Asians (Bogardus 1958; Smith and 
Dempsey 1983; Wilson 1996; Yancey 2003). 


Group Threat Theory 


According to group threat theory, a sizable minority population living near the dominant group 
leads to economic and/or political threat and ultimately prejudice among the dominant group 
(Blalock 1967). Supporting this hypothesis is research showing a relatively consistent 
relationship between the size of the black population and anti-black prejudice among whites 
(see, e.g., Fox 2004; Giles and Buckner 1993; Glaser 1994; Quillian 1996; Taylor 1998). 

Yet, several criticisms and questions plague group threat theory. First, its applicability 
appears to be primarily limited to present-day, black-white relations. Groups other than blacks 
— such as Europeans, Chinese and Japanese — threatened (native) whites when they 
immigrated (Bonacich 1972; Lieberson 1980; Olzak 1992). During the post-Reconstruction Era 
in the United States, though, the immigration of groups other than blacks increased prejudice 
and violence against blacks (Olzak 1992). Increasingly, too, groups other than blacks no longer 
threaten whites: Several studies indicate that there is not a positive relationship between the 
presence of a large Hispanic (Dixon and Rosenbaum 2004; Hood and Morris 1997; Fox 2004; 
see Huddy and Sears 1995 for an exception) or Asian population in a given area (Taylor 1998) 
and prejudice toward these minorities. Previous research suggests that only the presence of 
a large black population is associated with anti-black prejudice among whites today. Yet, even 
this research may be limited by its omission of measures of other perceptions and contact. 

But, what threat do racial/ethnic minorities pose? Blalock (1967) and others concentrated 
on “objective” economic and political threat. From this perspective, the presence of a 
racial/ethnic group is only the first step in the causal chain to prejudice. Subsequent steps 
include “real” interracial and interethnic conflict over economic and political resources (Bobo 
1983). With few exceptions (Oliver and Mendelberg 2000; Quillian 1996), contemporary 
research has not measured these other potential sources of threat. 

Moreover, exactly where is the threat located? Previous research has measured group threat 
as the racial/ethnic composition of the population in the country (McLaren 2003), census region 
(Quillian 1996), state (Fox 2004), and in metropolitan areas (MAs)/counties (Taylor 1998). However, 
given the large degree of segregation within these areas (Massey and Denton 1993), threat may 
operate differently across these areas than within them. Oliver and Mendelberg (2000) find that 
negative stereotypes of blacks are significantly greater in metropolitan areas — but not zip codes 
within these metropolitan areas — in which there are larger concentrations of blacks. Oliver and 
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Wong (2003) suggest that the explanation for this is because whites are more likely to have 
contact with blacks under ideal conditions at the local/neighborhood level. Jackman and Crane 
(1986: 474) offer some support for this, as they find that “sheer proximity to blacks [in the 
neighborhood] appears to be of little value, unless it is accompanied by personal contact.” 

To what extent is this threat felt? In some contemporary variants of this theory, perceptions 
assume central importance (Bobo and Hutchings 1996). Perceptions are viewed as the causal 
link between realistic threats from the environment and prejudice, or they are seen as having 
no basis in reality (Bobo 1983). Regardless, research in the group threat tradition usually does 
not measure or only indirectly measures perceptions of threat. When research has directly 
measured perceptions of threat, it has found that feelings of threat are significantly related 
to anti-black prejudice (Taylor 1998). Still, other perceptions may be important. 

Another criticism of research in the group threat tradition is that it either does not measure 
or it indirectly measures perceptions of the size of the minority populations (Glaser 2003). 
Research finds whites’ estimates of the black population, for example, are inaccurate: More 
than half of whites believe the nationwide black population is 30 percent or more, when it is 
12 to 13 percent (Nadeau, Niemi and Levine 1993; Sigelman and Niemi 2001). Gallagher 
(2003) argues that these perceptions are rooted in disproportionately negative media images 
of blacks and blacks’ “hypervisibility.” In fact, research indicates there is a linear relationship 
between hypothetical neighborhood racial/ethnic composition and prejudice, with some 
studies also suggesting “tipping points” — 20, 33 or 50 percent black — at which whites feel 
uncomfortable moving into or staying in a neighborhood (Bobo and Zubrinsky 1996; Farley 
et al. 1994). Given that this relationship does not tend to exist for Hispanics and Asians 
(Emerson, Yancey and Chai 2001), this suggests that whites’ perceptions of the racial/ethnic 
composition of their communities may fuel prejudice against blacks (Nadeau, Niemi and 
Levine 1993; Sigelman and Niemi 2001). However, this relationship may be conditional on ~ 
contact (Jackman and Crane 1986; Oliver and Mendelberg 2000; Oliver and Wong 2003). 

To address these criticisms and unanswered questions, the analyses below incorporate 
measures of the size of the black, Hispanic and Asian population at the metropolitan/county 
level, measures of racial/ethnic inequality at this level, and perceptions of the size of the black, 
Hispanic and Asian population in whites’ communities. 


Contact Theory 


From contact theory's perspective, a sizable minority population may represent opportunities 
for contact between minority and dominant group members (Blau 1994; Sigelman et al. 1996). 
By facilitating knowledge of minority groups, this contact may help dominant group members 
develop favorable views of minority groups. Yet, Allport ([1954] 1979) argues that the effects of 
contact on prejudice depend on the quality of contact, whether contact is voluntary or not, the 
extent to which contact is between dominant and minority members of “equal status,” and 
whether contact occurs in a competitive or collaborative environment, among other conditions 
(Dovidio, Gaertner and Kawakami 2008; Forbes 1997; Pettigrew 1998). 

Empirical support for contact theory is qualified. Friendships and some acquaintanceships 
among various racial/ethnic groups are related to a reduction of prejudice (Jackman and Crane 
1986; McLaren 2003; Powers and Ellison 1995; Sigelman and Welch 1993). Personal and 
impersonal contact in some interactive settings is associated with reduced prejudice against 
blacks (Yancey 1999) and Hispanics (Dixon and Rosenbaum 2004). Equal (socioeconomic) 
status contact is also associated with reduced prejudice against blacks (Jackman and Crane 
1986). Environment is important, too, as collaborative contact in the classroom is associated 
with reduced prejudice towards a variety of groups (Pettigrew 1998). Yet, contact is not 
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always related to a reduction of prejudice. To explain this, researchers often speculate on the 
less-than-optimal conditions under which contact occurs. 

Pettigrew (1998) contends that contact theory has been “overburdened” by these 
facilitating conditions. In fact, contact theory's numerous qualifiers and conditions appear to 
be primarily applicable to whites’ contact with blacks — the racial interaction that has been 
studied the most. While conditions such as those above are likely important for sustained, 
long-term behavioral change against blacks, at least a short-term reduction of prejudice 
against all groups may occur regardless of these conditions. This can be accomplished if 
contact facilitates knowledge of minority groups, helps dominant group members reassess 
their own group, forms affective ties, and changes behavior, according to Pettigrew (1998). 

The quality of contact encompasses all of the processes immediately above. At a single 
point in time, stronger ties such as friendships should be most associated with a reduction of 
prejudice precisely because they help dominant group members learn about other groups, 
reassess their own group, form affective ties, and ultimately change behavior. Weaker ties, 
such as acquaintanceships, should be somewhat less associated with a reduction of 
prejudice because they do not give one the opportunity to fully do this. The two qualitatively 
different measures of contact — one that parallels interracial/interethnic friendship and another 
that parallels acquaintanceships — that are used in the analyses below build upon these ideas, 
but they represent an improvement over research that does not measure contact directly or 
only uses measures of friendships. 

Nevertheless, contact theory and its empirical indicators are limited by their potential for 
self-selection bias and reverse causality. Contact may reduce prejudice, but prejudiced people 
may avoid contact. In general, though, the effects of contact are greater than self-selection 
bias (Pettigrew 1998; Powers and Ellison 1995). Survey researchers have dealt with the 
possibility of self-selection bias by using measures of contact that are not as intimate or 
voluntary as friendships but have the potential to build friendships (Jackman and Crane 1986; 
Sigelman et al. 1996). Moreover, some survey researchers have also shown that the effects of 
contact are greater than alternate bases of prejudice that may come casually prior to contact. 
For example, McLaren (2003) finds that interracial/interethnic friendships are negatively 
related to prejudice among Western Europeans, even after controlling for some “objective” 
and “subjective” threats that group threat theory specifies. 

The analyses below represent an attempt to parcel out potential effects of self-selection 
bias as they include “objective” measures of group threat, perceptions of the proportion of 
minorities in whites’ communities, and qualitatively different forms of contact. 


Toward Reconciling Group Threat and Contact Theories of Prejudice 


Some theoretical attempts to reconcile the effects of threat and contact on prejudice focus on 
collective, historical and cultural processes. Upon initial contact in history, according to 
Blumer (1958), dominant groups — those with power, prestige and skills — define minority 
groups’ positions in society, often creating negative stereotypes of them. In the United States, 
stereotypes of blacks as stupid, idle and morally inept date back to slavery (Drake 1987). 
Beliefs that Mexicans are lazy and have too many children became popular in the mid-19" 
century (Martinez 1994; Omi and Winant 1994). Notions of Asian inferiority coincided with 19" 
and 20" century anti-immigration movements and World War Il (Blauner 1972). While specific 
stereotypes change over time, a racial/ethnic hierarchy remains rooted in culture (Blumer 
1958). Blumer sees contemporary prejudice as a dominant group's reaction to threats that 
may rearrange groups’ positions in this hierarchy. Yet, Blumer (1958:6) believes this 
racial/ethnic hierarchy is so firmly rooted in history and culture that the dominant group 
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interprets it “not by generalizing from experiences gained in close, first-hand contacts, but 
through the transcending characterizations that are made of the group as an entity,” 
suggesting that contact will do little to reduce prejudice. 

Others disagree and see minority group cultural assimilation as an intermediate theoretical 
process between threat and contact. Park (1950) held that competition follows initial 
racial/ethnic contact, accommodation follows competition, and assimilation follows 
accommodation. As this cycle repeats, prejudice decreases. Gordon (1964) added that racial 
and ethnic minorities who have contact with dominant group members also have cultural 
attitudes similar to the dominant group. Yet, Moynihan and Glazer (1963) argued that it might 
not be possible for groups, such as blacks and Puerto Ricans, to culturally assimilate because 
of discrimination they face and their — especially blacks’ — hypervisibility. 

Synthesizing these insights suggests the existence of a historically and culturally rooted 
racial/ethnic hierarchy in which some minority groups may shift positions depending on the 
dominant group's perceptions of their cultural assimilation. Although the authors above would 
likely place whites at the top of this hierarchy, they do not fully recognize the fluidity of race 
across time and its bearing on groups’ positions in this hierarchy. 

“Whiteness"or an identity as well as an ideology through which “whites” judge 
subordinate groups’ level of cultural assimilation, has been conferred upon and actively 
pursued by groups that were once considered “other” races and once threatened whites. For 
the Irish, labor unions and the Democratic Party helped them assimilate and become “white” 
themselves around the turn of the 20" century (Ignatiev 1995). Italians became “white” partly 
because of their 20th-century entry into the middle class (Gans 1999). Educational and 
economic opportunities opened up by the G.I. Bill and the Federal Housing Authority helped 
Jews become “white.” (Brodkin 2000) 

Shades of whiteness have also been conferred upon and actively pursued by Hispanics 
and Asians, as some of these groups have been dubbed “honorary whites.” (Bonilla-Silva 
2004) Some members of the largest Hispanic group — Mexicans — were labeled “white” after 
the Mexican War of 1846 (Omi and Winant 1986). Hispanics are considered an ethnic group 
today and were never considered a “racial” group as a whole, albeit Mexicans were 
considered a different race on one census (Farley 2002). Facilitating the “whiteness” process 
for Asians is the perceived similarity between Asian and American values toward work and 
Asians’ postWorld War Il status as the “model minority.” (Steinberg 1989) As Gans (1999: 372) 
argues, “upward socioeconomic mobility and increasing intermarriage with whites may even 
end up in eliminating the boundary that now constructs them [Asians] as a separate race." 
Further, Yancey (2003) presents contemporary evidence indicating that Hispanics and Asians 
not only identify as whites,' but also identify with whites as judged by their sociopolitical 
attitudes (Bonilla-Silva 2004), 

By contrast, blacks have long been perceived as so physically and culturally different from 
whites to warrant a separate “racial” category both in the public mind and the legal sphere. 
Even beyond their enslavement, blacks’ historical experience has been qualitatively different: 
only blacks were consistently subjected to the infamous “one-drop rule” (Yancey 2003). Today, 
blacks’ attitudes — particularly their racial attitudes — differ from whites’: Because of 
discrimination, blacks do not believe in the “American dream” to the extent that whites do 
(Hochschild 1995); blacks are far more supportive of corrective racial policies than whites and, 
for that matter, Hispanics and Asians (see Krysan 2000; Yancey 2003). For their part, whites 
have historically expressed greater social distance toward blacks than toward most Hispanic 
or Asian groups, especially when forced to choose among the three (Smith and Dempsey 
1983). Today, whites feel less comfortable living around or marrying blacks as opposed to 
Hispanics or Asians (Bonilla-Silva 2004; Wilson 1996). Whites’ discomfort of living in close 
proximity to blacks is also reflected in the historically higher levels of black-white segregation 
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(Massey and Denton 1993). Consequently, some scholars argue that a “black/non-black” 
divide lies ahead, with blacks again at the bottom of the hierarchy (Gans 1999; Yancey 2003). 

Shades of whiteness bring privileges to a minority group, including limited ascendancy in 
the racial/ethnic hierarchy. With this ascendancy comes less threat from the dominant group. 
This is why the available contemporary evidence indicates that the presence of blacks — not 
Hispanics or Asians — threatens whites (Taylor 1998). While part of this threat may be 
economic or political, a larger part is likely cultural and social. The real or imagined presence 
of blacks evokes all kinds of negative cultural images, including the black welfare queen 
(Gilens 1999) and black crime (Emerson, Yancey and Chai 2001). These cultural images 
reinforce whites’ views that blacks are “beneath” and “not one of us.” As such, | hypothesize 
that the real and perceived presence of blacks is related to anti-black prejudice, but the real 
and perceived presence of Hispanics and Asians does not have the same effect on whites’ 
prejudice toward these groups. 

This racial/ethnic hierarchy and its cultural images of minorities influence contact and Its 
effects on prejudice. The framework and evidence above acknowledge self-selection bias, but 
suggest that the relationship between contact and prejudice is not entirely due to it. With their 
ascendancy in the racial/ethnic hierarchy, Hispanics and Asians have more comfortable . 
interactions with whites. | thus expect that merely knowing a Hispanic or Asian person is 
enough for whites to experience reduced prejudice, but knowing a black person is not. 
Moreover, | expect that whites who know and fee/ close to a member of all of these minority 
groups will be less prejudiced (Pettigrew 1998). 


Data 


To test these hypotheses, | use 2000 General Social Survey (GSS) data linked to Census 2000 
data. The GSS is one of the most widely used and well-respected social science data sources. 
The GSS is a multistage, stratified sample of the non-institutionalized, English-speaking adult 
population living in the United States (Davis, Smith and Marsden 2001). | limit the sample to 
non-Hispanic and non-Asian whites. After deleting missing values, the sample sizes are 861, 
801 and 771 for analyses of anti-black, anti-Hispanic and anti-Asian prejudice, respectively.’ 

With permission from the National Opinion Research Council (NORC), | am able to link 
respondents to the 100 metropolitan areas (MAs) and non-metropolitan counties from which 
they were sampled. (See Davis, Smith and Marsden 2001 for NORC's sampling strategy.) 
MA/county racial/ethnic concentration and economic inequality measures are based on 
Census 2000 data and compiled by the Lewis Mumford Center (accessed at: 
http://mumford1 .dyndns.org/ cen2000/ data.html), the Racial Residential Project (accessed at: 
http://enceladus.icpsr.umich.edu/ race/ racestart.asp), and American Fact Finder.‘ Individuals 
constitute one level of analysis; MAs/counties in which individuals live constitute the other. 
Variables measured at the MA/county are level-2 variables; individuals’ characteristics are 
level-1 variables. 


Dependent Variables 


Consistent with Allport's ([1954] 1979) description of prejudice as irrational generalizations 
about a group of people, | use the best available questions to measure prejudice, which is the 
object of concern for both group threat and contact theory. These questions have been used 
as measures of stereotypes or prejudice (e.g., Bobo and Zubrinsky 1996; Dixon and 
Rosenbaum 2004: Gilens 1999; Oliver and Mendelberg 2000; Oliver and Wong 2003). They 
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ask whites to rate, on a scale from 1 to 7, the degree to which blacks, Hispanics, Asians and 
whites are “unintelligent,” “lazy” and “lacking commitment to strong families.” Other 
questions, such as if racial/ethnic groups are unpatriotic or not, may better tap prejudice 
towards Hispanics or Asians (Wilson 1996), but were not asked in the 2000 GSS.* 

Because whites judge other groups in reference to themselves (Blumer 1958), | create a 
prejudice index by subtracting whites’ placement of each minority group from their placement 
of their own group, consistent with previous research (e.g., Taylor 1998; Wilson 1996). | sum 
these difference scores to construct prejudice scales ranging from -6 to 18 for anti-black 
prejudice, -8 to 13 for anti-Hispanic prejudice, and -10 to 16 for anti-Asian prejudice, with 
higher scores indicating greater prejudice and 0 indicating no prejudice.® 


Level 2 Independent Variables 


| follow some prior research by operationalizing group threat theory's central measure of 
threat as the percentage of the population that is black, Hispanic and Asian, respectively, at 
the MA/county level (Taylor 1998).’ | also include a measure of the ratio of minoritywhite per 
capita income in the MA/county as a proxy for economic threat.® | include logged population 
as a control. 


Level 1 Independent Variables 


To address criticisms that group threat research focuses on one level of analysis and does not 
account for perceptions, | include variables capturing respondents’ estimates of the percentage 
of their (self-defined) community population that is black, Hispanic and Asian, respectively. 

They are based on the following questions: ‘Just your best guess — what percentage of 
the people who live in your local community is ... (black, Hispanic, Asian)?” These variables 
are perceptions of group threat theory's central measure of threat, racial/ethnic composition. 
They are also potentially better controls for self-selection bias than whites’ neighborhood 
racial preferences because the questions ask about where whites live now rather than where 
they may want to live. By themselves, they are not measures of threat. More ideal measures 
of perceived threat or hypervisibility would combine these variables with data on actual 
racial/ethnic composition of whites’ communities. Yet, it is impossible to know how whites 
are defining their communities; additionally, GSS confidentiality requirements preclude further 
analyses. 

In line with Pettigrew’s argument (1998) that the quality of contact is important for reduction 
of prejudice, | use multiple and qualitatively different measures of contact. | use a dummy 
variable series to measure contact derived from two sets of questions in the GSS. The first set 
of questions asks whether respondents personally know any blacks, Hispanics and Asians. The 
second set of questions asks respondents who answered affirmatively to the first whether 
they feel close to any of the blacks, Hispanics and Asians they know. The reference group in 
each model consists of whites who do not personally know any members of the minority 
group under examination; another group in each model consists of whites who know but do 
not indicate feeling close to any members of the minority group under examination: the final 
group consists of whites who know and feel close to a member of the minority group under 
examination. While these measures may not be as valid as questions that ask about whites’ 
friendship circles and then ask whites to indicate whether or not their friends are black 
(Jackman and Crane 1986; Smith 1999), they improve on prior research. For example, prior 
reseach has employed proxies for contact or one form of contact such as friendship, both of 
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which may provide unreliable estimates of racial/ethnic contact. Moreover, the measures | use 
expand the coverage of interracial/interethnic friendships to Hispanics and Asians. 

Other level-1 variables include the usual controls in analyses of racial attitudes (Krysan 
2000): family income (logged using midpoints of categories),’° age (in years), education (in 
years), and sex (male = 1). Based on previous research, poorer, older and less educated white 
males should express the greatest amount of prejudice, at least toward blacks (Firebaugh and 
Davis 1988; Hughes and Tuch 2003; Quillian 1996). 


Analytic Strategy, Methods & Models 


Because some of the measures used in this research have been criticized or are new, their 
validity is assessed with descriptive analyses, followed by descriptive, correlative and OLS 
regression analyses on individual-level variables to preliminarily gauge the extent of self- 
selection bias. Finally, multilevel modeling using HLM shows if and how threat and contact 
are related to prejudice after all other variables are controlled. HLM is used in the latter 
analysis because it accounts for the dependence of individual observations and provides 
unbiased standard errors of level-2 variables (Raudenbush and Bryk 2002)."' 

Model equations are in the notes.” All of the coefficients except for the intercept are 
fixed."? Given that this is largely a fixed model, the interpretation of coefficients is relatively 
straightforward: Positive coefficients indicate greater prejudice in unit ;; negative coefficients 
represent less prejudice in unit ji bearing in mind that grand mean centering is used for 
continuous variables and binary variables are uncentered. 


Results 


Validity of Measures 


Table 1 displays descriptive statistics for selected variables of theoretical interest. First, we 
consider the measures of prejudice to assess their validity. If these measures are merely 
products of social desirability as some suggest, we would expect them to vary little. (See also 
Taylor 1998.) As Table 1 reveals, however, all of the prejudice scales vary a good deal, each 
with a standard deviation of more than 3. Moreover, the individual items included in each 
prejudice scale vary quite a bit. As Appendix A implies, the minimum variance for any one of 
these measures is 1.08 and the maximum is 1.80. 

The theoretical framework above assumes that whites should express the most prejudice 
towards blacks. Looking now at the mean prejudice scores in Table 1, we see that whites 
express greater prejudice towards blacks than toward Hispanics and Asians (2.24, 1.04 and - 
55, respectively). Consistent with other research (Wilson 1996; Yancey 2003), these 
differences are significant (p < .001). 

Finally, we turn to the measures of contact. If these measures merely reflect social 
desirability, we would expect overwhelming numbers of whites to say they know and feel 
close to a black person, for example, and for such estimates to be out of line with previous 
research. Looking at the descriptive statistics for these variables in Table 1, we see that 46 
percent of whites say they know and feel close to a black person. Previous surveys asking 
about blackwhite friendships, which are comparable to this measure, have found that 
anywhere from 31.5 to 71 percent of whites say they have black friends (Smith 1999). 
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Self-Selection Bias 


We now turn to the problem of self-selection bias. Within the framework presented above, if 
self-selection was the driving force shaping contact, we might expect whites to have the least 
contact with blacks. Yet, Table 1 reveals that with one exception, whites have the most 
contact with blacks, followed by Hispanics and then Asians. Aside from the similar 
percentage of whites who know blacks and Hispanics (43 and 44 percent, respectively), all 
other differences are statistically significant (p < .01). 

If self-selection was the driving force shaping contact, we might also expect a weaker 
relationship between contact and racial/ethnic composition than between contact and prejudice. 
Table 2 presents correlations that show that this is not necessarily so. Although Table 2 must 
be interpreted with some caution given the small number of individuals in some MSAs/counties, 
it shows that contact — as measured by the MA/county-level proportion of whites who know 
(including those who know and feel close to) a member of a targeted minority — is significantly 
structured by the racial/ethnic composition of the MA/county. (See also Blau 1994; Sigelman et 
al. 1996.) Table 2 also indicates that there is generally a stronger relationship between contact 
and racial/ethnic composition than between contact and prejudice. The correlations in the Asian 
model (r = .39 and -.52, respectively), however, do not fit this pattern likely because Asians are 
the group toward which the least prejudice has been and continues to be directed. 


Table 2: Pairwise Correlations Among MA/County-Level Measures of Contact,* Racial/Ethnic 
Composition and Prejudice, by Targeted Minority 


Proportion of Contact with Targeted Minority (in MA/County) 





Blacks Hispanics Asians 
Percent Targeted Minority 
(in MA/county) ee 43* a 
Anti-Minority Prejudice 
average of MA/count -.020 -.24* "oe 


Source: General Social Survey (2000) and Census (2000) 

Notes: **p<.01 *p <.05 (two-tailed tests). 

*The measure of contact used here is the MA/county-level proportion of 
whites who know — including those who say they feel close to — a black, 
Hispanic or Asian person, respectively. 


These findings argue against extreme self-selection bias. However, some self-selection 
bias may still be operating, particularly through age and other individual-level variables that 
might suppress the relationship between intimate contact and prejudice (Pettigrew 1998; 
Sigelman et al. 1996).'* If the framework and research above are correct, we would expect 
two things. First, whites who know and feel close to a member of a targeted minority should 
still be less prejudiced after accounting for self-selection bias. Second, the individual-level 
processes through which self-selection may operate should partly reduce the effects of 
contact. To explore this, | run OLS regressions and show how the coefficients for the contact 
measures change with the introduction of each individual-level variable. Table 3 presents 
results from these models. 

In Table 3, each row represents the coefficients for a particular form of contact with a 
targeted minority group on prejudice, bearing in mind that variables for both contact 
measures are in the same model. The first column is a baseline model with only contact 
measures. Each other column represents a model that includes an additional variable from 
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the previous one, such that the final column represents the full level-one model. Table 3 
indicates that the coefficients of both contact measures decrease with the addition of most 
variables, particularly with education and age. While the theoretical perspective above would 
have expected that that respondents’ estimates of the percentage of blacks in their 
community to suppress the effects of contact, Table 3 reveals that they do not. 

In short, the above analyses suggest some self-selection bias is at work, but not enough 
to make the effects of contact substantively unimportant. 


Multi-level Analysis Results with Level 2 and Level 1 Controls 


Now, we turn to the multilevel results. Preliminary analyses show that there is significant and thus 
sufficient variation in prejudice among metropolitan areas/counties to justify the use of multilevel 
modeling.'® Table 4 presents the results from random-intercept hierarchical linear model of anti- 
black, ant-Hispanic and anti-Asian prejudice on group threat, contact and personal characteristics. 

Although group threat theory assumes that threat operates similarly across all racial/ethnic 
groups, results in Table 4 cast doubt on that assumption. Instead, the results are more in line 
with the framework above, which suggests that only blacks threaten whites. Looking at the 
traditional measure of threat used in group threat research — the percentage of the targeted 
minority group in a MA/county — we see that its expected positive relationship only holds for 
the black population share and anti-black prejudice. Anti-black prejudice in a MA/county with 
an average percentage of blacks would increase by .03 to .04 points on a 25-point scale in 
models 1 and 2 respectively if the percentage of blacks increased by 1 percentage point, all 
else constant."® The relationship between anti-Asian prejudice and the percentage of the 
population that is Asian is opposite of what group threat theory would predict. Anti-Asian 
prejudice decreases as the percentage of Asians in a MA/county increases, all else constant. 
Supplementary analyses reveal that part of this effect is due to extremely unprejudiced whites 
living in areas with high concentrations of Asians, such as in San Francisco, and outlying 
cases in Modesto, California and York, Pennsylvania. Exclusion of these areas reduces this 
effect to near insignificance (p = .046), which is consistent with the nearly significant negative 
relationship Taylor (1998) finds in her analysis of anti-Asian prejudice with 1990 data. 

Table 4 reveals that there is no relationship between targeted minoritywhite per capita income 
and prejudice."’ Furthermore, there is no relationship between respondents’ estimates of the 
targeted minority population in their communities and prejudice." The first finding is perhaps not 
surprising because the threat is likely cultural — and to a lesser extent, political — rather than 
economic. While respondents’ estimates of the percentage of blacks in their communities were 
expected to be directly related to prejudice, this is not the case as is discussed further below." 

Whereas contact theory assumes that contact operates similarly across racial and ethnic 
groups, the framework presented here does not. Table 4 reveals that after controlling for 
group threat and personal characteristics, superficial contact — such as knowing a member of 
a targeted minority — is associated with reduced prejudice against Asians and Hispanics, 
albeit the effect for the latter is small. Looking at the coefficient for knowing an Asian person 
in the final column of Table 4 reveals that a woman who is average on all characteristics, lives 
in an MA/county with an average percentage of Asians, and knows an Asian person would be 
expected to have a prejudice score 1.21 points lower on a 27-point scale than her counterpart 
who neither knows nor feels close to an Asian person, all else constant. Yet as the framework 
above predicts, there is no relationship between this impersonal contact and prejudice when 
the targeted minority is blacks. 

As contact theory and the theoretical framework above expect, the most intimate form of 
contact — knowing and feeling close to a member of a targeted minority — is most strongly related 


Reconciling Group Threat and Contact Theories + 2195 


to the reduction of prejudice, even after extensive controls at both levels of analysis. Looking at 
the coefficient for this variable in the anti-black prejudice model 2, for example, reveals that a 
woman who is average on all characteristics, lives in an MA/county with an average percentage 
of blacks, and knows and feels close to a black person would be expected to have a prejudice 
score 1.74 points lower on a 25-point scale than her counterpart who neither knows nor feels 
close to a black person. While some of this effect may be due to unmeasured self-selection bias, 
these results combined with the analyses above suggest this effect is substantively meaningful. 

Finally, we turn back to the finding that respondents’ estimates of the percentage of blacks 
in their community are not related to prejudice. Some previous research suggests that percent 
black does not affect prejudice at the neighborhood/community level because whites are more 
likely to have contact with blacks here under ideal conditions (Oliver and Mendelberg 2000; 
Oliver and Wong 2003). By having more meaningful contact with blacks, whites are not as 
threatened by large numbers of blacks at this more local level (Jackman and Crane 1986). Yet, 
itis only possible to test the interactive effects of contact and whites’ estimates of percent black 
in their communities. | re-estimate model 2 with the variable for knowing and feeling close to a 
black person as the only form of contact and an interaction term between this variable and 
perceived percent black. This effect is significant in the expected direction, given theoretical 
expectations (p = .0295, one-tailed test). Figure 1 shows that among whites who know and feel 
close to a black person, prejudice decreases as estimates of the percentage of blacks in their 
communities increase. For other whites, the opposite relationship exists. 


Figure 1. Interaction Between Perceived Percent Black and Contact 
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Conclusion 


Since Park (1950) developed his race relations cycle, scholars have attempted to address 
whether propinquity breeds hostility or harmony. Whereas group threat theory answers 


2196 © Social Forces Volume 84, Number 4 * June 2006 


hostility, contact theory generally answers harmony. Despite the changing racial and ethnic 
landscape of the United States, these theories remain focused on black-white relations, 
assuming that threat and contact operate similarly across different racial/ethnic groups. This 
paper attempted to address criticisms of group threat and contact theories and reconcile 
them. The framework presented here posits that a historically and culturally rooted 
racial/ethnic hierarchy differentially shapes whites’ threat of, contact with, and ultimately 
prejudice toward minorities. 

Through analyses of 2000 GSS data linked to census data, | find that group threat and 
contact theories’ implications are both right: propinquity breeds hostility and harmony. Yet, 
in line with the framework presented here, the effects of propinquity depend on the minority 
group in question and the actual form of contact. In whites’ broader, “objective” racial/ethnic 
environment, large numbers of blacks may arouse threat and give rise to prejudice, but large 
numbers of Hispanics and Asians do not (e.g., Taylor 1998). When whites have relatively 
superficial contact with members of these minorities, they are not less prejudiced against 
blacks but are less prejudiced against Hispanics and Asians. When whites know and feel 
close to members of these minorities, they are less prejudiced toward all groups. These latter 
findings are consistent with the strong effects of friendships on prejudice (Pettigrew 1998). 

Although Blalock (1967) and others have argued that this threat, particularly against blacks, 
has an economic basis, the results here do not support such a conclusion. While the basis 
of this prejudice is more likely cultural threat, it is possible that it is also political in nature. 
Though this paper attempted to measure potential economic threat, one challenge that it 
leaves to future research is to empirically disentangle alternate sources of threat. 

These results also suggest that threat operates somewhat differently at different levels of 
analysis, lending credence to some previous research (Oliver and Mendelberg 2000). This is 
most likely because a large presence of blacks in white’s broader, racial/ethnic environment 
evokes negative, culturally and historically rooted stereotypes of blacks. Whites feel that 
blacks are not like them and are threatened by their way of life. When whites begin to know 
and feel close to a black person, however, these stereotypes are disconfirmed; whites feel 
less threatened; and, they express less prejudice. Although this is a more optimistic picture 
of future racial and ethnic relations, it still suggests that prejudice will change slowly — one 
person at a time. 

What is less clear from the results is the role that whites’ perceptions play in prejudice. 
Whites’ perceptions of the racial/ethnic composition of their self-defined communities do not 
directly shape anti-black prejudice as expected, but instead partly depend on contact 
(Jackman and Crane 1986). Threat may simply be less pronounced in whites’ communities, 
and as Oliver and Wong (2003) suggest, contact may better militate against prejudice there. 
Yet, this finding may be due to the fact that perceptions of racial/ethnic context directly shape 
perceived threat, but not prejudice. One study indicates that Germans’ perceptions of 
racial/ethnic composition are directly related to feelings of threat, but not prejudice 
(Semyonov et al. 2004). Despite this paper's incorporation of measures of perceptions of 
racial/ethnic composition, future research should build measures of perceived threat into its 
models. Given that some definitions of prejudice incorporate threat and others do not (Allport 
[1954] 1979; Blumer 1958), this represents a large conceptual and empirical challenge. 

The racial/ethnic hierarchy that shapes whites’ threat of, contact with, and prejudice 
towards minorities may be even more complex than existing data allow researchers to 
ascertain. Hence, one challenge is for survey researchers to begin to collect fine-grained data 
on Hispanics, Asians, Native Americans, Arabs and multiracial individuals as well as different 
measures of prejudice towards them. For example, some might suggest that the reason that 
whites who know a Hispanic person are only slightly less prejudiced is because of the 
tremendous amount of variation among different Hispanic groups in becoming white and 
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different sources of prejudice toward them (Bonilla-Silva 2004). This suggests that my 
hypothesis needs qualification. Furthermore, Native Americans’ contemporary position and 
Arabs’ post-9/11 position in this hierarchy may be comparable to or even worse than that of 
blacks. Finally, if the future hierarchy consists of blacks and non-blacks (Gans 1999; Yancey 
2003), then it is also important to understand how whites view multiracial individuals, 
particularly those who are considered to be partly black. 

Despite the challenges this paper leaves to future research, it is important to remember its 
primary contribution: to reconcile group threat and contact theories by proposing that a 
historically and culturally rooted racial/ethnic hierarchy differentially shapes whites’ present- 
day threat of, contact with, and ultimately, prejudice towards blacks, Hispanics and Asians. 


Notes 


1. For example, Yancey (2003:127) reports that 50 percent of the Hispanic population self- 
identified as white in 1990, while 95 percent did in 1992. 


2. The majority of missing cases are missing due to non-response on the dependent 
variables, perceptions of the minority population, income or some combination thereof. 
Supplementary analyses indicate that the substantive results remain the same when 
cases missing information on independent variables are included. 


3. Due to missing data, the respective level-2 sample sizes are 99, 97 and 98. The number of 
individuals within each MA/county ranges from one to 35. Because estimates may be 
biased by these small sample sizes, | ran models excluding MA/counties with fewer than 
three individuals and obtained the same substantive results. 


4. While it may be preferable to measure racial/ethnic composition at the census tract or 
block level, it is not possible due to GSS confidentiality concerns. Yet, | include measures 
of respondents’ perceptions of minority concentration in their own (self-defined) 
communities. 


5. Other questions, asking the degree to which groups are violence-prone and poor, loaded 
on a separate factor in principal components factor analyses and/or reduced scale alphas 
when included in lieu of the questions used here. To the best of my knowledge, no other 
GSS 2000 questions measuring prejudice are exactly comparable across these targeted 
racial/ethnic groups. Policy attitudes are no exception. 


6. Chronbach’s alphas for anti-black, anti-Hispanic and anti-Asian prejudice are .71, .60 and 
.68, respectively. 


7. In other analyses, | also included dissimilarity indices, but they did not change the results. 


8. Complete political data at the MA/county level is unavailable. Because racial/ethnic 
composition is probably correlated with political measures, it may be capturing effects of 
political threat. 


9. Supplementary analyses with mean replacement produced the same substantive results. 


10. Mean replacement is used for missing cases, but the results are the same without it. 
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ital 


| use full maximum likelihood estimation, but the results are the same with restricted 
maximum likelihood estimation. As a further diagnostic check (Raudenbush and Bryk 
2002), | compared the model-based standard errors with robust or Huber-corrected 
standard errors of level-2 variables, which are sometimes sensitive to HLM assumptions 
depending on level-2 sample size. Doing so produced the same substantive results. | use 
the former here because robust standard errors may overcorrect lower-level variances 
(Maas and Hox 2004). 


12. The level 1 and 2 models are, respectively: 


ce} Baj=voot Yor (MEAN PERCENT MINORITY); +7099 (MEAN INCOME INEQUALITY 


(1) PREJUDICE; = B oj + B 1) (ESTIMATE PERCENTAGE MINORITY; — ESTIMATE 
PERCENTAGE MINORITY) + B 9; (JUST KNOWS MINORITY) + B 3; (KNOWS AND FEELS 
CLOSE TO MINORITY) + Bay FAMILY INCOME; — FAMILY IN Home) + Bg (AGE; = 
AGE) + B 6; | (EDUCATION; EDUCATION) + B7j (MALE) + ‘ij 


y+ 
J 
¥03 (MEAN LOGGED POPULATION) + u0j 


The term, r;, represents the level-1 or individual random error; Ug; represents the level-2 
or MA/county random effect. Substituting equation 2 into equation 1 yields the full 
equation. 


. The slopes for community-level estimates of racial/ethnic composition and contact 


measures do not vary randomly among MA/counties or improve model fit when allowed 
to vary randomly. 


14. Supplementary logistic regression analyses revealed that age and education were most 


TS: 


16. 


consistently related to whites’ contact with all targeted minority groups. 


The variance in anti-black, anti-Hispanic and anti-Asian prejudice among MAs/counties is 
5, 4 and 8.5 percent, respectively. 


The effects of racial/ethnic composition may be founded with region (Lieberson 1980). As 
such, | ran models including binary variables for regions in which these racial/ethnic 
minorities are concentrated. (See Appendix B.) 


. In lieu of this measure of re/ative inequality, | also used abso/ute economic indicators such 


as unemployment, but they were insignificant. 


. Multilevel models alternately including perceived and actual racial/ethnic composition 


produced the same results. 


. Neither the effects of proportion minority nor estimates of proportion minority are 


curvilinear (see Taylor 1998), except for a small curvilinear effect for perceived percent 
Hispanic. When perceived percent black is included as binary variables representing 20, 
33, and 50 percent black, it is also insignificant. 
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Appendix A: Question Wording and Descriptive Statistics for Stereotype Items 


INTERVIEWER: “Now | have some questions about different groups in our society. I'm going 
to show you a seven-point scale on which the characteristics of people in a group can be 
rated. In the first statement a score of 1 means that you think almost all of the people in that 
group are [i.e. hard-working]. A score of 7 means that you think almost everyone in the group 
are [i.e. lazy]. A score of 4 means you think that the group is not towards one end or another, 
and of course you may choose any number in between that comes closest to where you think 
people in the group stand.” 


Table A1: Descriptive Statistics for “Intelligent-Unintelligent” Stereotype in Percentages 


Intelligent Unintelligent 
1 2 3 4 5 6 7 
Blacks? 3 6 15 50 18 6 1 
Hispanics? 3 5 13 54 18 6 1 
Asians° 8 18 23 43 6 3 0 


* Mean = 3.96; standard deviation = 1.08. ° Mean = 4.01; standard deviation = 1.04. 


“Mean = 3.31; standard deviation = 1.15. 


Table A2: Descriptive Statistics for “Hardworking-Lazy” Stereotype in Percentages 





Hardworking Lazy 
| 2 3 4 5 6 7 
Blacks? 2 4 12 46 22 14 2 
Hispanics? 5 10 19 43 16 6 1 
Asians° 10 19 27 34 8 2 | 


4 Mean = 4.24; standard deviation = 1.15.” Mean = 3.78; standard deviation = 1.22. 
“Mean = 3.19; standard deviation = 1.21. 


Table A3: Descriptive Statistics for “Commitment-Lack of Commitment” Stereotype in 
Percentages 


Commitment to Lacks Commitment 

Strong Families to Strong Families 
1 2 3 4 5 6 7 
Blacks* 4 10 18 34 21 11 2 
Hispanics? 10 22 23 31 10 3 | 
Asians° 14 ZS 24 31 6 1 0 


4 Mean = 3.99; standard deviation = 1.34.” Mean = 3.24; standard deviation = 1.31. 
“Mean = 2.96; standard deviation = 1.24. 
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Appendix B 
Table B1. Random-Intercept Hierarchical Linear Model of Anti-Minority Prejudice on Region, 
Group Threat, Contact, and Personal Characteristics: Non-Hispanic and Non-Asian Whites Only 


Anti- Anti- Anti- 
Prejudice Black Hispanic Asian 
Fixed Effects 
Level-2 Variables 
Region * .30 -.05 -.53* 
(.30) (.31) (.25) 
Percentage of Targeted Minority in MA/county 3" -.01 -.08** 
(.01) (.02) (.03) 
Targeted Minority/White Per Capita Income -.28 .03 -.13 
(.40) (.96) (.34) 
Level-1 Variables 
Estimate of Percentage Targeted Minority in -.01° 01 .02 
Community (.01) (.01) (.01) 
Knows a Member of Targeted Minority -.56 -.67* 3 i 0 phe 
(.37) (.30) (.25) 
Knows and Feels Close to a Member of Targeted = Lian 21 Odden -1.52*** 
Minority? (.38) (.32) (.28) 
Family Income (Logged) -.09 -.05 -.13* 
(.06) (.07) (.06) 
Age .04*** Ss 01 
(.01) (.01) (.01) 
Education 311" -.12* -.13"* 
(.04) (.04) (.04) 
Male .88*** 27 AT 
(.22) (.22) (.21) 
Constant Zot 1:82" .48* 
(.36) (.28) (.22) 
Random Effects 
Variance Components 
Intercept, Ug , 44 47" 09 
Level-1 Effect, 1; 9.69 8.73 7.83 
Fit Statistics 
Level-2 Pseudo R? 16 04 89 
Level-1 Pseudo R? 42 08 07 
Deviance 4408.70 4021.01 3781.09 


Source: General Social Survey (2000) and Census (2000) 
Notes: See Table 4. 
*Region: In anti-black prejudice model, South = 1 (else = 0). In other models, West = 1 (else = 0). 
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Abstract 

Prior research has shown that individuals living in the South express significantly less 
tolerant attitudes than the rest of the nation, while individuals residing in urban areas 
express significantly more tolerant attitudes than their rural peers. We seek to explain these 
generally unspecified Southern and urban effects by identifying demographic contextual 
factors that affect individuals tolerance levels. Using 1976-2000 General Social Survey and 
1990 U.S. Census data, we find that net of individual factors, residing in an area with a 
larger proportion of college graduates significantly increases individual levels of tolerance, 
while residing in an area with a larger proportion of evangelical Protestants significantly 
decreases tolerance. We also find that the Southern and urban effects on tolerance become 
non-significant after contextual-level controls are added. 


Civil liberties are a cornerstone of American ideology, but they have been denied to many 
Americans either through de jure or de facto means for much of the nation’s history. While 
court decisions and legislation in the past 50 years have explicitly guaranteed these rights, 
there nevertheless remains public debate in the United States about such issues as free 
speech, privacy and assembly, especially for controversial groups. The passage of referenda 
that restrict the civil rights of illegal immigrants and the movement to ban same-sex marriages 
are just two recent examples that suggest that not all Americans are comfortable with putting 
the principle of universal rights into practice. 

It is therefore no surprise that for more than 40 years social scientists have been interested 
in Americans’ willingness to extend civil liberties to unpopular groups, which has often been 
defined in the literature as “tolerance.” (Dynes 1967; Stouffer 1963) This prior research 
focuses primarily on the individuallevel determinants of tolerance, but it also shows that 
significant spatial variations in tolerance exist. Specifically, individuals who live in the South 
and in rural areas are, on average, less tolerant than people who live outside of the South and 
in urban areas, even after controlling for individual characteristics (Abrahamson and Carter 
1984; Dynes 1967; Nunn, Crockett and Williams 1978; Wilson 1991, 1994). 

Theories explaining tolerance have emphasized that exposure to diverse values, beliefs 
and lifestyles tend to increase one’s willingness to extend civil liberties (Stouffer 1963; Wirth 
1938). Inherent in these models is an assumption of contextual effects: individuals’ attitudes 
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are determined not only by their personal characteristics (e.g., education, age, income) but 
also by the characteristics of the people and social contexts that surround them. However, the 
numerous studies of tolerance to date have focused almost exclusively on individual-level 
factors and have not sought to explain these spatial variations (Abrahamson and Carter 1984; 
Beatty and Walter 1984; Dynes 1967; Ellison and Musick 1993; Nunn, Crockett and Williams 
1978; Stouffer 1963; Tuch 1987; Weakliem and Biggert 1999; Williams, Nunn and St. Peter 
1976; Wilson 1991, 1994). 

In this paper, we show that the Southern and urban “effects” are actually proxies for more 
specific contextual factors that affect individual levels of tolerance. Using multilevel modeling 
with 1976-2000 General Social Survey (GSS) data, we test a series of contextual hypotheses 
by controlling for individual-level characteristics while adding demographic contextual 
properties such as percent of the population who are college graduates, racial diversity, 
religious composition, percent over the age of 65, and percent new residents, in addition to 
the Southern and urban measures. We find that the proportion of college-educated individuals 
in an area explains the rural/urban gap in tolerance levels, while religious composition explains 
the Southern effect. 


Theoretical and Empirical Evidence 


Theoretical Explanations for Spatial Variation in Tolerance 


Two “environmental” variables that have consistently been associated with individuals’ 
tolerance towards civil liberties are region and urban status. People located outside of the 
South and/or inside urban areas have been found to exhibit significantly higher tolerance 
levels than people residing in the South and/or rural areas (Abrahamson and Carter 1984; 
Dynes 1967; Nunn, Crockett and Williams 1978; Wilson 1991, 1994). 

Two competing, but not mutually exclusive, explanations have been offered for these 
spatial variations in tolerance — compositional and contextual (Figure 1, upper panel). 
According to the compositional explanation, spatial variations in tolerance exist because some 
areas have more people with individual characteristics that are associated with lower 
tolerance (e.g., less education, fundamentalist religious affiliations, older age) than other 
areas. Therefore, the apparent environmental influence on attitudes is simply the 
consequence of individual-level characteristics (which may or may not be measurable) and 
not a contextual effect at all (Hauser 1970). 

In contrast, the contextual explanation asserts that variation in norms and values across 
places exist above and beyond any compositional effects. Advocates of the contextual 
explanation have typically offered a subcultural explanation in which the cities and regions are 
described as having su/ generis cultural dimensions that affect tolerance. (For cities, see Wirth 
1938 and Stouffer 1963; for regions, see Nunn, Crockett and Williams 1978 and Weakliem and 
Biggert 1999.) These cultural differences are reflected in the tolerance of residents, regardless 
of their individual characteristics. Even if two places were to have identical populations, they 
could have different average levels of tolerance as a result of their specific cultural 
characteristics. Subcultural explanations often either imply or directly state that these cultural 
factors are not reducible to specific empirical characteristics of the place. As a result, any spatial 
variation in tolerance not explained by compositional effects has been attributed to general 
socio-historical cultural factors rather than specific place-level demographic measures. 

We expand on the existing compositional and subcultural conceptualizations by introducing 
a demographic explanation as an alternative contextual effect on individual tolerance (Figure 


Accounting for Spatial Variation in Tolerance + 2207 


Figure 1. Conceptualizations of Spatial Variation in Tolerance 


Previous: 


COMPOSITIONAL CONTEXTUAL 


Individual Characteristics Subculture 


Current: 


COMPOSITIONAL CONTEXTUAL 


Demographic Characteristics 


aoe" 


Individual Characteristics Subculture 


1, lower panel). We propose that the demographic characteristics of a place (such as the 
proportion of a population with a college degree) may have contextual effects on individuals’ 
levels of tolerance. While the demographic measures are simply an aggregation of the 
individual-level characteristics that are included in the compositional explanation, we believe 
that the characteristics of the community taken as a whole may have a distinct effect on 
individual levels of tolerance. Just as the compositional and subcultural explanations are not 
mutually exclusive, the demographic explanation is not necessarily exclusive of the other two 
explanations. It is possible that all three factors contribute to spatial variations in tolerance." 


Contextual Effects on Individuals’ Attitudes 


As previously noted, there are no studies to date that have examined specific contextual 
explanations of Americans’ willingness to extend civil liberties. However, recent multilevel 
studies have found evidence in support of contextual explanations when examining spatial 
variations in racial prejudice (Taylor 1998), in attitudes towards gender equality (Moore and 
Vanneman 2008) and in support for the death penalty (Baumer, Messner and Rosenfeld 2003). 

Taylor (1998) conducted a multilevel analysis linking 1990 General Social Survey and U.S. 
Census information and found a significant contextual effect of percentage of black residents 
in an area on whites’ racial attitudes even after controlling for compositional effects. Taylor 
also found percentage black to be a stronger contextual indicator of whites’ racial attitudes 
than region (measured as South/non-South), but warned that racial composition cannot easily 
be disentangled from Southern history and culture. Thus, she concluded that the subcultural 
hypothesis should not be completely disregarded. 

Moore and Vanneman (2003) merged General Social Survey data with U.S. Census and 
National Survey of Religious Identification data to examine the association between the 
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proportion of fundamentalist Protestants in a state and individuals’ gender attitudes. They also 
found evidence of contextual effects. Individuals residing in states with higher proportions of 
fundamentalist Protestants expressed significantly more conservative gender attitudes even 
after controlling for an array of individual-level characteristics. Like Taylor (1998), they found 
that the Southern region effect became non-significant with the addition of other contextual 
indicators. 

In a study of spatial variation of support for the death penalty, Baumer, Messner and 
Rosenfeld (2003) found that contextual homicide rates, racial composition and political 
climate all had significant effects on the likelihood of supporting the death penalty, even after 
controlling for individual characteristics. As in the other studies cited here, regional variations 
that existed after controlling for compositional effects became non-significant with the 
addition of other contextual variables. 


Variables Associated with Decreased Tolerance 


Researchers have argued that the South/non-South regional gap in tolerance is due to social 
and cultural factors not accounted for by demographic variation (Nunn, Crockett and Williams 
1978; Weakliem and Biggert 1999). Reed (1983) claims that Southerners behave similarly to 
ethnic groups, retaining a regional consciousness and traditional value orientation that is 
protectionist in nature. In their replication of Stouffer's (1963) groundbreaking tolerance 
research, Nunn, Crockett and Williams (1978:105) associated the South with “an insularity of 
mind that is slow to change, actively belligerent toward the new, and openly intolerant toward 
a diversity of viewpoints.” Southerners may have stronger resistance to the cosmopolitan 
“marketplace of ideas” that has moved the rest of the nation towards greater tolerance levels 
(Weakliem and Biggert 1999). We test the strength of this subcultural explanation of tolerance 
by incorporating several variables measuring both the compositional and demographic 
characteristics of a place. 

Religious denominations not only generate world views that shape their members’ political 
tolerance attitudes both from the pulpit and through informal interactions among like-minded 
congregationists (Billings and Scott 1994), but they may also have an impact on the larger 
community (i.e., contextual effects). Religious leaders directly and indirectly affect norms, 
political priorities and decisions, and the overall culture of a community. Therefore, we may 
expect that the influence of religious institutions would diffuse beyond the membership of the 
church, synagogue or mosque. Previous research has documented the individual-level effects 
of religious affiliation, attendance and theological conservatism on levels of tolerance (Beatty 
and Walter 1984; Ellison and Musick 1993; Nunn, Crockett and Williams 1978; Stouffer 1963). 
On average, Jews score highest on measures of tolerance, while Protestants — and 
particularly, fundamentalist/evangelical Protestants — score lowest. Religious attendance has 
a negative effect on tolerance and this effect is strongest for those who are members of less 
tolerant denominations, such as conservative and fundamentalist Protestants (Beatty and 
Walter 1984). We include religious affiliation and attendance as compositional controls in our 
model and the distribution of religious affiliations in the area to test for demographic effects. 

Research consistently reveals a negative correlation between age and tolerance (Bobo and 
Licari 1989; McCutcheon 1985; Stouffer 1963), but Wilson (1994) showed that this 
relationship is generally due to differences in tolerance across cohorts, rather than a change 
in attitudes as people age. For most of the 20th century, each new cohort has been born into 
a generally more liberal and tolerant atmosphere and in turn, has expressed greater levels of 
tolerance. We include age at the individual level and percent of the population over age 65 at 
the contextual level in our study. 
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Evidence about the relationship between racial/ethnic group identification and levels of 
tolerance has been somewhat less consistent. Bobo and Licari (1989) found only those blacks 
who already held negative attitudes towards militarists were significantly less tolerant than 
whites. They found no significant differences across race for any other target groups. Ellison 
and Musick (1993) found blacks to be significantly less tolerant of both racists and militarists 
but not significantly different from whites regarding communists, atheists or homosexuals. 
Wilson (1991) found that minorities expressed significantly lower tolerance levels than whites 
for both left- and right-wing groups.” Stouffer (1963) found whites to be more tolerant than 
non-whites but these differences were minimal after taking rural/urban residence and 
education into account. We include a four-category race/ethnicity variable at the individual 
level and a racial diversity index to capture any compositional and demographic contextual 
effects, respectively. 


Variables Associated with Increased Tolerance 


Research has consistently shown that urban areas have higher average tolerance levels than 
rural areas in the United States. Both Wirth (1938) and Stouffer (1963) argued that the densely 
populated heterogeneous environments found in many cities increase opportunities for 
individuals to interact with diverse groups in a variety of social relationships. This in turn leads to 
an increased willingness to tolerate differences and to develop a shared “live and let live” 
mentality. As with the Southern effect, the urban effect is typically considered to be a subcultural 
phenomenon, rather than one that can be explained through compositional and/or demographic 
factors. In this study, we identify other compositional and contextual variables that may have a 
positive effect on tolerance in order to test the subcultural hypothesis regarding cities. 

The positive relationship between an individual's education level and tolerance is one of 
the most consistently documented results in the empirical research (Bobo and Licari 1989; 
Davis 1975; Dynes 1967; Hyman and Wright 1979; McCutcheon 1985; Stouffer 1963; Weil 
1985; Williams, Nunn and St. Peter 1976). Stouffer (1963:127) discussed education's role in 
providing a setting where one is exposed to values and ideas that may differ from those 
learned in the home, thus creating a tolerance-producing situation. As education increases, so 
does one’s knowledge of diverse cultures, openness to new ideas, and willingness to risk 
uncertainty and ambiguity. The resulting cognitive sophistication leads to consistency in 
applying a democratic ideology and general norms of tolerance to disliked groups (Bobo and 
Licari 1989). We include individuals’ years of school completed as a compositional control and 
percent of the population with a college degree as a demographic measure in our analysis. 

Geographic mobility has been cited as a primary contributor to regional and urban diversity 
as well as to increased exposure to diversity at the individual level (Stouffer 1963; Williams, 
Nunn and St. Peter 1976; Wilson 1986, 1991). As individuals emigrate from less tolerant areas 
to more tolerant areas, they become exposed to, and are likely to, adopt the more tolerant 
attitudes in their new location. Additionally, we might expect that immigrants to less tolerant 
communities will contribute to the diffusion of tolerance by exposing the residents of these 
areas to more tolerant attitudes.? Wilson (1991) found that even migrants between 
communities of the same size experienced increases in their tolerance levels, perhaps in 
response to the “culture shock” that Stouffer (1963) claimed could occur when individuals are 
exposed to previously unknown ways Of life. Individuals native to an area are exposed to an 
increased diversity of ideas and attitudes as the immigrant (both international and domestic) 
population increases, which should lead to increased tolerance levels. Accordingly, we 
include in our multilevel analysis an individual-level control for residential mobility as well as 
a contextual measure for the percentage of recent migrants to the area. 
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Data and Methods 


Tolerance can be defined as the willingness to apply universalistic norms of treatment and 
rights towards people with viewpoints different from or even repugnant to one’s own 
attitudes (Stouffer 1963). We apply the most commonly used metric within the tolerance 
literature: the willingness to extend civil liberties to persons with group associations that are 
either on the left or the right of the political spectrum (Abrahamson and Carter 1984; Dynes 
1967; McCutcheon 1985; Stouffer 1963; Williams, Nunn and Peter 1976; Wilson 1991, 1994).* 

In this study, we use individual-level data from the 1976-2000 waves of the NORC General 
Social Surveys (GSS), a nationwide survey of non-institutionalized English-speaking adults in 
the United States. Jo/erance is measured using a 15-item battery that asks if the respondent 
would be willing to allow members of two politically right non-normative groups (racists and 
militarists) and three politically left non-normative groups (communists, atheists and 
homosexuals) to make a public speech, teach in a local college, and/or have an authored book 
in the public library. The total number of affirmative answers to the 15 scenarios is the 
respondent's score on the tolerance scale. The items in the scale are highly correlated, with 
an overall Cronbach's alpha of .92 and bivariate associations (Kendall's tau-b) among the 
fifteen items ranging from .25 to .68, with all correlations significant at p < .001. 

To control for compositional effects, we included a set of individual-level controls that have 
been used in previous studies of tolerance. They are birth year, gender, race/ethnicity, years of 
school completed, family income (logged), religious affiliation,> frequency of religious service 
attendance, and residential mobility (currently living in a different city/town/county than when 
age 16). In addition, the year of survey is included to control for national trends in tolerance 
across survey years. After the deletion of cases with missing data, there are 16,512 
respondents in the data set. Descriptive statistics for the individual-level variables are 
presented in the upper panel of the appendix table. 

The sampling frame of the GSS uses primary sampling units (PSUs), which are either 
metropolitan areas (MAs) or rural counties. (See Davis, Smith and Marsden 2001 for further 
details on the sampling method of the GSS.) In total, 179 distinct PSUs were used over the 
surveys included in this study.® Slightly more than half (58 percent) of the PSUs are 
metropolitan areas and more than one-third of the PSUs (39 percent) are located in the South. 
Forty-three of the PSUs are Southern rural counties (24 percent). 

The PSU data set includes the following variables: region (South/non-South),” a dummy 
variable indicating whether the PSU is a metropolitan area or a county, proportion of the 
population that has completed college, an index of racial diversity®, proportion of the 
population that has moved into the MA or county in the past five years, and proportion of the 
population that is over age 65. All of these variables were calculated from 1990 U.S. Census 
data. Variables measuring the religious composition of the PSU were calculated from the GSS 
data. The descriptive statistics for PSU-level variables may be found in the lower panel of the 
appendix table. 

In the analysis, we used hierarchical linear modeling, which is the most appropriate 
regression method for simultaneously estimating micro-level and contextual-level models 
(Bryk and Raudenbush 2002). This two-level hierarchical model corrects for underestimates in 
the standard errors of PSU-level variables that can occur if individual-level data is simply 
appended with contextual variables. The method also adjusts for the correlated error among 
individuals within the same PSU and uses the appropriate degrees of freedom for the PSU- 
level hypotheses. 

The models were run in a stepwise manner, starting with a model that includes only the 
contextual variables for region and urban/rural area. Individual-level controls were added to the 
second model to determine whether the effects of region and urban status were 
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compositional in nature. The other PSU-level variables were added in the third model, allowing 
for the testing of the demographic explanation. In a supplemental stage, we included 
interactions between the contextual variables and their corresponding individual-level 
variables to see if the effects of context are equal across different types of individuals in a PSU 
(for example, whether the percentage of college graduates in a county or MA has an equal 
effect on tolerance for people with and without college degrees). 


Results 


Figure 2 shows the distribution of the individually weighted means of the tolerance scores 
across the PSUs in the sample. There is a wide range of average tolerance scores across 
PSUs, from a low of 3.14 (Lincoln County, AR) to a high of 13.34 (Austin-San Marcos, TX). The 
mean average tolerance score across the PSUs is 8.74, with a standard deviation of 1.93. 


Figure 2. Distribution of Mean Tolerance Scores Across 179 PSUs 
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An unconditional two-level hierarchical model shows that this variation across the PSUs is 
statistically significant (t99 = 3.083; x? = 2123; p < .001 ). The intraclass correlation 
coefficient also reveals that a substantial amount of the variation in the level of tolerance 
across individuals clusters at the PSU level (Snijders and Bosker 1999). The total variance of 
the random effects at the individual (r;) and PSU (tg9) levels in the unconditional model is 
25.124, which means that 12.3 percent (= 3.083/25.124) of the variation in the observed 
scores can be attributed to the PSU level. In other words, the data show that tolerance is not 
determined exclusively at the individual level. However, multivariate analysis is needed to 
partition out the compositional, contextual demographic and subcultural components of this 
spatial variance. 


2212 ¢ Social Forces Volume 84, Number 4 * June 2006 


Table 1: Compositional and Contextual Effects on Tolerance 


Model 1 Model 2 Model 3 


Individual-level PSU-level 
Intercept Intercept2 ops Paes I ashi Oot 
South -1.547*** -.688*** -.254 
Percentage Metropolitan area ; debe .160*** 118 
College Graduates 053% 
Evangelical Protestant -.016** 
Mainline Protestant -.013 
Black Protestant -.016 
Jewish -.049 
Other Affiliation -.001 
No Affiliation .010 
More than 65 Years Old -.013 
New Residents 017 
Racial Diversity Index .003 
Year -.007 -.009 
Male .056 .059 
Years of Education One) SOEs 
Family Income (log) so2Iae webs 
Birth Year 50% ‘059""* 
Different MA/County than Age 16 .463*** 42" 
Race/Ethnicity (White is reference) 
Black -.499* -.503* 
Hispanic StS § 118957 
Asian -.836"* -.869** 
Religious Attendance -.233*** -.230*** 
Religion (Catholic is reference) 
Evangelical Protestant -1.422°* = -1.368*** 
Mainline Protestant -.154 -.118 
Black Protestant -.617™* -.560** 
Jewish ar" .386* 
Other affiliation 147 e132 
No affiliation : ope OU 
PSU Variance (tau) 1.898 418 .268 
x | 163*** 493*** 364*** 


*p<.05 **p<.01 ***p <.001 (two-tailedtest) 
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Table 1 presents the results of the multilevel models that were run in the analysis. The 
results in Model 1, which includes only the contextual measures of South/non-South and 
urban/rural, are consistent with previous research. The mean tolerance levels of Southern 
PSUs are significantly lower than the mean tolerance levels of non-Southern PSUs and the 
mean tolerance level of urban areas is significantly higher than the mean tolerance of rural 
areas. The inclusion of the variables explains nearly 40 percent of the variance across PSUs, 
aS T99 Is reduced from 3.083 to 1.898. 

Model 2 adds the individual-level controls to the model, in order to test the compositional 
explanation. If the effects of region and urban status are compositional, the PSU-level 
coefficients in Model 1 will become non-significant with the addition of the controls in Model 
2. In general, the effects of the individual-level variables are consistent with previous research. 
Additional years of education, more income, a later birth cohort and migration to a new MA or 
county have positive effects on tolerance at the individual level. More frequent attendance at 
religious services and affiliation with an evangelical or black Protestant denomination have 
negative effects on individual tolerance (compared to Catholic affiliation).2 Jews and 
individuals who reported no religious affiliation had higher levels of tolerance than did 
members of other religious groups. Whites have higher levels of tolerance on average than do 
non-whites, with Hispanics reporting the lowest levels of tolerance, followed by Asians and 
then blacks."° Only gender and the year of the survey did not have a significant effect. 

More relevant to the study at hand is an examination of the compositional dimension of 
the regional and urban/rural gaps in tolerance. The magnitude of the coefficients for both 
region and urban status are substantially smaller in Model 2 than they are in Model 1, but they 
remain statistically significant at p < .001. Slightly more than half of the PSU-level effects 
observed in Model 1 are accounted for by compositional effects, and the PSU-level variance 
is also reduced substantially (from 1.898 to .418). Nevertheless, the chi-square value of tq 
(493) is still statistically significant at p < .001, which indicates that the individual-level scores 
remain substantially correlated by PSU even after controlling for compositional effects. 

Model 3 adds the remainder of the contextual variables to determine whether the 
Southern and urban effects can be explained by the demographic characteristics of these 
areas. The addition of the contextual demographic variables reduces the South coefficient by 
approximately two-thirds (from -.688 to -.254) and it becomes statistically non-significant. 
This suggests that the Southern effect can be explained by the combination of 
compositional and demographic contextual factors. Of the original effect of 1.547 points on 
the tolerance scale, 55.5 percent was explained by compositional factors in Model 2 and 
28.1 percent was explained by the contextual factors in Model 3. Between the two sets of 
factors, the Southern region effect was reduced to statistical non-significance, which 
indicates that after the demographic and compositional factors are accounted for, the 
subcultural effect is minimal. 

Additional stepwise analysis (Table 2) indicates that the main contextual reduction in the 
Southern coefficient in Model 3 results from the inclusion of the religion variables. Just as the 
individual control for membership in an evangelical Protestant church has a negative effect on 
tolerance, living in an area with a higher proportion of evangelical Protestants also has a 
negative effect on individual tolerance levels. The lower levels of tolerance in the South seem 
to be primarily a Bible Belt effect (Ellison and Musick 1993; Kosmin and Lachman 1993). 

Similarly, the urban effect is reduced to non-significance by the addition of the 
demographic explanatory variables in Model 3 (Table 1). With the inclusion of these variables, 
the difference between MAs and rural counties in predicted mean tolerance is virtually zero. 
Once again, we can account for the initial difference between urban and rural areas through a 
combination of compositional and contextual factors. The compositional factors accounted 
for 50.2 percent of the original effect, while the demographic contextual factors explained a 
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Table 2: Stepwise Decomposition of South and Urban Coefficients 


I eee eee ee eee eee ee ee 
Model South Coefficient Urban Coefficient 
Model 2 -.688*** GO; 

Model 2 + Education Control =o 7ilicae -.199 
Model 2 + Religion Controls -.160 20 
Model 3 -.254 118 


*™D < .001 (two-tailed test) 


further 42.1 percent. The non-significance of the remaining effect suggests that there is very 
little subcultural difference between urban and rural areas in terms of tolerance. 

Further stepwise analysis (Table 2) reveals that the contextual reduction of the urban 
coefficient to non-significance is due to the inclusion of the PSU-level education variable. 
Therefore, we conclude that the urban/rural gap in tolerance is due to the systematic 
differences between urban and rural areas in the percentage of residents with college degrees. 
Were urban areas and rural areas to have equal shares of college graduates, we would not 
expect to find differences in their mean levels of tolerance, all other factors being equal." 

Our multilevel analysis supports two of the three main explanations for spatial variation in 
levels of tolerance: the compositional and the demographic. We find that both the 
Southern/non-Southern and urban/rural gaps in mean tolerance levels can be explained by a 
combination of the differences in individual characteristics across the two population groups 
and the demographic characteristics that vary across the areas. Specifically, the Southern 
effect appears to be a reflection of how the religious composition of the South varies from the 
rest of the nation, while the urban effect is due to the greater concentration of college 
graduates in urban areas. The one explanation that we do not find support for is the subcultural 
account. There do not appear to be systematic differences in tolerance across regional or 
urban/rural lines that exist above and beyond the effects of individual and contextual 
demographic characteristics. This is not to say that cultural differences do not exist between 
Southern and non-Southern or urban and rural regions; rather, that the observed differences 
in tolerance between these areas can be traced back to differences in their populations. 


Interaction Effects: Are All People Affected Equally? 


A second question regarding the contextual effects model is whether the observed 
contextual effects are evenly distributed across all types of individuals or are stronger for 
certain types of people than others. For example, is the effect of being in an area with many 
college graduates the same for individuals with a college degree and those without a degree? 
Are the tolerance levels of evangelical Protestants more strongly affected by the proportion of 
evangelicals in an area than the tolerance levels of non-evangelicals? 

General theoretical models can be posited to support either idea. If beliefs and attitudes 
spread primarily through small-scale interactions such as informal interactions and small 
group activities, we would expect that the effects of context would vary across groups. 
Individuals are more likely to have these personal interactions with people who are like them, 
so the pro-tolerance effects of college graduates may be stronger on other college graduates 
(to whom they have a social bond and similar intellectual framework) than on non-graduates 
(who may be dismissive of graduates’ ideas). 
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On the other hand, the transmission of contextual effects may operate at a macrosocial 
level that does not have varying effects based on individual characteristics. For example, if city 
or county laws are a means of diffusion, both residents with and without college degrees are 
subject to the provisions and the effects. Were this to be the case, we would see little or no 
differences in the effects of context on individual tolerance across different groups in the 
community. 

To examine this issue, we tested whether the contextual effects were consistent across 
people with varying levels of the corresponding individual characteristics. This was done by 
modeling the PSU-level coefficients with the corresponding individual-level variables in the full 
model.’* The results (Table 3) show that the effect of being in a community with more college 
graduates does not vary based on the individual's level of education. Highly educated and less 
educated individuals’ tolerance levels are similarly affected by the percentage of college 
graduates. 

This suggests that the modes of transmission of pro-tolerance attitudes that result from 
greater concentrations of college graduates is achieved through institutional and macrosocial 
means, as opposed to the face-to-face interactions between similar individuals. It may be that 
areas with more college graduates are more likely to pass anti-discrimination laws (and may 
also be more likely to see that they are enforced). These areas may have stronger norms of 
cultural acceptance that lead both the college educated and those without degrees to be 
accepting of non-normative individuals and their ideas. Institutions, such as_ local 
governments, schools, cultural centers and businesses, may be more likely to create and 
support pro-tolerance activities when the population that they serve contains a higher 
proportion of individuals who are likely to value diversity. And even though these public 
activities may be set up in response to the demands of the intellectual elite, their effects are 
likely to be felt throughout the community. 


Table 3: Main Effects and Interactions 


Individual PSU-level PSU* Individual 
Variable Effect Effect Interaction 
Education 506." .049** -.002 
Evangelical Protestant Sika ooue | -.012* | -.013* 


Note: Model includes all other variables in Table 1, Model 3. 
*p<.05 “p<.01 ***p < .001 (two-tailed test) 


A different pattern emerges when looking at the interaction between individual 
membership in an evangelical Protestant church and the percentage of evangelical 
Protestants in the PSU. The statistically significant interaction term in Table 3 indicates that the 
effects of being in an area with a high proportion of evangelical Protestants are greater for 
those who are themselves evangelical. Figure 3 shows that the predicted tolerance score 
declines for all individuals as the share of evangelical Protestants increases in the PSU but that 
the slope is steeper for members than for non-members. In a PSU where 10 percent of the 
population is evangelical Protestant (approximately one standard deviation below the mean), 
the expected gap between members and non-members is 1.07 points. In a PSU where half of 
the population is evangelical (approximately one standard deviation above the mean), the 
expected difference is 1.57 points on the tolerance scale. 

A larger population of evangelicals means that they are more likely to become part of a critical 
mass in local politics, school boards and media, but these results suggest that the contextual 
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effects of religion are somewhat different from that of education. The greater concentration of 
evangelicals will have an impact on local laws, curriculum, news broadcasts and stories, which 
could explain the lower average tolerance for both members and non-members in communities 
with more evangelical Protestants. However, the greater effect for those who belong to the 
religious group suggests that there is also an interpersonal dimension. Evangelicals may have 
their more restrictive and isolationist beliefs reinforced in religious services or other exclusive 
activities in which members interact with other members. The effect of these settings on 
tolerance levels complements the broader social context and leads to the larger negative effect 
of evangelical Protestantism in the community for members of the movement. At the same time, 
non-evangelicals might also be affected by interactions with like-minded individuals in their 
religious (or secular) settings. They may hear more liberal interpretations of Judeo-Christian 
traditions from their religious leaders as a response to the evangelical messages in the 
community. This may lead them to prioritize civil liberties and make them more resistant to 
evangelical messages that are part of the public discourse in the community. 


Figure 3. Predicted Tolerance Scores for Evangelical Protestants and non-Evangelicals by 
Percentage of Evangelical Protestants in PSU 
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Discussion 


Since the 1960s, a long research program has examined the individual characteristics that 
affect levels of tolerance for non-normative groups. It has often been claimed that context 
matters as well, but empirical research on this aspect has been sparse. In this study, we have 


Accounting for Spatial Variation in Tolerance * 2217 


more formally modeled the contextual dimension of tolerance by using a multilevel approach 
to properly account for the effects of context. In addition, we have sought to determine 
whether the somewhat vaguely defined subcultural contextual effects of Southern region and 
urban location are fundamental or simply proxies for more specific contextual-level 
demographic characteristics that affect individual's tolerance levels. 

The role of individual characteristics in generating tolerance is substantial, but our analysis 
has shown that the effect of demographic contextual conditions also must be taken into 
account. There is systematic variation in tolerance levels across communities. Some of this 
variation is due to the varying characteristics of individuals across communities in the United 
States (compositional effects), but the explanatory power of models of tolerance is increased 
by the inclusion of demographic characteristics (contextual effects). In this study, we have 
shown that living in an area with more college graduates has a positive effect on individual 
tolerance, while living in areas with a greater proportion of evangelical Protestants has a 
negative effect on tolerance. The contextual education effect explains the previously observed 
urban/rural difference in tolerance and religious context accounts for the Southern effect. 

While our results do not indicate that there is a significant subcultural dimension to spatial 
variation in tolerance, this does not mean that we are dismissing the role that culture plays in 
determining an individual's level of tolerance. Tolerance is an element of culture and there is a 
dialectical relationship between culture and demography. Places with large percentages of 
college graduates are likely to be culturally different from places with fewer college graduates. 
Similarly, Southern communities are culturally distinct as a result of their unique religious 
demographics. At the same time, the culture of a community also influences the composition 

of the population. A place that is known as an intellectual center (e.g., Boston, San Francisco, 
Washington D.C.) is likely to attract more college graduates and also is likely to have more of 
its residents pursue higher education. The Bible Belt of the South is a cultural description that 
has real demographic implications. Individuals who are not evangelical are less likely to move 
to these areas, while those who do share these beliefs are more likely to do so. In stating that 
the subcultural explanation is not supported by our data, we are not making the claim that 
cultural differences do not exist or that they do not affect individual levels of tolerance. Rather, 
our conclusion is that the less tolerant cultures of Southern and rural communities can be 
explained in large part by the demographic and compositional characteristics of these places. 
Following Molotch, Freudenburg and Paulsen (2000), we conclude that place matters, and 
seek to explain how it does. (See also Florida 2002.) 

Although these data do not allow us to test theories about the mechanisms through which 
these contextual effects operate on individuals’ tolerance levels, our results suggest that the 
two factors operate somewhat differently. The liberalizing effects of education are consistent 
across all members of a community, which may indicate that the effect is transmitted 
primarily through macrosocial means. The contextual education effect may operate through 
institutions, as positions of authority and influence in the United States tend to be occupied 
by the educational elite and also tend to be more responsive to the needs and desires of that 
elite. Where individuals in these positions (particularly at the local levels) are more likely to be 
highly educated, the more tolerant values of this population may be diffused through such 
mechanisms as policy decisions, judicial decisions/precedents, school curricula, and news 
coverage. 

Evangelical Protestant beliefs may also influence community attitudes through these 
macrosocial processes, but we also find evidence that suggests a complementary 
microsocial process at work. Members of evangelical churches are more strongly affected by 
the religious context of the community than are other residents. This may indicate that in- 
group processes complement the more general diffusion of less tolerant beliefs in the 


community. 
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While we have selected the most commonly used metric in the tolerance literature, two 
critiques of the measure have emerged. First, some have argued that any measure of tolerance 
or the willingness to extend civil rights to a group ought to ensure that the group is disliked by 
the respondent (Sullivan, Piereson and Marcus 1982). A willingness to extend civil liberties to a 
group that one favors is not tolerance at all. However, Bobo and Licari (1989) found no significant 
relationship between how individuals feel towards a group and their willingness to provide them 
civil liberties. In this study, we have used the same dependent variable as Bobo and Licari and 
therefore elected not to subsample our data according to target group sensibilities. 

Second, the non-normative groups identified in this study reflect a political landscape that 
is nearly a half-century old. Updated target groups, such as religious fundamentalists and 
feminists, might provide a better measure of tolerance attitudes that shift due to both cohort 
and historical effects. For example, communists may not seem to be as salient a threat since 
the end of the Cold War, but granting the right of religious fundamentalists (of any religion) to 
speak or publish a book constitutes a current test of tolerance. Similarly, feminists continue to 
elicit derogatory labels such as “feminazis” from popular conservative commentators. There 
is great value in the historical continuity of these data, but the development of a more 
contemporary list of threats to society would help us to determine whether the documented 
changes in tolerance are a reflection of changing attitudes or changing threats. 


Conclusion 


While the long-term trend in American society has been one of increasing tolerance, this does 
not mean that all Americans have come to subscribe to the belief that all members of society 
should be guaranteed basic civil liberties. Many of the major political debates of the past five 
years — the operations of the Department of Homeland Security, the treatment of prisoners both 
in terrorism and non-terrorism related cases, immigration policy, same-sex marriage — have at 
their core questions of tolerance and civil rights. The intensity and duration of these debates 
reveal a continued willingness of some Americans to restrict certain populations’ civil rights. 

Intolerance may also have substantial economic implications. In his work on economic 
development, Richard Florida (2003) cites tolerance as one of the “three Ts” that cities will 
need to embrace in order to be successful in the 21* century. (Talent and technology are the 
other two.) He argues that cities will need to be accommodating of the “creative class” that is 
composed of generally non-traditional (in their ethnicity, beliefs, sexuality, etc.) individuals in 
order to generate the innovations that will be the root of prosperity in the next century. 

At the broadest level, tolerance is a reflection of the degree of equality and justice that a 
society bestows upon its members and in particular, its minority groups who are the most 
likely to be subject to the harmful expressions of intolerance. In this paper, we have shown 
that the educational and religious context of American communities has a direct effect on the 
willingness of its members to extend civil rights to controversial groups. As the demography 
of American communities continues to change over time, we can also expect changes in the 
tolerance expressed by their residents. 


Notes 


1. There is a dialectical relationship between the subcultural dimension and the 
demographic dimension. The culture of a place is influenced by its demographic 
composition and the demography of a place is shaped by its culture. This point will be 
expanded upon in the discussion section of this paper. 
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Bobo and Licari, Ellison and Musick, and Wilson all used the General Social Survey for 
their studies, albeit different samples of years. The two studies that found no significant 
results only used single years of data (1984 and 1988, respectively), while the Wilson 
study used data from 1976 to 1989. 


It is also possible that the effect will work in the opposite direction and migrants to less 
tolerant communities will themselves become less tolerant. 


Sullivan, Piereson and Marcus (1982) argue that it is important to include groups from both 
the right and the left of the political spectrum in order to capture populations that both liberals 
and conservatives would oppose. Tolerance has also been measured as attitudes towards 
_ euthanasia, prohibition of alcoholic beverages and allowing epileptics into the workplace 
(Abrahamson and Carter 1984) as well as sexual nonconformity (Stephan and McMullin 
1982). Individuals’ attitudes towards interracial marriage, residential segregation and school 
integration have also been used as indicators of racial tolerance (Wilson 1986; Tuch 1987). 


. The data were recoded using the reclassification system developed by Steensland et al. 
(2000). In contrast to the GSS system of classifying Protestant denominations as 
fundamentalist, moderate or liberal, this approach divides Protestantism into evangelical, 
mainline and black groupings. 


. The PSU samples were revised in 1983 and 1993. In some cases, PSUs were repeated 
across different sampling frames. In these instances, the PSU was only counted once. 


In contrast to the U.S. Census definition of Southern region that includes border states 
such as Delaware and Maryland, we use a “Confederate South” definition that includes 
Alabama, Arkansas, Florida, Georgia, Louisiana, Mississippi, North Carolina, South 
Carolina, Tennessee, Texas and Virginia. We would expect this definition of the South to 
lend greater support of any existing regional subculture effects on tolerance. 


. The diversity index is calculated by summing the squares of the proportion of each 
racial/ethnic group in the area and then subtracting the total from one. It was then 
multiplied by 100 to yield a scale that ranges from 0 (all residents of the PSU are of one 
racial/ethnic group) to 100-(100/n), where n is the number of racial groups in the PSU. In 
this instance, five racial/ethnic groups were identified (non-Hispanic white, non-Hispanic 
black, Hispanic, non-Hispanic Asian and non-Hispanic Native American), so the maximum 
possible value for the diversity index was 80. 


Gay and Ellison (1993) explore the significantly lower levels of tolerance demonstrated by 
black Protestants in some detail. 


10. In order to examine whether the race effect on tolerance was due to the presence of 
questions about racism, we ran the models both with and without the three questions on 
racists. The results were not affected substantially by the inclusion of these questions. 
(Results available upon request.) 


_ Similar results are found when the dependent variable is separated into tolerance of leftist 
(atheists, homosexuals, communists) and rightist (militarists and racists) groups. The only 
difference at the contextual level is that the effect of percent new residents is statistically 
significant in the model for tolerance of leftist groups. (Results available upon request.) 


a 
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12. The interactions were calculated by replacing the original Level 2 equations (8, = Y 90) 
with equations of the form Bg = = G0 HY 12q: where Zy is the state-level variable that 
corresponds to the individual- evel Variable Xi in the level 4 equation. 
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Appendix Table A: Descriptive Variables 


Variable Mean St Dev Minimum Maximum 


Individuals (N = 16,512) 


Tolerance Score 9.38 4.94 0 15 
Year of Survey 1988.15 15.62 1976 2000 
Years of Education 12.88 3.04 0 20 
Male 46 0 1 
Frequency of Religious Attendance 3.90 2.66 0 8 
Family Income (log) 10.24 1.01 5.97 12.16 
Birth Year 1944.30 17.74 1887 1982 
White 81 0 1 
Black 14 0 1 
Hispanic .03 0 1 
Asian .02 0 1 
Different PSU Compared to Age 16 59 0 1 
Evangelical Protestant 26 0 1 
Mainline Protestant we 0 1 
Black Protestant .09 0 1 
Other Religion .05 0 1 
Catholic 25 0 1 
Jewish .02 0 1 
No Religion .09 0 1 
PSUs (N = 179) 
South 39 0 
Metropolitan area 58 0 1 
Percentage: 
More than 65 years old 17.39 4.63 4.88 37.10 
College Degree 35.29 14.41 8.86 65.14 
New Residents 14.49 8.21 3.31 60.62 
Evangelical Protestant 31.03 18.40 0 94.56 
Mainline Protestant 24.14 13.41% 0 71.54 
Black Protestant 9.67 17.08 0 100 
Catholic PERE 16.95 0 78.57 
Jewish .88 1.65 0 12.76 
Other Religion SoHo 3.70 0 27.68 
No Religion 8.21 6.23 0 35.64 
Diversity Index 27.71 17.40 1.66 66.24 


Intersections on the Road to Self-Employment: 
Gender, Family and Occupational Class 
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Abstract 

Are gender differences in the effects of family structure on self-employment participation 
robust across different forms of self-employment? Using event history analyses of 
competing risks and data spanning 20 years, I find that women enter non-professional and 
non-managerial self-employment to balance work and family demands. In contrast, 
family factors do little to explain women’s entrance into professional and managerial self- 
employment; these women are more similar to their male peers and appear to follow a 
careerist model of self-employment. ; 


Between 1975 and 1995, women’s self-employment participation grew by 60 percent, 
compared with 20 percent for men (Blau 1998).' Among American workers ages 25 and older 
in 2003, 9.2 percent of women and 15.5 percent of men were self-employed (Hipple 2004). 
Past research on gender and self-employment consistently finds that family characteristics, 
notably marital status and children, more strongly predict women’s self-employment 
participation than men’s (Arum 1997; Boden 1996; Carr 1996; Renzulli, Aldrich and Moody 
2000; Taniguchi 2002). However, the nature of the relationship between family structure and 
women's self-employment participation is unclear. Some studies find that marriage and 
children increase women’s likelihood of self-employment (Carr 1996; Taniguchi 2002), while 
others show that kin-laden social networks reduce business opportunities for nascent 
entrepreneurs (Renzulli et al. 2000). These contradictory findings point to a complex 
relationship between family structure and self-employment: differences among women, in 
addition to differences between women and men, may interact with family structure to affect 
self-employment participation. 

One source of heterogeneity among self-employed women is the growing polarity of self- 
employment activities. Recent studies show that self-employment is growing in both the 
most and the least rewarded occupations, with little in between (Arum 1997; McManus 
2000). However, research has not examined this heterogeneity in estimating the effect of 
family structure on women's self-employment. Instead, studies tend to either pool all of 
women’s self-employment activities together (Boden 1996; Carr 1996; Taniguchi 2002), or 
focus on small business owners alone (Loscocco and Robinson 1991; Renzulli et al. 2000). 
These two sampling approaches generate contradictory findings regarding the effect of 
family responsibilities on women’s self-employment activities. On the one hand, studies that 
include all of women's self-employment activities find that marriage and children encourage 
women's self-employment. This research has led to conclusions that women use self- 
employment primarily as a strategy to balance work and family commitments, while men use 


I thank Richard Arum, Paula England, Theodore Gerber, Naomi Gerstel, Donald Grant, Joya Misra, 
Hiromi Taniguchi and anonymous reviewers for Social Forces for comments on earlier drafts. This 
paper is based on dissertation research, funded in part by a pre-doctoral grant from the MacArthur 
Foundation Network on the Family and the Economy. Direct correspondence to Michelle Budig, 
Social and Demographic Research Institute, W33cd Machmer Hall, University of Massachusetts, 240 
Hicks Way, Amherst, MA 01003-9278. E-mail: budig@soc.umass.edu. 


© The University of North Carolina Press Social Forces, Volume 84, Number 4, June 2006 


2224 © Social Forces Volume 84, Number 4 * June 2006 


self-employment to advance their careers (Carr 1996). On the other hand, studies limited to 
small business owners finds that, compared to male entrepreneurs, women’s businesses are 
less supported, and even disadvantaged, by their families (Goffee and Scase 1985). Despite 
these contradictions, no studies examine the effect of family structure by different 
occupational classes of the self-employed. Inattention to stratification among the self- 
employed renders past findings about gender, family and self-employment participation 
uncertain. Is the finding that family structure encourages women's, and not men’s, self- 
employment consistent across professional and non-professional occupations? To answer 
this, | disaggregate self-employment by professional and managerial status and examine 
whether the gendered family story of self-employment varies by occupational class. 

In addition to illuminating the role of family characteristics on women’s self-employment, 
this paper makes several improvements on past research. First, | examine the full range of 
self-employment activities with a nationally representative longitudinal sample. The self- 
employment literature has focused on incorporated business owners and entrepreneurs 
(Clark and James 1992; Kalleberg and Leicht 1991; Loscocco and Robinson 1991) and 
neglected less profitable self-employed nonprofessionals and unincorporated small 
businesses. This obscures sources of inequality among the self-employed and omits many 
self-employment activities that women and other less advantaged workers undertake (Arum 
1997). The analyses in this paper allow respondents to define whether they are self- 
employed. Thus, the data reflect the full diversity of self-employment and enable examination 
of family characteristics across different forms of self-employment. Second, this study 
explores a rich set of family characteristics including number of children, marital status, family 
income and spousal self-employment. It uses an extensive array of human capital and job 
characteristics, and estimates competing hazards for two occupational classes of self- 
employment. While | combine managers and professionals in one category, for brevity’s sake 
| refer to this category as “professionals” and all other occupations as “nonprofessionals.” 


Theoretical Background 


In contrast to the well-developed body of research that documents how predictors of self- 
employment vary by immigration status, ethnicity and race, differential effects by gender 
remain under-analyzed. Some early research simply excluded women when testing theories 
of self-employment participation (Brock, Evans and Phillios 1986; Fuchs 1982). Other studies 
included women but did not examine differences by gender (Blau 1987; Form 1982). This 
research produced two major individual-level theories of self-employment participation: the 
disadvantaged worker and the class mobility hypothesis. The disadvantaged worker 
argument posits that workers without an attractive mix of human capital become self- 
employed when they are unable to obtain a job in the wage labor market. The class mobility 
hypothesis argues that workers in undesirable jobs turn to self-employment to improve their 
economic situation. Markers of undesirable jobs may include low pay, irregular hours or lack 
of employer-provided benefits. Skilled workers in bad jobs who have sufficient human capital, 
social networks and financial resources to pursue a business venture may become self- 
employed to improve both their immediate and longer-term economic position. 

Carr (1996) argued that these theories explain men’s self-employment, but not womer’s. 
Carr's analysis identified a bimodal pattern for men, wherein the most ambitious workers and 
the workers facing the greatest disadvantages in the labor market turn to self-employment 
to improve their economic situations. However, among women, the self-employed were more 
educated than the wage-employed and thus, did not fit the disadvantaged worker model. 
Moreover, self-employed women, with higher rates of marriage and preschool children and 
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lower rates of working hours, also did not fit the careerist model. Instead, Carr argued that 
self-employment offered women a flexible work strategy to combine competing 
responsibilities from employment and families. She argued that this work and family balance 
model of women's self-employment was consistent across occupations and business 
incorporation statuses. 

But Carr's descriptive analysis left un-theorized a bifurcation among self-employed women 
— two distinct groups working less than 15 hours per week and more than 41 hours. This 
indicates that women may enter self-employment with different goals — one group to reduce 
working hours and increase the time spent in other activities, and another group to increase 
their hours and potential career success. Those seeking reduced working hours might include 
women with greater family commitments. A second group of women may enter self- 
employment with career ambitions similar to men’s or possibly to escape the “glass ceiling” 
of discrimination they face in wage employment. To illuminate this bifurcation in women’s 
self-employment participation and to examine whether the work and family balance 
explanation is consistent across different forms of women’s self-employment, | propose two 
theories to explain women’s self-employment: a work and family conflict theory and a 
careerist theory. 


Self-Employment, Gender and Work-Family Conflict 


Parent-friendly jobs are not institutionalized despite increases in women’s employment. The 
nonprofessional wage work is least likely to be family-friendly. Thus, this perspective predicts 
(1) more mothers than non-mothers enter self-employment due to the institutional constraints 
mothers face in wage work, and (2) the proportion of mothers in nonprofessional self- 
employment should be greater than within professional self-employment. In addition to 
motherhood, other family factors such as marital status, family income and self-employment 
status of one’s spouse should affect women’s self-employment more than men’s and, among 
women, nonprofessional self-employment more than professional self-employment. 

Compared with professional occupations, nonprofessional wage work is more likely to 
lack employer-sponsored childcare, pay wages too low to fund quality private daycare, and 
entail closely supervised and inflexible schedules. Recent research demonstrates that 
workers with lower wages and occupational status are less likely to have access to family- 
friendly benefits (Shore 1998). In addition, women in nonprofessional jobs are less likely to 
have control over the pace and timing of their work which are positively associated with 
reduced work and family conflict (Thomas and Ganster 1995). Self-employment may offer 
female nonprofessional wage workers more autonomy and greater flexibility in setting the 
time and place of work. Past research offers some support for this argument (Connelly 1992; 
Presser 1995). Given this, | expect that motherhood should more strongly predict 
engagement in nonprofessional, than in professional, self-employment. 

There are several reasons why marriage should affect women's self-employment 
participation more than men’s. First, Carr (1996) argues that because women's self- 
employment is often low-earning and does not provide paid leave for caregiving, self-employed 
women require a financial safety cushion provided by a husband's earnings. If so, family 
earnings should positively affect women’s likelihood of self-employment and this effect should 
be stronger for women in lower-paying nonprofessional self-employment. Second, marriage 
might positively affect women’s self-employment if women are more likely to join their 
husband's businesses than vice versa. Recent research finds having a self-employed husband 
dramatically increases the likelihood that a woman will become self-employed (Greene 2000; 
Taniguchi 2002). It seems more likely that a woman's reason for joining her husband's self- 
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employment activity would be to support her husband's career rather than to advance her own 
career? Thus, | predict that having a self-employed spouse will most strongly affect the 
likelihood of nonprofessional self-employment. Finally, marriage may positively affect women’s 
self-employment participation more than men’s if marriage increases women’s family 
responsibilities, and thus work and family conflict, more than it does men’s.® 


Self-Employment, Gender and Career Mobility 


Not all women may become self-employed to balance work and family demands. Some 
women may not experience work and family conflict, for example, single and childless 
women. Some women, like some men, may become self-employed to advance their careers 
or socioeconomic class positions. Conceptually, two distinct groups with these careerist 
ambitions could enter self-employment. One group might be disadvantaged workers fleeing 
bad jobs. Because nonprofessional occupations are more likely to have poor job 
characteristics (low pay, lack of employer-provided health or life insurance benefits, or 
nonstandard hours), | expect bad job characteristics to explain transitions between waged 
and self-employed nonprofessional work. 

A second group of workers may have relatively good jobs in terms of pay, benefits and 
prestige, but desire the increased autonomy, earnings and/or satisfaction of being one’s own 
boss. Because professional occupations are more likely to have good job characteristics, 
workers transitioning between wage and self-employed professional work may be more 
pulled by the attractiveness of self-employment than pushed by the unattractiveness of wage 
employment. This second group of workers should be more likely to be professionals than 
nonprofessionals prior to entering professional self-employment; they should be more likely 
to enter self-employment from wage employment, rather than after a labor force absence; 
and measures of work skill, education and experience should positively affect professional 
self-employment. 

There also may be gender differences among those entering professional self-employment. 
Self-employment may attract career-oriented women who encounter gender discrimination in 
the workplace. Some qualitative research demonstrates the lack of advancement opportunity 
in wage employment encourages women to become entrepreneurs (Moore and Buttner 1997). 
If the source of discrimination is the employer, then a work situation where a woman is her 
own boss would eliminate this type of discrimination.‘ While self-employment may provide 
career-oriented women a way to break the glass ceiling, research supporting this argument 
rarely uses national samples (Cromie 1987). Because women are likely to encounter the glass 
ceiling when they are occupational tokens (Kanter 1977), | predict that women in male- 
dominated professional occupations will be more likely than women in feminized occupations 
to enter professional self-employment. 


Other Factors Affecting Self-Employment 


| control for other factors known to affect self-employment participation. Industrial sector 
contributes to gender differences in self-employment participation (Greene 2000). Employed 
women are more likely to be in non-technical fields such as personnel, public relations and 
service industries which do not prepare them for self-employment (Loscocco and Robinson 
1991). To control for the effects of industry, | include dummy variables for the industrial sector. 

Given the costs of new business ventures, parental assets may influence self-employment 
participation of young workers (Aldrich, Renzulli and Langton 1998). Socioeconomic class is 
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measured by parental education and occupational prestige. | also control for number of 
siblings because family size may affect parents’ ability to assist self-employment activities of 
their children. 

Finally, | include demographic controls for the respondent's age, race/ethnicity, local 
population density, local unemployment rate and region of residence. These variables also affect 
self-employment participation (Arum, Budig and Grant 2001; Boden 1996; Taniguchi 2002). 


Data and Measures 


The panel data for these analyses are drawn from the 1979-1998 years of the National 
Longitudinal Survey of Youth (NLSY), which is a multi-stage stratified national probability sample 
of 12,686 persons ages 14 to 21 in 1979. Annual survey waves continued through 1994 and 
biannually thereafter. The NLSY over-sampled various populations (i.e., minority racial groups). 
Descriptive analyses use sample weights, but due to the spell structure of the event history 
data, sample weights are not used in regressions. | exclude farmers, full-time students and 
active military personnel from the risk sets. This creates a sample size of 12,315 for all analyses. 

| calculate the percentage female in each detailed occupation/industry cell from 1990 U.S. 
Census data (U.S. Bureau of the Census 1993). NLSY data are coded into 1970 occupation 
and industry codes each year. Using the descriptive occupational and industrial titles for 1970 
and 1980, | recoded the 1970 three-digit occupation and industry codes into 1980 codes. | 
then matched the 1980 codes to 1990 occupation and industry codes. 


Principal Variables 


The dependent variable is self-employment status. | define the respondent as self-employed if 
she or he answered yes to the survey question: “Are you self-employed in this job?” Analyses 
distinguish between two subgroups of self-employment based on the three-digit census codes 
for professional and managerial occupations. Professional and managerial occupations (coded 1- 
199) are classified as “professional” occupations, while all other occupations are classified as 
“nonprofessional” occupations. For respondents who enter self-employment, time-varying 
variables are measured at the interview immediately preceding self-employment entrance. All job- 
related variables pertain to the occupation held immediately prior to entering self-employment. 
Respondents who never become self-employed are treated as right-censored observations. 

Time-varying family characteristics include marital status (currently married = 1), number 
of children,® net family income and a dummy variable indicating spousal self-employment. To 

examine gender differences in the effects of family characteristics | interact gender with all 
family variables. Only significant interactions are reported. 

Time-varying measures of human capital and individual labor supply are usual weekly 
hours worked, usual hours squared, years of seniority, years of work experience, education 
(highest grade completed), current school enrolment, number of jobs ever held, and 
employment status prior to self-employment. Non-employed respondents’ values for prior job 
characteristics are set at zero, and | control for non-employment to eliminate the distorting 
effect of these zeroes on coefficients. | also include the Armed Forces Qualifying Test to 
measure work-related cognitive skill.° 

Job characteristics include dummy variables indicating irregular hours (night or evening 
shifts, rotating shifts or on-call work), employer-provided health and life insurance benefits, and 
small firm size (fewer than 20 workers). The gender composition of one’s occupation is 
measured by the percentage female in the cell formed by cross-tabulation Census detailed 
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(three-digit) occupation with detailed industry. | include 12 dummies for industry of occupation. 
For this set of dummies, agriculture, forestry, fishing and mining serve as the reference category. 

Social class of the respondent's family of origin is measured as parental occupational 
prestige and parental educational attainment. | created these items by averaging father’s and 
mother's Duncan SEI scores and highest grades completed, or if a respondent grew up with 
a single parent, using the present parent's value for each measure. Time-varying demographic 
characteristics include respondent's age and age squared, whether the respondent lived in a 
rural, suburban or urban area (rural is the reference category), and region of residence. | also 
include a control variable for county unemployment rate to capture the effects of local labor 
market changes and strength of the economy on self-employment. 


Methodology 


| use a discrete time-event history model (Cox's proportional hazards regression) to analyze 
the transition rates into self-employment. Only the first transition into self-employment is 
modeled. The competing-risks model takes the form of a multinomial logistic regression 
model with three competing outcomes. (For elaboration, see Allison 1984.) One model 
predicts any form of self-employment and a competing hazards model predicts self- 
employment in professional vs. non-professional occupations. In results not shown, | ran all 
models separately for those who reported that their self-employment activity was their “main 
job” vs. those who reported moonlighting in self-employment (working in self-employment 
in addition to a regular wage job). Results did not differ between models.’ 


Results 


Descriptive Findings 


What are the pathways into professional and into nonprofessional self-employment? Figure 1 
presents respondents’ employment status prior to entering self-employment by gender and 
occupational class of self-employment. 

The majority of women (54 percent) who enter non-professional self-employment were not 
employed prior to entrance. This compares with only 27 percent of women in professional self- 
employment and roughly one-third of men in either form of self-employment. Thus, the 
counterfactual for female self-employed nonprofessionals is more likely to be non-employment 
than wage employment, and they are unique in this respect. In contrast, the majority of male 
self-employed non-professionals left waged jobs and typically made lateral occupational moves. 

Women’s predominant pathway to professional self-employment is from professional 
wage employment. Close to half of women entering professional self-employment left a 
professional wage occupation. In contrast, male self-employed professionals are fairly evenly 
divided across the three pre-self-employment origin states. Professionally self-employed 
women are most likely to have made lateral occupational moves into self-employment while 
their male peers are more likely to have made upward moves from non-professional wage 
employment or non-employment. 

To examine change in employment and family situations before and after self-employment, 
Table 1 presents descriptive statistics on these variables prior to and after self-employment. 
Means, standard deviations and t-tests are reported separately by gender, occupational class 
of self-employment and prior employment status. 
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Figure 1. Employment Status Prior to Entering Self-Employment, by Gender and Occupational 
Class of Self-Employment 
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Table 1 shows that women entering non-professional self-employment from non- 
employment average the most children and highest marital rates. Non-professional self- 
employment also appears to attract non-employed women, non-employed mothers in 
particular. Women’s higher marital rates may indicate that their self-employment participation 
needs a financial “safety cushion” consisting of a husband's earnings and benefits such as 
health insurance. 

For those employed prior to self-employment entrance, hourly earnings are higher after 
self-employment, and this increase is largest among female professionals. Self-employed 
female professionals have higher hourly earnings than their male peers. This reverses the 
male earnings advantage found in wage work. However, women in non-professional self- 
employment suffer the lowest earnings. Self-employed female nonprofessionals who were 
previously not employed average a paltry $4.08 per hour. 

Women are more likely than men to have a self-employed spouse. The rates of spousal 
self-employment are highest for women who were previously not employed. Unfortunately, 
there is no measure of whether spouses co-own a business or if one is an employee of the 
other. In additional analyses, | examined self-employed spouses’ occupational pairings. 
Results show that self-employed women with self-employed husbands tend to be 
nonprofessionals in clerical and service occupations. 

While the industrial sectors of self-employed professionals are fairly similar across gender, 
self-employed nonprofessionals are highly segregated by gender. Among nonprofessionals, 
the majority (54 percent) of women is in personal services, but their male peers are 
concentrated in construction and business and repair industries. The occupations of self- 
employed professionals — restaurant, bar or other business manager/owner — are the same for 
men and women. However, in the non-professional self-employment category, men cluster 
into trades and crafts occupations (e.g., auto mechanic and carpenter), while women most 
often are childcare providers. Given that non-professional self-employment has a high 
proportion of previously non-employed mothers, providing paid childcare may offer a route to 
combining full-time motherhood with income generating activities. 


he 
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Findings from Event History Analyses 


While the patterns found in the descriptive statistics are intriguing, they do not control for the 
effects of other variables. Event history analyses presented in Table 2 give the partial effects 
of these variables on the hazard of becoming self-employed and on the competing hazards of 
entering professional vs. non-professional self-employment. The following discussion uses 
risk ratios to describe the percent change in the likelihood of becoming self-employed given 
a one-unit increase in the independent variable. This percentage is computed by subtracting 
1.0 from the risk ratio and multiplying the result by 100. Model Chi-Squares for all models are 
significant at p < .001. 


Effects of Family Variables on the Likelihood of Self-Employment 

In the model predicting self-employment in any occupation, children havea significant impact 
on women’s, but not men’s, likelihood of self-employment. Each additional child increases 
women’s likelihood of self-employment by 11 percent. Likewise, marriage has no effect on 
men’s participation in self-employment, but significantly increases women’s by 53 percent. 
What kind of self-employment does marriage and children encourage for women? The 
competing hazards analysis shows that children encourage women's non-professional self- 
employment, but not professional self-employment. Marriage, however, increases women’s 
likelihood of both kinds self-employment. 

Why don't children affect women’s propensity for professional self-employment? Perhaps 
women in professional wage employment, whose jobs allow relatively more control over 
work schedules, encounter less work and family conflict. If so, children would not push 
women from professional wage occupations into self-employment. On the other hand, 
women professionals may have less work and family conflict because they limit their family 
responsibilities. For example, Table 1 shows that despite similar marital rates, women in 
professional occupations typically have fewer children than women in non-professional work. 

Table 2 also shows that family income has a small, but positive effect on the likelihood of 
self-employment. However, when self-employment is disaggregated, this effect holds only for 
non-professional self-employment. Having a spouse who is self-employed significantly 
increases the respondent's hazard of self-employment by 189 percent. Although “self- 
employed spouse” did not significantly interact with gender, this may simply be a problem of 
Statistical power. As Table 1 shows, self-employed women are far more likely to have a self- 
employed spouse than are men. 

The competing hazards analysis shows that self-employed spouses have a greater effect 
on non-professional self-employment (202 percent) than for professional self-employment (98 
percent). Taken together, family factors more strongly affect women’s likelihood of self- 
employment than men’s. Moreover, these factors encourage self-employment in non- 
professional occupations more strongly than in professional occupations. These findings 
concur with the work and family conflict model of women’s self-employment, especially for 
women entering non-professional self-employment. 


Effects of Human Capital and Labor Supply on the Likelihood of Self-Employment 
In general, human capital measures performed as expected. Education had a positive effect 
on entering professional self-employment and, as prédicted by the careerist model, this effect 
did not vary by gender. Similarly, job skills (measured by AFOQT) positively affect the likelihood 
of self-employment in professional occupations. Being non-employed has a significantly 
positive effect on the entrance to non-professional self-employment, and this effect is much 
stronger for women (64 percent) than for men (31 percent). Non-employment did not predict 
professional self-employment. 
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Effects of Prior Job Characteristics on the Likelihood of Self-Employment 

Overall, the effects of prior job characteristics support the “undesirable wage jobs” theory of 
self-employment. Working in a job with irregular shifts increases the likelihood of non- 
professional self-employment, while having health or life insurance reduces this likelihood, 
but none of these variables affect professional self-employment. In addition, working for a 
small firm greatly increases the likelihood of becoming self-employed. 

The gender composition of prior occupation has varying effects on the hazards of non- 
professional and professional self-employment by gender. Consistent with the discrimination 
explanation of women’s participation in professional self-employment, the percentage female 
of one’s-occupation negatively affects the likelihood of entering professional self-employment 
for women, but has no net effect on women’s likelihood of entering non-professional self- 
employment. Thus, the more masculinized their occupations, the more likely female 
professionals are to turn to self-employment. 

Strikingly, the coefficient for being female is non-significant in the model predicting 
professional self-employment. This variable, however, is interacted with other variables in the 
model and requires careful interpretation. Where female is interacted with marriage, this 
increases the likelihood that women will enter professional self-employment. However, being in 
a feminized occupation reduces women’s propensity for entering professional self-employment. 
Thus, women who are in feminized occupations are less likely, compared to men, to enter 
professional self-employment. But wage-employed women in male-dominated occupations are 
more similar to men in their likelinood of becoming self-employed in a professional occupation. 


Conclusions 


While past research found that family factors have different effects on women’s self- 
employment compared to men’s, the effects of family on women’s self-employment 
conflicted across studies. This paper examined whether the heterogeneity of self- 
employment activities might explain these contradictory findings. Results show that the 
effects of family structure on women’s propensity for self-employment significantly differ by 
occupational class. Family factors positively affect the likelihood that women will enter non- 
professional self-employment. However, women’s entrance into professional self- 
employment follows a careerist model of self-employment. Importantly, women entering self- 
employment in professional occupations are more similar to their male peers in self- 
employment than they are to women entering non-professional self-employment. 

The evidence that women with greater family responsibilities are more likely to become 
self-employed is clear for women entering non-professional self-employment. Married 
women and mothers have much greater likelihoods of entering non-professional self- 
employment. In contrast, marriage and children have little or no effect on men’s propensity to 
become self-employed. The positive effect of marriage is net of family income. This indicates 
that it is not simply the financial resources conferred by marriage that enable women to 
become self-employed (although spousal health benefits may play a role). The additional 
family responsibilities women gain when married may lie behind the marriage effect. Despite 
the low earnings of women in non-professional self-employment, these reduced earnings 
may be offset by the greater potential this form of employment offers to combine work and 
family responsibilities. A surprising factor that highlights the role of family in self-employment 
activity is the large positive effect of having a self-employed spouse on one’s hazard of self- 
employment in a non-professional occupation. Women are much more likely to have a self- 
employed spouse than are men. The opportunity to join a husband's self-employment venture 
encourages women's self-employment in non-professional occupations. Whether the woman 
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is best considered a partner in the family business or a sort of employee of her spouse, 
certainly her ability to take time off work for childcare or other family responsibilities would be 
more supported by her husband than by a disinterested business partner or employer. 

In addition to the direct effects of family factors on women's self-employment, the 
characteristics of women’s job situations that create or ease the conflict between work and 
family are important. The women who enter non-professional self-employment are not 
leaving professional occupations in wage employment. Rather, it is women who are not 
working or are in non-professional wage occupations and who have family responsibilities 
who are most likely to enter non-professional self-employment. Non-professional wage 
occupations often lack the characteristics of professional occupations that ease women’s 
struggle to balance work and family responsibilities. Non-professional occupations usually 
pay less, confer less authority and less control over the pace and timing of work, have more 
rigid schedules, and are comprised of daily tasks that are more heavily monitored than 
professional occupations. All of these factors reduce a woman's ability to check up on her 
child from work, take time from work for family-related demands, and afford quality childcare. 

Findings suggest that women, like men, enter professional self-employment to advance 
their careers. Respondents who enter professional self-employment are older, more educated, 
have greater job skills, and have parents with more prestigious occupations. Women who are 
married and work in a male-dominated occupation have an even greater likelihood of entering 
professional self-employment. Taken together, these findings suggest that those women, like 
men, with greater fiscal and social capital choose professional self-employment to advance 
their careers. It is unlikely that these women become self-employed to balance work and family 
conflict, because children have no effect on professional self-employment entrance. 

Overall, undesirable jobs appear to push men and women into non-professional self- 
employment. The aspects of the job that are unappealing are gendered. Both men and women 
are more likely to become self-employed in non-professional occupations when their job lacks 
regular or daytime hours, life insurance and health insurance, among other characteristics. 
These findings are consistent with predictions of the less desirable jobs argument. What isn't 
explained by the gender-neutral “bad jobs” argument is the impact of family factors on women's 
self-employment participation. However, to the extent that many job conditions are incompatible 
with women’s family responsibilities, family unfriendly jobs are “bad” jobs for mothers. 

Are the self-employment alternatives to less desirable wage jobs better than the original 
job? The extent to which self-employment moves non-professional workers into professional 
occupations is limited: only 29 percent to 37 percent of respondents in professional self- 
employment came from non-professional wage occupations. Within the non-professional 
category, men are much more likely to be in higher-paying crafts and trades while women are 
more likely to be in low-paying service or clerical work, particularly childcare. This segregation 
is more pronounced among the self-employed than among the wage-employed. These 
occupations may enable mothers to work from home and provide an alternative to non- 
employment, but compared to men’s non-professional self-employment, women’s non- 
professional self-employment is less promising in terms of profitability, 

Self-employment appears to be an alternative to a less-than-ideal wage employment position, 
where conditions are family-unfriendly or worker-unfriendly. To the extent that women are pushed 
into self-employment due to work and family conflict or due to workplace discrimination, 
women's rising self-employment participation in the past 20 years may not be the positive symbol 
of women’s progress as it has been portrayed. Moreover, given the greater gender segregation 
in non-professional self-employment, to the extent that self-employment growth occurs in non- 
professional occupations, gender inequality may be exacerbated. This inequality isn't simply 
gendered, however; there also appears to be growing inequality by family status because it is 
mothers, in particular, who are likely to enter low-paying non-professional self-employment. 
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Notes 


The growth of women’s self-employment is not an artifact of women’s increased labor 
force participation, which grew only 23 percent during this period (Blau 1998). 


. This point raises a question: Are self-employed women whose husbands are also self- 
employed actually their husbands’ employees? While possible, self-employed wives could 
also be co-owners of a business with their husbands. To check whether respondents with 
self-employed spouses significantly differed from other self-employed workers, | ran 
models excluding these respondents. Results did not differ. 


. The correlation between marriage and self-employment may also arise from unmeasured 
positive selectivity into both conditions. 


Of course, after self-employment, women may encounter gender discrimination from 
creditors, contractors or consumers (Clark and James 1992; Loscocco and Robinson 
1991). 


Multiple specifications for children were tested — including presence of an infant, number 
of young (0-5) and older (5-18) children, and dummy variables for one, two and three or 
more children. Results showed a monotonic effect for number of children. 


The AFOT comprises four subtests, one each in arithmetic reasoning, mathematical 
knowledge, word knowledge and paragraph comprehension. 


About a third of self-employed men and women are concurrently working in a waged job. 


One reviewer noted that heterogeneity within non-professional and non-managerial 
occupations may affect the findings of this analysis. To test whether findings are robust 
across subcategories of occupations within the non-professional/non-managerial 
category, | divided this category into four subcategories and conducted a competing 
hazards analysis using the fully specified model as in Table 2. The following subcategories 
were used: (1) technicians, sales and administrative support occupations; (2) service 
occupations; (3) precision, craft and repair occupations; and (4) other occupations, 
comprised of farming, forestry and fishing occupations and operators, fabricators and 
laborers. The findings of the gendered effects of marriage, children and self-employed 
spouse on the likelihood of self-employment are robust across the subcategories. While 
there is some variation, these results demonstrate that the heterogeneity of non- 
professional/non-managerial occupations has little, if any, impact on the central findings of 
_ the paper. These supplemental analyses are available upon request. 
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Abstract 

In this paper we test the hypothesis that local economic concentration is associated with 
decreased levels of civic participation. We define economic concentration as a social context 
in which a small number of corporate establishments or industries dominate a local 
economy. We argue that economic concentration leads to a monolithic power structure and 
generates civic apathy because the needs of the corporation override those of the local 
population. To test this hypothesis, we employ combined data from the 2000 Social Capital 
Benchmark Survey and the County Business Patterns. Our findings indicate that local 
economic concentration is negatively associated with traditional electoral participation 
and protest activities. We conclude by discussing implications for current theoretical work 
on civic community, embedded within the empirical decline in U.S. civic engagement over 
the past three decades. 


Introduction 


Sociologists have increasingly turned to civil society perspectives to understand the well- 
being of U.S. communities within the context of a global economy (Lyson, Torres and Welsh 
2001; Tolbert, Irwin, Lyson and Nucci 2002; Tolbert, Lyson and Irwin 1998; Young and Lyson 
1993). These scholars argue that global patterns of production and consumption have 
weakened the role of the nation-state as a mediating factor between multinational 
corporations and the communities in which they operate. An important consequence of 
globalization is that civic participation becomes a central problem-solving mechanism for 
maintaining civic welfare. According to this perspective, communities endowed with high 
levels of civic participation are better positioned to respond to the needs of local residents. 
During the same period in which U.S. communities have attempted to adjust to the 
changes brought about by globalization, studies point to a somewhat ironic decline in civic 
participation among U.S. residents (Putnam 2000). Although this line of inquiry has been hotly 
contested (Monti 1999; Paxton 1999), statistics on civic participation from 1974-1994 point 
to an increase in the number of “civic dropouts” in U.S. communities (Putnam 2000). Whether 
measured as membership in voluntary associations, newspaper readership, voting or signing 
petitions, residents are less engaged in their local communities today than in the past. 
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Despite attempts to independently address both of these issues, researchers have 
overlooked a key concept for understanding the interrelationship between local economic 
structures and civic participation. More than 50 years ago, Floyd Hunter (1953), Irving Fowler 
(1954), and C. Wright Mills and Melville Ulmer (1946) made distinctive contributions to a line 
of sociological research on economic power and civic welfare. The influence of the local 
business structure on civic well-being once occupied a prominent position in the sociological 
literature. In the years immediately following World War Il, sociologists focused on 
understanding how the new mass production economy, characterized by large, corporately 
controlled manufacturing plants, affected social and economic conditions across U.S. 
communities. For these researchers the dynamic transition from small-scale to large-scale 
production in both industrial and agricultural sectors presented a natural experiment for 
understanding the effect of economic concentration on localities. 

The explanatory framework for understanding civic welfare focused on the power 
structure of the community as a means for understanding civic participation and community 
problem-solving activities. Within this framework, local business owners and corporate 
executives, referred to as economic dominants, played important roles in decision-making 
processes. The power structure of the community was conceptualized as a function of the 
concentration of economic activity. For example, monolithic power structures occurred when 
a handful of absentee-owned firms accounted for the vast share of employment within a 
given community. For these communities the interests of the corporation held priority over 
those of the broader citizenry. In turn, local residents became alienated from local decision 
making and problem-solving activities because decision making was guided by select 
leadership circles acting on behalf of the large employers. Diversified configurations of power, 
by contrast, emerged in communities with a wealth of small locally-owned firms. In this 
situation, the interests of the community were synonymous with the interests of the local 
owners because the place of business and place of residence for the owners overlapped. 
Local residents would then be more engaged because problem solving became a pluralistic 
affair comprised of a number of competing interests. 

However, this line of thinking and inquiry virtually disappeared following the early 1970s 
when scholars turned to “mass society” arguments that viewed local communities as 
irrelevant (Bell 1960, 1973; Kornhauser 1959). The abandonment of community power as an 
explanation for understanding civic well-being begs the question: Why should we care about 
a concept that has been largely ignored for the past three decades? 

In short, community power is not a dead concept. Rather, economic power and its 
relationship to civic participation are actually more important today than a half century ago 
because absentee ownership has permeated a wide variety of industries. Recently retail, 
service and wholesale industries have followed the lead of large manufacturers in maximizing 
economies of scale through mass distribution. In some cases the retail sector that was once 
dependent on manufacturers now exerts substantial control over the production of consumer 
goods. Unlike their predecessors, 21st century U.S. communities are populated not only by 
large manufacturing plants, but also by vertically integrated “big box" retail corporations that 
manage both the wholesale distribution and retail sale of products from high-tech electronics 
to groceries. Main Street shopping districts once populated by locally-owned merchants have 
been replaced by a handful of big box retailers. 

To our knowledge, no study to date has employed the concept of community power to 
explain levels of civic participation in the era of the global economy. If we are correct in our 
assertion that the tenets of classical economic power theory do indeed apply to present-day 
communities, we suspect that economically concentrated communities will exhibit lower 
levels of civic participation than their more diversified counterparts. In the analyses that follow, 
we test this hypothesis using data from the 2000 Social Capital Benchmark Survey National 
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Adult Sample and U.S. Bureau of the Census County Business Patterns. Our study informs 
current research in the civic community tradition that has overlooked the interplay between 
the local economy and the problem-solving capacity of communities. 


Locating the Business Structure in Community Life 


Sociologists have increasingly sought to understand structural covariates of spatial 
inequalities in addition to those generated by aggregations of human capital characteristics. 
Recently, the civic community approach has been employed to highlight the importance of 
social and economic structures in generating socioeconomic well-being for communities and 
individuals (Irwin and Tolbert 1999; Lee and Ousey 2001; Lyson, Torres and Welsh 2001; 
Tolbert, Lyson and Irwin 1998; Tolbert, Irwin, Lyson and Nucci 2002; Young and Lyson 1993). 
Communities rich in entrepreneurial spirit and locally-owned small enterprises fare better 
economically than those with a few large absentee-owned employers, because these 
establishments are embedded in the locality and are less likely to relocate or scale back in 
economic downturns. Unlike branch manufacturing plants and corporate retail and service 
establishments, locally-owned businesses are also less exposed to global economic 
fluctuations. Socially, the civic community perspective hypothesizes that civic welfare also 
results from pluralistic political structures with dense networks of associations that create a 
problem-solving capacity for communities. 

Despite growing support for the civic community thesis, little attention has been given to 
the interrelations between the economic and social dimensions of communities. To our 
knowledge, only one study in the past two decades has attempted to explore this issue. 
Young and Lyson (1993) propose a “structural interaction hypothesis,” arguing that civic 
welfare is a function of economic and social characteristics, such that W = f(T*S) where W 
is the level of civic welfare and T represents the characteristics of the business environment, 
such as locally-owned vs. branch plant manufacturing establishments. The community 
structure component (S) of this equation captures organizational capacity to address local 
problems. The business environment and organizational capacity are conceptualized as 
independent dimensions of a community. 

Applying this perspective to civic welfare outcomes in 445 non-metropolitan counties in 
eight southern states, Young and Lyson (1993) find that organizational problem-solving 
capacity buffers the effect of business structures on civic welfare. These authors conclude 
that regardless of the type of business structure (locally- or absentee-owned), organizational 
capacity is a central predictor of civic welfare. The underlying premise of this assertion is that 
broad contextual factors, such as the structure of the local economy, influence the well-being 
of local populations through a set of social institutions (organizations and electoral 
participation) that encompass the organizational problem-solving capacity of a community 
(Young 1999). 

Although Young and Lyson’s (1993) work makes an important statement about the 
potential interrelations between social and economic components of community, we believe 
that the structural interaction hypothesis underestimates the relationship between business 
structures and problem-solving capacity for two reasons. First, business proprietors, 
executives, managers and professionals are typically powerful social actors in their 
communities. These persons participate in voluntary associations, political organizations and 
the electoral system. Simply put, the power and influence of economic actors are not limited 
to the economic realm. These actors also participate in activities that contribute to the 
problem-solving capacity of communities, thus it is difficult to justify the disentanglement of 
economic and social organizations in the manner of Young and Lyson’s work. 
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A second concern with this approach is that business organizations themselves have the 
capacity to influence the problem-solving organizations of a community and, historically, have 
played an important role in U.S. social and political life (Tocqueville 1994). When a small 
number of firms account for a large share of total employment within the local labor force, 
these firms wield a great deal of economic power in local community decision making 
because of their substantial contribution to the economic well-being of the community. 
Alternatively, when employment is more evenly distributed across a large number of 
businesses, power is dispersed and businesses must compete for a voice in local affairs 
(Mills and Ulmer 1946). In this way, firms themselves may influence the configuration of 
power and decision-making by generating institutional isomorphism between the structure of 
the economy and the configuration of decision making. 


Revisiting the Concept of Community Power 


Despite the paucity of recent studies exploring interrelations of community social and 
economic institutions, classic works on community power provide a framework for 
understanding these interrelationships and, in part, address the critiques of the civic 
community perspective described above. Dating back to the years immediately preceding 
World War II until the early 1960s, researchers sought to understand what type of local 
economy generated the greatest level of civic welfare (Fowler 1948; Mills and Ulmer 1946). 
The primary focus of these studies was to compare levels of community well-being between 
towns dominated by one or two large absentee-owned manufacturers and towns populated 
by a large number of smaller locally-owned manufacturers. 

Mills and Ulmer (1946) provide one of the most comprehensive accounts of how 
economic structure influences the level of problem-solving capacity in a community. Their 
comparison of big business and small business towns points out three important effects of 
economic concentration on problem-solving capacity. First, communities with higher levels of 
concentration of economic activity have a weaker independent middle class (Mills and Ulmer 
1946). The independent middle class is comprised of small business owners within the 
community. Local business owners are an important part of the problem-solving capacity 
because improvements to the broader community generally benefit local business owners as 
well. Communities with highly concentrated economic activity possess a smaller independent 
middle class and a weakened problem-solving capacity. 

A second effect of the business structure on problem-solving capacity is through the labor 
market. In economically concentrated communities, executives and managers of absentee- 
owned establishments gain success and upward mobility by advancing in the corporate, 
rather than the community, hierarchy (Mills and Ulmer 1946). Successful corporate employees 
must focus on the interests of the corporation rather than the well-being of the overall 
community. Corporate mobility may also require that a manager or professional change place 
of residence to obtain promotion. In this situation, it is difficult for middle class workers to 
develop social ties and participate in civic affairs. These assertions also apply in the 
contemporary labor market, where managers or professionals often move between 
corporations to gain upward mobility. 

A third concern expressed by Mills and Ulmer (1946) is the distribution of prestige and 
power within the problem-solving activities in the community. Because business owners and 
managers of absentee-owned establishments hold important positions in political and social 
affairs, the concentration of economic activity impacts that concentration of power in decision 
making. When economic activity is dispersed, a large number of business leaders compete 
for social prestige and input into local decision making. In economically concentrated 
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environments, social prestige is obtained through the development of ties with corporate 
managers and professionals. Economic concentration also results in a great deal of power for 
the handful of corporations that operate in a community. As evidenced by many segments 
of manufacturing in recent years, large corporations have the capacity to relocate 
domestically or internationally. Corporations which disagree with a given. policy or reform 
effort may threaten to relocate or withhold support, financial or otherwise. 

A central outcome of economic concentration in the community power perspective is the 
deterioration of civic spirit or what is known today as civic engagement. Though not explicitly 
tested in their analyses, Mills and Ulmer (1946) observed that big business communities had 
less civic participation and that big business “tends to dry up civic spirit.” (141) Studies testing 
these assertions find that there is a dampening effect of economic concentration on civic 
participation. In some cases executives and other corporate actors influenced community 
decision making for corporate well-being but did not otherwise participate actively in local 
organizations (French 1970; Mott 1970). In other cases, corporate executives actively 
participated in community affairs in order to promote the interests of the corporation in policy 
decisions. For example, Coates and Pellegrin (1956) noted in their study of “Bigtown” that 
corporate executives are indeed engaged in civic activity, but only to the extent that 
participation furthers the needs of the corporation. These executives gained control of 
leadership in community decision-making organizations and worked to limit participation in 
these groups to those sympathetic with the corporation's goals. 


Economic Concentration and Civic Behavior 


Although researchers in the community power tradition address the role of business structure 
in civic affairs, these studies have largely overlooked the manner in which economic 
concentration alters patterns of civic participation among community members. Much like 
contemporary civic community research, the underlying dynamics at the individual level that 
result in high levels of civic welfare have yet to be explored. Drawing on the civic participation 
and social capital research literatures, we link the structural influence of economic 
concentration to civic behavior by hypothesizing that economically concentrated 
environments alienate community members, resulting in an apathetic, disengaged local 
population. 

A primary link between economic concentration and community-wide civic participation 
are the notions of efficacy and trust. In his comparison of northern and southern Italy, Putnam 
(1993) finds a strong linkage between civic participation and trust in local government. Not 
surprisingly, the business structure of communities in northern Italy, where efficacy and trust 
are the highest, is characterized as an industrial district economy populated by tightly-knit 
small firms. Norms of reciprocity and trust are established through the sharing of resources 
among businesses (Putnam 1993). 

An important aspect of the industrial districts is that businesses operate in a pluralistic 
fashion by competing and cooperating to maximize profits. Prior studies find that competitive 
environments engender civic participation (Oliver 1999; Verba, Schlozman, Brady and Nie 
1995). In competitive small business environments, there are multiple agendas and 
preferences driving local problem solving. A central feature of pluralism in a small business 
environment is that even though multiple solutions may be posed for a given problem, the 
solutions are locally directed in nature. For example, competing groups in a community may 
propose a variety of methods for revitalizing a downtown shopping district. Regardless of 
which method is chosen, the community and local businesses benefit from these efforts. In 
this case, economic heterogeneity will lead to greater levels of civic participation because a 
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number of local interests seek a common goal, the improvement of the community. In 
contrast, economically homogenous communities lack broad competition for public 
resources resulting in a less active political environment. For these communities, local 
residents may become alienated because corporate goals are prioritized over the solution of 
local problems and general local well-being. 

A second effect of economic concentration on civic behavior relates to the concept of 
"cross-cutting networks.” (Horan 1971; Mutz 2002a) Cross-cutting networks refers to the 
degree that individuals are embedded in environments containing multiple political viewpoints. 
Mutz (2002b) finds that individuals become ambivalent when faced with multiple viewpoints and 
disengage from political participation because they fear sanctions from members of competing 
factions. Thus, when individuals are torn between multiple interests, they are less likely to 
participate. This concept is important for understanding civic participation in economically 
concentrated environments because residents are torn between the dominant corporate 
agenda and efforts that benefit the greater good. Fearing social consequences for supporting a 
non-corporate agenda, individuals disengage from local affairs. 

In the analyses that follow, we test these propositions regarding the influence of economic 
concentration on civic participation. Rather than conceptualizing civic participation on a single 
dimension, such as voting in an election or number of associational memberships, we employ 
two composite measures of civic participation that capture dimensions of traditional political 
participation and participation in protest activities aimed at social change. We test the 
hypothesis that economic concentration is negatively associated with both dimensions of 
civic participation. That is, communities in which economic concentration is greater should 
have lower levels of participation in traditional politics and protest activities. 


Data and Methods 


To test the effect of the concentration of economic power on civic participation, we analyze 
data from the 2000 Social Capital Benchmark Survey National Adult Sample (N = 3,008), a 
probability sample of U.S. households in the continental United States, collected by the 
Saguaro Seminar at the John F Kennedy School of Government at Harvard University 
(Saguaro Seminar 2000). Households were selected using random digit dialing techniques. 
The adult (age 18 and older) in the household with the most recent birthday was selected as 
the respondent. Multilingual interviewers were also employed to accommodate respondents 
not speaking English as a primary language, and up to 10 call-back attempts were made 
following the initial dialing. For the purposes of our study, we exclude persons who were not 
U.S. citizens (N = 218). 

County identifiers were provided for all respondents in the survey. Using these identifiers 
we attached information on economic concentration to the individual-level data. We chose 
counties to contextualize the effect of economic concentration on individual civic behavior 
because large establishments, such as retail supercenters and large manufacturing plants, 
may locate outside of city boundaries. Because these establishments employ persons from 
one or more communities within the county, it is likely that the economic structure of the 
county may influence social conditions in multiple locales. Additionally, 73.3 percent of all U.S. 
workers age 16 and older work within their county of residence (U.S. Bureau of the Census 
2002). Thus, for the vast majority of the U.S. population, the confluence of social and 
economic relations occurs within a single county. 

Respondents included in our analysis come from 1,094 of the 3,101 U.S. counties. These 
counties contain 82.1 percent of the total U.S. population. The omission of individuals from 
many U.S. counties occurs due to the uneven spatial distribution of the U.S. population. 
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Though our analysis is representative of the contextual conditions experienced by the vast 
majority of the U.S. population, we also examined differences in demographic and 
socioeconomic conditions between the counties represented in our analysis and the 
excluded counties. Our comparison indicates that the counties included in our analysis 
contain on average a larger number of residents, are more likely to be part of a metropolitan 
area, and exhibit higher median family incomes than the excluded counties. Despite these 
differences, the respondents in our analysis are drawn from a diverse set of counties ranging 
from isolated rural counties with fewer than 2,500 residents to the largest county in the 
United States. 


Economic Concentration 


Our measure of economic concentration draws on classical conceptualizations posited by 
Mills and Ulmer (1946). In their work, the concentration of economic activity consisted of 
three dimensions: (1) the concentration of employment into a small number of businesses; 
(2) the share of employment accounted for by non-local business owners; and (3) the 
industrial concentration of business activity. In our analysis we construct a county-level 
economic concentration index based on three similar indicators derived from the 1999 County 
Business Patterns (U.S. Bureau of the Census 2001a). 

We operationalize the employment concentration dimension as the percentage of total 
employment accounted for by the four largest business establishments. We use 
establishments rather than firms to calculate concentration ratios because county-level 
business statistics on firms are not available. Four firm concentration ratios are a widely used 
measure of economic concentration in government statistics and economic research (Pryor 
2001; U.S. Bureau of the Census 2001b). The four-establishment Concentration ratio indicates 
the degree to which employment in a county is competitive or oligopolistic. When the four 
largest establishments account for a large share of total employment, a handful of 
establishments control a large share of the labor market and business-related decision 
making within the community. 

The non-local ownership dimension is measured as the percentage of the labor force 
employed in large retail, large manufacturing, corporate headquarters, regional management 
offices, subsidiaries and auxiliary establishments. Because data on business ownership 
(locally-owned vs. absentee-owned) are not publicly available, we use employment size as a 
proxy for non-local ownership. Recent statistics on employment size and ownership confirm 
this assumption (U.S. Bureau of the Census 2001c, 2001d). In manufacturing, the average 
number of employees in single-unit, non-corporate, U.S. manufacturing firm is 8.0 workers. 
For multi-unit corporate manufacturers, the average number of employees per establishment 
is 171.6. Statistics for retailers suggest a similar trend. For grocery stores, single-unit grocers 
average 8.7 employees. The average number of employees for multi-unit grocery stores with 
100 or more establishments is 77.85 employees. 

We define non-locally oriented retailers as retail establishments (NAICS 44) with 100 or 
more employees. We chose 100 or more employees as our classification based on the 
average size of a Wal-Mart store (137 employees). Stores of this size are more likely to be 
absentee-owned by national or global corporations, such as Wal-Mart, K-Mart and Target. 
Non-locally oriented manufacturers are measured as the number of manufacturers (NAICS 32) 
with 250 or more employees. We chose 250 or more employees as the definition of non-local 
manufacturing because large manufacturing establishments, unlike their smaller 
counterparts, are more likely to be branch plants of a larger corporation. In our measure of 
non-local business activity, we also include the number of corporate headquarters, regional 


2248 © Social Forces Volume 84, Number 4 * June 2006 


management offices and subsidiaries (NAICS 55), or the auxiliaries of multi-establishment 
enterprises (NAICS 95). Auxiliaries provide services to other establishments in the enterprise 
such as data processing, warehousing, legal and accounting services. 

The industrial concentration dimension is measured with the Herfindahl-Hirschman index 
of industrial concentration (Rae 1968; Rae and Singer 1973). Industrial concentration is 


calculated as: 
ICED pr 
i=l 


where p; is the proportion of the total labor force employed in an industry. Large values of the 
industrial concentration index indicate a high degree of concentration of employment into a 
specific industry. 

Using principal components analysis, we examined the structure of the three indicators of 
economic concentration. Our findings indicate that the variables load ona single factor. The 
factor accounted for 62.2 percent of the total variance and has an eigen value of 1.87. The 
index of industrial concentration had the highest factor loading (.930), followed by the four- 
establishment concentration ratio (.794), and the percent of the labor force employed in large 
establishments and corporate offices (.609). Large positive factor score values indicate a high 
concentration of economic activity and negative values indicate a competitive, diversified and 
locally-oriented economic context. 


Civic Participation 


The dependent variables in our analysis capture two dimensions of civic participation and 
problem-solving behavior. The components of these measures are reported in the Appendix. 
Our first outcome, the electoral politics index, measures the respondent's electoral participation 
through voter registration, voting in the 1996 presidential election, interest and knowledge of 
politics and current events, and local newspaper readership. The electoral politics index ranges 
from 0-5 where a value of 0 indicates no electoral participation and 5 represents maximum 
participation. The second dependent variable, protest politics, identifies respondents’ 
participation in local reform activities, boycotts, protests and petitions, and membership in labor 
unions, civil rights organizations and public interest groups. Our initial analysis of the protest 
politics measure indicated a highly skewed distribution such that the vast majority of 
respondents either participated in no protest activities or only a single protest activity. Due to the 
observed distribution, we measure protest political activity as a binary variable, where 0 
indicates no protest activities and 1 represents participation in one or more protest activities. We 
hypothesize that residents of economically concentrated counties will be less likely to participate 
in either electoral or protest political activity. Our initial descriptive analysis of the relationship 
between economic concentration and civic participation suggests that economic concentration 
exhibits a significant negative correlation with the electoral politics index (-.105) and protest 
participation (-.120). We also obtained partial correlations accounting for population size of the 
county. After accounting for the effect of population size, economic concentration is a significant 
negative correlate of electoral politics (-.108) and protest activity (-.100). 


Control Variables 


In our models we control for respondents’ spatial, demographic and socioeconomic 
characteristics. The spatial controls in our study include population size of the county, 
categorical measures of the metropolitan status of the county in which the person resides 
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and the region of residence. To correct for skewness in the distribution of population size, we 
include the natural log of population size in our multivariate analyses. Region is divided into 
four categories based on the U.S. Bureau of the Census definitions: South, North, West and 
Midwest. Metropolitan status is classified into three categories: metropolitan county, non- 
metropolitan county adjacent to a metropolitan area, and non-metropolitan county not 
adjacent to a metropolitan area. Our demographic controls include categorical variables for 
sex (1 = female, 0 = male), race/ethnicity (white, black and other), and marital status (married, 
divorced/separated, widowed and never married). We also control for the respondent's age in 
years. Socioeconomic controls variables include categorical measures of education (less than 
high school, high school diploma and post-high school education), and employment status 
(employed, unemployed, retired and other labor force status). In our initial multivariate 
analyses we also included household income as a socioeconomic covariate and identified 
extreme multicollinearity between income and education. We also control for residential 
characteristics including a variable measuring length of residence (in years) and a binary 
variable for homeownership (1 = homeowner, 0 = non-homeowner). 


Analytical Strategy 


To test our hypotheses regarding the effect of the concentration of economic power on 
individual level civic participation and engagement, we estimate contextual OLS and logistic 
regression models for our dependent variables. Our contextual models assess the effect of 
economic concentration on our dependent variables after controlling for our county-level 
spatial controls and individual-level demographic and socioeconomic measures. Descriptive 
statistics for all variables in our models are reported in Table 1. 


Results 


Electoral Politics 


In Model 1 of Table 2 we report OLS regression coefficients predicting the electoral politics scale. 
The electoral politics scale ranges from 0 to 5 and captures the degree to which individuals 
participate in the voting process, follow current events through newspaper readership, possess 
knowledge of politics, maintain an interest in current events and know the names of their 
senators. Findings from this model test our hypothesis that economic concentration is negatively 
associated with participation in electoral politics. We expect that residents of counties with a high 
concentration of economic concentration will score lower on the electoral politics scale. 

Our findings indicate that economic concentration is indeed negatively associated with 
participation in electoral politics. A one-unit increase in economic concentration is associated 
with a -.138 decrease in the electoral politics index. With respect to the spatial controls in the 
model, our regression results point to no significant differences by population size, regions or 
metropolitan status. This finding indicates that economic concentration has a significant 
negative influence on electoral politics after accounting for spatial differences among 
counties in our analysis. 

With respect to the demographic controls in the model, a number of significant findings 
emerge. The coefficient for age is positive and significant such that a one-year increase in age 
is associated with a .031 increase in electoral participation. Compared to males, females 
exhibit lower levels of electoral participation. Hispanics also demonstrate lower levels of 
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Table 1: Means and Standard Deviations for Variables in Model 





Mean s.d. 
Economic Concentration Index 
Four Establishment Concentration Ratio of 
Employment 9.53 8.66 
Four Establishment Concentration Ratio of 
Employment (log) 1.85 .96 
Herfindahl Index , 11.58 4.13 
Herfindahl Index (log) 2.40 29 
Percent Employed in Large 
Establishments and Corporate Offices 12.46 12 
Population Size 
Population 745,542 1,494,290 
Population (log) 12.35 1.64 
Region 
North 19 
Midwest 23 
West 19 
South 38 
Metropolitan Status 
Nonmetropolitan Area, Adjacent to Metro oD 
Nonmetropolitan Area, Not Adjacent to Metro 07 
Metropolitan Area 19 
Demographics 
Age (years) 45.27 16.82 
Female 61 
Hispanic ale 
Race 
Black 18 
Other .09 
White so 
Marital Status 
Separated/Divorced oh 
Widowed .09 
Never Married 22 
Married wo 
Length of Residence (Years) 15.06 13.79 
Own Home 12 
Socioeconomic Status 
Education 
Less than High School .08 
High School Diploma 29 
Greater than High School 62 
Labor Force Status 
Unemployed 05 
Retired 16 
Other Labor Force Status 10 
Employed .66 
Dependent Variables 
Electoral Politics Index (0-5) 3.01 1.32 


Protest Politics Participation (binary) 56 
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participation than non-Hispanic persons. Findings regarding marital status indicate that 
married persons participate in electoral politics more than separated, divorced, widowed or 
never married persons. Home ownership is also an important predictor of electoral 
participation. Persons owning their homes have a significantly higher rate of electoral 
participation than non-homeowners. The length of residence in the community is also 
significantly associated with greater levels of electoral participation. 

Findings for our measures of Socioeconomic status indicate that education has a strong 
and significant effect on electoral participation. Relative to high school graduates, persons not 
completing high school score .618 lower on the electoral participation index and those 
attaining a post high school education score .664 higher on the index. These findings 
demonstrate that educational attainment is associated with higher levels of electoral 
participation. Our findings also point to differences across categories of labor force status. 
Relative to the employed, unemployed respondents exhibit lower levels of electoral 
participation than employed persons. 


Protest Politics 


We present our findings regarding participation in protest politics in Model 2 of Table 2. The 
protest politics measure is a binary variable where “1” indicates participation in protest 
activities, such as petition signing, demonstrations, boycotts, marches, protests, local reform 
efforts or membership in labor unions, ethnic and civil rights organizations, public interest 
groups, political action groups, political clubs and party committees. We hypothesize that 
economic concentration is negatively associated with participation in protest politics, thus 
persons in counties with a high degree of economic concentration will be less likely to 
engage in protest activities. 

Similar to our finding for electoral politics, economic concentration is negatively associated 
with protest politics. Thus, residents of communities with a high degree of economic 
concentration are less likely to engage in protest activities than those in economically 
dispersed settings. Coefficients for our spatial controls indicate that residents of the North 
and the West score higher on the protest politics scale than those from the South. Our 
findings also indicate that there are no significant differences between metropolitan and non- 
metropolitan counties in terms of the level of involvement in protest politics. 

The demographic variables in our analysis indicate that age, sex, race and martial status have 
significant effects on protest activity. Following our model for electoral politics, age is a significant 
predictor of protest politics involvement and females exhibit lower levels of protest politics. 
Relative to whites, persons in the “other” race category (Asian or Pacific Islander, Alaskan native, 
Native American or other race) are 1.45 times more likely to engage in protest activities. In terms 
of marital status, only those who are widowed differ significantly from married persons. 

Coefficients for our indicators of socioeconomic status indicate that education and labor 
force status exert important effects on protest politics. Persons with post-high school education 
participate in protest politics at higher levels than those with a high school diploma only. In terms 
of labor force status, retired persons and those in the other labor force category, such as 
students and homemakers, exhibit lower participation in protest politics than the employed. 


Conclusion 


Decades ago researchers turned to the concept of economic power to understand the role of 
business structures in generating civic participation. In the years immediately following World 
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War Il, the rise of mass production and the opening of branch manufacturing plants in U.S. 
cities presented an important challenge to sociologists. Do economically concentrated 
environments have higher levels of socioeconomic well-being than economically dispersed, 
small business economies? Today, cities across the United States face a very similar, yet 
compounded version of this same problem. Unlike the economy of the 1940s, 1950s and 
1960s, when the vast majority of retail and service activities were conducted by locally-owned 
establishments, retail and service industries are now dominated by absentee-owned 
multinational corporations. Today, extra-local interests increasingly dominate both the 
production and consumption sectors of local economies. 

The consequence of this economic shift is a further disjuncture of the interests of the 
business community and that of the broader population. Mills and Ulmer (1946) point to the 
independent middle class as an important component of civic welfare. Because the interests 
of these business owners generally matched that of the community, a large independent 
middle class ensured that local resources served local well-being. As big box retailers such as 
Wal-Mart, Target, Best Buy and others replace small retail and service establishments, the 
independent middle class of local owners is eroded. Our findings suggest that the absence of 
an independent middle class signals an equally absent civic spirit among local residents. 

In this paper, we address the relationship between local economic concentration and the 
civic spirit that Mills and Ulmer (1946) argue is at the root of civic welfare. Our results 
demonstrate that local economic concentration is associated with lower levels of civic 
participation. We find that residents of communities with highly concentrated economies 
tend to vote less and are less likely to keep up with local affairs, participate in associations, 
engage in reform efforts or participate in protest activities at the same levels as their 
counterparts in economically dispersed environments. We argue that the mechanism 
generating apathy in civic participation is linked to declines in social trust and the notion of 
cross-cutting social networks. Within the context of our study, social trust is enhanced by the 
presence of independent middle class business owners competing for status within the 
community. When the environment consists of a large number of competing political 
interests that are broadly focused upon the improvement of civic welfare, residents are more 
likely to participate because they trust local political processes. At the same time, 
economically concentrated environments lacking a strong independent middle class tend to 
generate civic apathy. Residents are torn between corporate and community interests and 
thus choose to disengage from civic affairs. Although not directly tested in our analyses, our 
findings support the validity of these assumptions. 

The results of this study also have important implications for theories of civic community and 
civic participation. First, our analyses demonstrate that economic environments and local 
problem-solving capacity are part and parcel of a broader civic mechanism that generates civic 
welfare. Prior research on the structural interaction hypothesis (Young and Lyson 1993) suggests 
that problem-solving capacity acts as a buffer between the economic environment and local 
well-being. Our findings support the hypothesized interrelation between the economic 
environment and levels of civic participation. We propose that healthy civic environments 
emerge as the interests of business owners and the general population intersect. When 
business owners and residents are on the same page, civic participation flourishes. 

For students of civic participation, especially those adhering to the “bowling alone” 
hypothesis posited by Putnam (2000), our findings make an important contribution. Although 
Putnam (1993) makes reference to the importance of industrial districts in generating trust 
and norms of reciprocity, the role of the business structure is conspicuously absent from the 
list of potential reasons why civic participation has declined in United States in the past 30 
years. To be sure, increases in dual-earner households, suburban sprawl, hours watching 
television and commuting times provide plausible explanations for civic decline (Putnam 
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2000). However, the decline of small retail and service establishments and economic 
concentration provides an additional analytic tool to understanding this issue. Although our 
findings are cross-sectional, longitudinal analyses of changes in civic participation should 
also consider temporal changes in economic concentration, especially among retail 
establishments. The decline of Main Street shopping districts and the emergence of civically 
nonconductive shopping malls and big box retailers provides a potential explanation for why 
Americans are withdrawing from civic life, isolating themselves and their families through 
consumptive practices. 
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Appendix 


Items in Electoral Politics Index 

1. How many days in the past week did you read a daily newspaper (number of 
days)? 

2. How interested are you in politics and national affairs (very interested, 
somewhat interested, only slightly interested or not at all interested)? 

3. Are you currently registered to vote (yes or no)? 

4. Did you vote in the presidential election in 1996 when Bill Clinton ran against 
Bob Dole and Ross Perot, or did you skip that one (yes or no)? 

5. Could you tell me the names of the two U.S. senators in your state? 


Items in Protest Politics Index 

1. Have you signed a petition (yes or no)? 

2. Have you attended a political meeting or rally (yes or no)? 

3. Have you participated in any demonstrations, protests, boycotts or marches 
(yes or no)? 

4. Did you or any of the groups that you are involved with take any local action 
for social or political reform in the past 12 months (yes or no)? 

5. Are you a member of a labor union (yes or no)? 

6. Are you a member of an ethnic, nationality, or civil rights organizations, such 
as the National Organization for Women, the Mexican American Legal 
Defense or the NAACP (yes or no)? 

7. Are you a member of other public interest groups, political action groups, 
political clubs or party committees (yes or no)? 
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Where’s the Faith in Faith-based Organizations? 
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Abstract ' 

Organizational religiosity is analyzed with data from a national survey of faith-based 
social service coalitions (N = 656). Twenty-one items related to religious practices within 
these organizations result in three distinct factors: service religiosity, staff religiosity and 
organizational religiosity scales. Self-defined faith-based coalitions vary widely on all 
three. OLS analysis regressing 12 coalition attributes on the three scales demonstrates that 
the religiosity measures often relate to the predictor variables in different ways, although in 
two cases there is consistency. Government funding is inversely related to all three 
religiosity measures, and evangelism as a coalition goal is positively related to all three. 


Religiosity is a term that historically has been used to describe and measure variations in 
individuals’ religious commitments along more than a single dimension. The most extensive 
elaboration of the meaning and dimensions of individual religiosity occurred in the work of 
Glock and Stark (1965), who developed what became known as the “5-D" approach to 
religious commitment, including: ritual activities, ideology or belief, experience, knowledge of 
religious matters, and the consequential dimension. According to their conceptualization and 
survey data, these five dimensions can be related, but they also can vary independently. Since 
that time, the question of how to measure individual religiosity has generated a continuous 
flurry of debates and studies on the part of social science scholars of religion. In addition, 
largely due to national surveys conducted by the National Opinion Research Center and the 
Gallup Organization that include questions regarding religious commitment, a rich empirical 
tradition has evolved around issues of individual religiosity. 

Beyond the individual level, what makes an organization “religious” or “faith-based” is not 
well specified in the literature, especially in terms of empirical indicators. The term faith-based 
organization typically suggests a religious congregation with primary missions of worship and 
religious education (Chaves 2004). By definition, congregations are faith-based regardless of 
how they may differ in theology, structure, size, location or types of ministries provided to 
congregants. Since the inclusion of the Charitable Choice provision in the Personal 
Responsibility and Work Opportunity Act (known as Welfare Reform) in 1996 and the 
subsequent establishment of the Office of Faith-Based and Community Initiatives by 
President Bush in 2001, discussion of faith-based organizations has begun to expand beyond 
congregations and include a wide array of entities, which may or may not be linked to 
congregations. This paper focuses on one form of faith-based organization that has grown 
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significantly in recent years — those that provide social services and operate independently 
of any given congregation, although in cooperation with at least some congregations. Virtually 
all have 501(c) 3 tax status. We call these organizations “faith-based social service coalitions,” 
and our national sample of them includes 656 cases. 

Religious nonprofit organizations in the United States are central to the social welfare 
system because of the money they collect for charitable activities and the services they 
provide. In 1995, for example, religious organizations in the United States received more than 
$60 billion, an amount that represented 44 percent of all charitable giving (Kaplan 1996). In 
terms of total revenues, religious organizations constitute the third largest sector of U.S. 
nonprofit organizations, behind health and education. More than 50 percent of American 
adults contribute to their church, synagogue, mosque or temple, representing 60 to 65 
percent of total household giving (Independent Sector 1993). While the majority of religious 
contributions go to maintaining the religious activities of congregations, large sums of money 
as well as volunteer time and in-kind donations are also allocated to the provision of social 
services. In 1991, approximately 40 percent of the $53.3 billion spent by congregations on 
activities were allocated for nonreligious education, health care and social services 
(Independent Sector 1993). McCarthy and Castelli (1998) estimate that religious 
congregations, national religious networks and free-standing religious organizations spend 
between $15 and $20 billion of privately contributed funds a year on social services. 

Despite the magnitude of the religious nonprofit sector, until the mid-1990s research on 
nonprofit organizations largely ignored it. Indicative of this neglect is the fact that, of the 2, 
195 works listed in Layton’s Philantropy and Voluntarism: An Annotated Bibliography (1987), 
only 2.1 percent of citations refer to religious organizations. In the past 10 years, however, 
religious nonprofits have begun to garner the attention of scholars and policy makers 
interested in the nonprofit sector, fueled substantially by the Charitable Choice legislation. 
(See Nonprofit and Voluntary Sector Quarterly, summer 1994; Cnaan and Boddie 2001; 
Wineburg 2001.) 

The recent surge of scholarly interest in the role of faith-based organizations in the 
provision of social services (e.g., AMmerman 2005; Bartkowski and Regis 2003; Chaves 1999; 
Cnaan and Boddie 2001; Farnsley 2003; Kennedy 2003; Monsma 2004) has provided rich data 
on issues such as quantity, types and outcomes of programs, collaborative arrangements 
among agencies and funding streams. While most of these studies allude in general terms to 
the issue of what makes such programs faith-based, there exist no clearly defined empirical 
measures for determining this. In an earlier study comparing secular and faith-based agencies 
that serve the homeless in Houston (Ebaugh et al. 2003), we documented clear differences 
between the two types of agencies in terms of specific expressions of religiosity. We also 
found that no one simple measure, such as self-definition as faith-based, organizational name 
or even mission statement suffices to clearly separate faith-based from secular agencies. 

Given the fact that most previous research has focused on congregations and 
denominations, which are, by definition, religious organizations, the issue of what constitutes 
a faith-based organization was irrelevant. It is the recent focus on faith-based social service 
agencies, which are often independent of both congregations and denominations, that makes 
the definitional issue pertinent. 


Conceptualizing the Faith Factor in Faith-Based Organizations 
Despite the lack of empirical measures of religiosity in faith-based organizations (FBOs), there 


are a number of conceptualizations of what constitutes the faith factor in religious 
organizations. One of the most widely quoted is Jeavons’ (1998) description of seven key 
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areas in which faith manifests itself within organizations: self-identity; religious convictions of 
participants; the extent to which religion helps or hinders the acquisition of resources; the 
extent to which religion shapes goals, products and services; the impact of religion on 
decision making; religious authority and power of leadership; and the extent to which religion 
determines inter-organizational relationships. Rather than arguing for a dichotomous 
Classification of faith-based vs. secular agencies, Jeavons posits that these characteristics are 
variables representing the degree of organizational religiosity, ranging from explicitly religious 
to completely secular. While Jeavons' scheme is frequently cited by those who study faith- 
based organizations, the measures we present in this paper are among the first to 
operationalize several of his dimensions. 

Smith and Sosin (2001) distinguish between faith-related agencies and faith-based agencies 
by emphasizing that the former term is more encompassing and includes organizations that 
have some link to religion at the institutional level, not simply at the level of personal belief 
systems. They argue that the degree to which an agency is linked to faith may be 
conceptualized as the extent of the “coupling” between the agency and resources, authorities 
and cultures that represent relevant faiths. An agency that is tightly coupled to faith is more 
closely connected to denominations or congregations than one that is loosely coupled. 

The Working Group Report on Human Needs and Faith-Based and Community Initiatives, 
Finding Common Ground (2002), specifies structural indicators that can be used to place 
religious social service organizations on a continuum, ranging from “faith-saturated” to “secular,” 
with “faith-centered,” “faith-related,” “faith background” and “faith-secular partnership” as values 
between the two extremes. Characteristics of religiosity that are used to locate organizations on 
the continuum are: mission statement; founding for a religious purpose; religiousness of board 
members, senior management and staff; affiliation with external religious agencies; financial 
support from religious sources; religious content of program; positive connections between 
religious content and program outcomes; and religious environment (name, building, religious 
symbols). 

Monsma (2004) uses a list of religiously-rooted practices to differentiate between faith- 
based/integrated and faith-based/segmented welfare-to-work programs. The first type 
incorporates religious elements into welfare-related services, such as using religious values 
or motivations to encourage clients to change behaviors or hiring only staff with a particular 
religious orientation. In the second type, religious elements or activities are largely separate 
from services provided by the organization, such as placing religious symbols or pictures in 
the facility where programs are offered. 

Sider and Unruh (2004) insist that it is programs rather than organizations that are faith- 
based because different programs within a single organization can vary widely in their 
religious content. They suggest that religious dimensions of social service programs are of 
two types: environmental (the creation of a religious environment apart from client 
interactions) and active (religious elements that involve direct communication of a religious 
message to clients). Chaves (1993) has identified two structures within religious 
organizations, each of which claims competing sources of authority: a religious authority 
structure, which enforces its claims by appealing to the supernatural, and an agency authority 
which emanates from bureaucracy and rationality. The more faith-based an organization, the 
greater its reliance on religious authority for legitimacy. 

Despite differences in the labeling of organizations and the dimensions on which to 
assess the faith factor, all of the conceptualizations rest on the assumption that organizational 
religiosity exists on a continuum in which some organizations are more religious than others. 
What the literature lacks are operationalized indicators of that continuum and their application 
to actual organizations. Rather than beginning with assumptions about what constitutes 
organizational religiosity, we drew upon the existing models of the factors that presumably 
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define it to develop a wide ranging list of questions about mission and goals, policies, 
practices and programs. We fielded a national survey of faith-based social service coalitions 
to which we submitted all of the relevant items. We then conducted factor analyses in order 
to discover patterns among the items, resulting in three discreet factors, each representing 
one dimension of organizational religiosity. After discussing our methods and sampling, we 
describe the composition of the three organizational religiosity scales. We then examine their 
relationship with a number of predictor variables based on hypotheses rooted in our fieldwork 
and previous studies of nonprofit social service organizations. 


Sample and Methods 


Selecting a term to describe the type of faith-based social service organization that we had 
identified in previous studies was a challenge. While some researchers and practitioners use 
the term “community ministry” to indicate collaborations among congregations to offer social 
services, the term is also frequently used to mean social outreach programs of specific 
congregations (Ammerman 2005; Chaves 1999) as well as a myriad of faith-based programs 
targeted at community change. The term “coalition” also has its drawbacks because it is used 
to describe all kinds of alliances/collaborations. Nonetheless, we opted for the term faith- 
based social service coalition but use it in a very precise sense to include only organizations 
that meet four criteria: (1) the organization defines itself as faith-based; (2) it delivers at least 
one social service (from an extensive list of service types); (3) religious congregations are in 
some manner affiliated with the organization; and (4) it has its own board of directors. Follow- 
up research demonstrated that virtually all also enjoy 501 (c) 3 tax status. We developed a 
questionnaire to elicit a broad range of information about faith-based social service coalitions, 
including how they are structured, the range of services and programs they offer, funding 
sources, religious expression, client, volunteer, board and employee characteristics, and the 
religious and racial/ethnic characteristics of affliliated congregations. 

The Interfaith Community Ministries Network, an organization that identifies community 
ministries engaged in social service delivery, has developed a list of about 1,300 faith-based 
social service organizations (Pipes 2001). We augmented this list using the worldwide web 
and the Yearbook of American and Canadian Churches (Lindner 2002). The result was a 
mailing list consisting of 1,186 organizations. At the end of the questionnaire, we asked that 
respondents identify other organizations like their own. We then sent questionnaires to this 
snowball sample of 297 organizations. Of the 1,483 questionnaires sent out during the 
summer and fall of 2002, 612 were returned for a response rate of 41 percent. Preliminary 
analysis of the data revealed that the sample was regionally biased to over-represent the 
South (about 50 percent of the returned questionnaires). We therefore purchased a national 
list of “social service and welfare organizations” from InfoUSA, culled by them to focus on 
22 states, primarily in the West and Northeast. After culling their list to identify those whose 
names suggested that they were most likely to fit our definition of a faith-based social service 
Coalition, the final wave of questionnaires was mailed to 555 organizations in January 2003. 
The return rate was 39 percent (N = 217). Combining all waves, 2,038 questionnaires were 
mailed, of which 829 were returned. This 41 percent response rate is considered by 
methodologists as robust for mailed surveys sent to organizations and filled out by top 
executives (Moncrief, Reisinger and Baldauf 1999). 

A number of the completed questionnaires came from organizations that do not fit our 
definition of a faith-based social service coalition; 173 (21 percent) were dropped because 
they failed to meet one or more of our four criteria. Our final sample of faith-based social 
service coalitions numbered 656. Given the absence of a complete list of coalitions, the 
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population we are studying is unknown; hence, it is impossible to draw a random sample of 
them. Our study, however, is the first to examine the apparently wide-spread and growing 
phenomenon of faith-based social service coalitions on a national level. 


Measuring Coalition Religiosity 


Prior to the survey, we had conducted two field studies of faith-based social service coalitions, 
projects that provided us qualitative data that helped in the construction of survey questions 
(Pipes 2001; Pipes and Ebaugh 2002). In addition to these two qualitative studies, we 
conducted a pilot survey in Houston of 89 agencies that provide services to the homeless 
(Ebaugh et al. 2003). Our sample included 53 self-identified “secular” and 32 “religious” 
agencies. Questions were developed to compare the two types of agencies in terms of: (1) 
basis for decision making, (2) resource preference, (3) organizational culture, and (4) 
organizational practices. We concluded that religion infuses faith-based agency self- 
presentation, personnel, resources, decision-making processes, and interactions with clients 
and among staff, which otherwise function no differently from their secular counterparts. We 
used findings from this study in developing the items in the national survey to tap variation 
among faith-based organizations in their expressions of religiosity. 

Our survey questionnaire included 21 items related to religious policies and practices 
within faith-based social service coalitions. We found substantial variation in the responses of 
coalitions to all of these items. These items were then subjected to factor analysis, from 
which three distinct factors emerged, each with eigen values greater than +1.0. Table 1 
presents the items and factor loadings for the 18 items that factored. Collectively, the three 
factors account for 63 percent of the correlation matrix. When necessary, items were recoded 
so that higher values are consistently associated with greater religiosity. 

Factor 1 we label “service religiosity.” It is composed of 10 items and has a very high alpha 
level of .949. This scale concerns the extent to which staff members incorporate religion into 
their interaction with clients (e.g., practices such as distributing religious material to clients, 
praying with them and using religion to encourage them). Factor 2, “staff religiosity,” 
consisting of five items with a respectable alpha of .744, concerns the role of religion in hiring 
and motivating staff and religious behavior among staff. Factor 3 consists of only three items 
that comprise “formal organizational religiosity,” or the extent to which the “public face” of the 
coalition is explicitly faith-based. The relatively low alpha level of .520 is acceptable for small 
scales (Robinson, Shaver and Wrightsman 1999). 

Three other variables concerning coalition religiosity failed to factor with the formal 
organization religiosity scale (or either of the other two), despite appearing as if they should 
be closely related to the public face coalitions present as faith-based entities. The first 
concerns coalition names, which were coded as unambiguously religious (78 percent), 
ambiguous (5 percent) or secular (17 percent). Although its factor loading was too low to 
include in the formal organization religiosity scale, the two measures are moderately 
associated with one another; the Pearson Correlation Beta is .227 (p = 000). We also have 
measures of the percentage of each coalition’s board that is clergy and the percentage that 
represent affiliated congregations, neither of which factor with any of the religiosity scales. 

Given incommensurate response metrics among questions, coalition factor scores were 
converted to Z scores, which constitute the three variables, the ranges and skewness of 
which are shown on Table 1. Positive Z scores mean higher levels of religiosity relative to a 
mean of zero and standard deviation of |, negative scores the opposite. The ranges and 
skewness of Z scores for the three factors vary considerably. At one extreme is the formal 
organization religiosity factor, where the range is quite small and shows only slight negative 
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Table 1: Items and Factor Loadings, Coalition Religiosity 


Factor 1 Factor 2 Factor 3 
Formal 
Service Staff Organization 
Religiosity Religiosity Religiosity 
(a = .949) (a = .744) (a = .520)° 
Items: 
Distribute religious materials to clients .846 
Help clients join congregations 844 
Pray with individual clients .898 
Pray with groups of clients .870 
Use religious beliefs to instruct clients .890 
Encourage client religious conversion .897 
Use religion to encourage clients 824 
Provide info about local congregations .663 
Programs require religious conversion 161 
Policy re: religious discussion w/clients 178 
Pray at staff meetings .678 
Favor religious job candidate 045 
Put religious principles into action .140 
Demonstrate God's love to clients .803 
Inspire clients’ faith via staff's actions 142 
Religiously explicit mission statement 154 
Organizational leader ordained clergy .653 
Sacred images in public spaces .134 
Z Scores: 
Skewness rie -1.35 -.45 


Range -1.25-+2.31 — -3.37-+1.20 1.84 — 41.15 





skewness. The widest range concerns staff religiosity, which has a considerable negative 
skew, perhaps reflecting a part of the sample that is subject to pressure to resemble secular 
social service agencies. The range of Z scores for the service-oriented religiosity factor goes 
in the opposite direction; it has a moderate positive skew. This suggests that a subset of 
Coalitions is likely to involve clients in expressions of religiosity beyond what is normal for the 
sample. For each scale we delete all cases for which there are missing values on any of the 
component items. This leaves us with Ns of 379 for service religiosity, 447 for staff religiosity, 
and 558 for formal organizational religiosity for multivariate analyses below. 


Correlates of Organizational Religiosity 


In the remainder of this paper, we develop and test hypotheses relating organizational 
religiosity to other organizational variables, based on findings from previous studies of 
nonprofit and religious organizations. Coalition Z scores on the three religiosity scales 
comprise the three dependent variables for OLS regressions using 12 predictor variables, 
grouped into three clusters: those concerning affiliated congregations, those pertaining to 
congregational resource provision, and those describing coalition attributes. Table 2 presents 
descriptive statistics for all predictor variables. 
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Table 2: Descriptive Statistics, Predictor Variables 


Predictor Variables: Range Mean (SD 
Characteristics of Affiliated Congregations: 
Religious Heterogeneity 1-6 ~ 4.00 
(1.40) 
Evangelism Scale 3-1 1.90 
0)) 
Resources Provided by Congregations: 
% Budget Religious Organizations 0-100 24.90 
(27.00) 
% Board Clergy 0-100 22.30 
(22.70) 
% Board Congregation Representatives 0-100 62.40 
(54.40) 
# Volunteers/Week 0 - 1000 71.70 
(220.40) 
Coalition Attributes: 
Total Income $500-$78 mil. $1.2 mil. 
($ 4.5 mil.) 
# Paid Managers & Professionals 0-565 4.70 
(23.70) 
% Budget Government 0-100 16.50 
(25.00) 
Social Activism Scale 0-4 1.46 
(1.26) 
Religiosity Coalition Name 3-1 2.60 
(.76) 
Proselytizing as Goal 1-5 2.24 
(1.30) 





Predictor Variables and Hypotheses 


Characteristics of Affiliated Congregations 


The sample coalitions vary greatly in the extent to which they embrace theologically diverse 
congregations. Respondents were presented with a long list of specific faith traditions and 
asked how many of their affiliated congregations are of each type. From this data we 
developed a six-fold religious heterogeneity scale, ranging from all congregations of the same 
faith to fully interfaith. Because different faith traditions vary in their meaning systems, goal 
priorities and religious expressiveness, in an effort to prevent conflict, we predict that: 


Hypothesis 1: The more religiously heterogeneous the congregations 
comprising a coalition, the lower the level of coalition religiosity. 


In the past decade, an increasing number of studies have focused on the rapid growth of 
evangelical churches in the United States ( Regnerus, Sikkink and Smith 1999; Smith et al. 1998), 
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although there is no universal measurement of the term “evangelical.” We use the dichotomous 
scheme developed by Steensland et.al (2000), which classifies American religious groups on the 
basis of theological criteria derived from denominational creeds. Based on our survey data, we 
developed a three point evangelical scale, where we code as “3” those coalitions composed of 
all evangelical congregations (4 percent); “2” includes at least one evangelical congregation (83 
percent), and “1” has no affiliated evangelical congregations (13 percent). Because evangelical 
congregations tend to emphasize personal conversion and evangelizing to a greater degree than 
mainline congregations, it seems reasonable to predict that: 


Hypothesis 2; The greater the involvement of evangelical congregations, 
the greater the coalition religiosity. 


Resources Provided by Congregations 


Coalition ministries are typically funded from a variety of sources, including congregations, 
governmental entities, foundations, corporations, fund-raising events, individuals and fees for 
service. In addition, non-monetary resources — such as board members, volunteers and in- 
kind donations — come from varied community groups, especially member congregations. 
Organizations may have to modify their structures and/or activities in various ways in order 
to maintain the support of funding agencies (Gronbjerg 1993). At a minimum they must 
provide information and access to the representatives of funding agencies. There is the 
tendency, therefore, for agencies to comply with, or at the least not to threaten, the goals and 
values of funding agencies in order to maintain funding support. We predict, therefore: 


Hypothesis 3: The more resources flow to coalitions from congregational 
sources, the greater their level of religiosity. 


We use four measures of congregational resource provision: (1) percentage of coalition 
budget provided by congregations and judicatories, (2) percentage of board who are clergy, 
(3) percentage of board who are congregational representatives, and (4) average number of 
volunteers per week, based on the assumption that most volunteers are recruited through 
affiliated congregations. Hypothesis 3 predicts positive relationships between these four 
measures of congregational resource provision and the three religiosity scales. 


Coalition Attributes 


Given that faith-based social service coalitions are part of the larger environment of nonprofits 
that provide social services, their cultures and organization should resemble those of secular 
nonprofits, especially those that are larger and employ more professional staffs. To the extent 
that such coalitions resemble the broader institutional field of secular social service agencies: 


Hypothesis 4: the higher the total income and the greater the level of 
staff professionalization, the lower the level of coalition religiosity. 


Total income is our measure of size. It was calculated by adding income received in fiscal 
year 2001 from each of 11 specific sources specified on the survey, plus an “other” category. 
Information was gathered directly on the number of paid employees in professional and 
managerial positions, our measure of organizational professionalism. 
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Until the Charitable Choice legislation in 1996, coalitions that received government funds 
were prohibited from proselytizing or displaying their religious character. Even with the 
assurances provided by Charitable Choice that an agency could receive government monies 
and maintain its religious character, many faith-based agencies continue to fear that receipt of 
such funds will jeopardize their religious mission (Chaves 1999). Coalitions that are more 
committed to integrating religion into their organizational culture should, therefore, be less 
likely to apply for and receive government funds than those that are less religiously 
expressive. In addition, those who rely more heavily on government funding may be most apt 
to conform to norms developed by the institutional field of secular agencies. Therefore: 


Hypothesis 5: The greater the percentage of budget derived from 
governmental sources, the lower the level of coalition religiosity. 


While earlier studies (e.g., Reid 1999) showed that many faith-based organizations fear 
that acceptance of government funds will constrain their ability to advocate in terms of social 
issues, in a more recent study, Chaves, Stephens and Galaskiewicz (2004) demonstrate a 
positive relationship between government funding and political activism. To the extent that 
government funding and social activism are positively related, we logically predict that: 


Hypothesis 6: The higher the level of social activism, the lower the 
level of coalition religiosity. 


From a list of nine types of social service activities asked of respondents, we developed an 
additive, five-level (0-4) sca/e of social activism, based on the number of four specific activities 
in which coalitions claim to engage: advocacy, community development, forums/workshops 
on policy issues, and community organizing. 

Earlier, we described how we measured the religiosity of coalition name, a variable that 
failed to factor with any of the three scales yet clearly represents yet another way by which 
coalitions express their religious nature. We predict that: 


Hypothesis 7: The more explicitly religious the coalition name, the 
higher the level of coalition religiosity. 


Finally, the survey included a list of 10 organizational goals and asked respondents to rate 
the importance of each on a five-point scale ranging from “one of the most important” (coded 
5) to “not important” (coded 1). One of these goals is to “provide evangelism opportunities for 
congregations.” Our logic is the same as that we used when discussing the evangelism scale 
(Hypothesis 2), resulting in the prediction that: 


Hypothesis 8: The more important proselytizing is as a Coalition goal, 
the higher the level of coalition religiosity. 


Analysis of Data 


We use OLS regressions to examine relationships between each of the three measures of 
coalition religiosity and the predictor variables. Missing values for predictor variables are 
imputed by using the expectation-maximization process, an interactive algorithm based on 
maximum likelihood estimation, from the SPSS Missing Values Analysis module. The 
maximum value for a bivariate correlation matrix for all predictor variables is .600 (and most 
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are well below .200), which typically would not be considered strong enough to create 
significant multi-collinearity. In addition, the maximum VIF for an individual variable is 1.50 and 
the mean for the full set of independent variables is 1.19. Given the standard of “10” for the 
VIF value, there is not significant evidence of multi-collinearity. 


Table 3: Religiosity Scales by Predictor Variables (OLS) 


Service Staff Organization 
Religiosity Religiosity Religiosity 
Robust Robust Robust 
Predictor Variables: Beta Std. Err Beta Std. Err Beta Std. Err 


Affiliated Congregational Characteristics: 
Religious Heterogenity -.010 028 -.007 .034 -.010 031 
Evangelism Scale .064* .083 Cae alo .005 .099 


Congregational Resource Provision: 
% Budget Religious 


Organizations ese 163 -.030 176 .043 144 
% Board Clergy .027 161 .036 195 SOF aes te 
% Board Congregation 
Reps. -.053 .089 .074 .088 -.023 .083 
# Vols/Week -.070 .000 .038 .000 .059 .000 
Coalition Attributes: 
Total Income aie .000 041 .000 .044 .000 
Pd. Mgrs & Profs. -.003 .006 .003 .000 -.036*** 000 
% Budget Govt. =1iZ7 = ahso -.133* — .190 -.124** 174 
Social Activism Scale oi cae eee, -.018 .033 LE a A? 
Religiosity Coal. Name .040 .037 .094* .056 Ave" . 3048 
Proselytizing As Goal 000 | Ode ag oe Bor 030) 
N (379) (447) (558) 
R? 574 .249 241 





p< 05 *p< 01 **p <.000 


Table 3 shows the OLS results using the three religiosity factor scales as dependent 
variables. The most obvious findings are: (1) the more important proselytizing is as a 
coalition goal, the higher the religiosity scores on all three scales; (2) those whose names 
convey their faith-based nature have higher levels of religiosity on two of the three scales; 
and (3) coalitions that include or are entirely composed of evangelical congregations tend 
to have higher service and staff religiosity scores than those that do not. The remaining 
religious variables show little if any relationship to any measure of coalition religiosity. 
Religious heterogeneity of congregational affiliates has no effect on any of the three 
religiosity measures. Likewise, greater coalition reliance on religious organizations as a 
funding source and number of volunteers are unrelated to any of the three religiosity 
measures nor is the proportion of the board composed of congregational representatives. 
The proportion of clergy on the board only affects organizational religiosity. 
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One of the most consistent findings relates to the negative relationship between 
government funding and religiosity. (See also, Ebaugh et.al 2005.) Reliance on government 
funding is significantly and negatively related to all three measures of religiosity. Given the 
cross-sectional data provided by our survey, it is difficult to know whether public funding 
agencies discriminate against faith-based groups or whether more religiously expressive 
groups are reluctant to apply for government monies. In another paper using this data set 
(Ebaugh, Chafetz and Pipes 2005), we demonstrate a negative relationship between those 
that applied for government funding and religiosity. Those who explicitly sought 
government funding are less religious in their behaviors and policies, and have more 
positive attitudes toward government funding than those that had not made such 
application. The three remaining predictor variables are each significantly related to some 
but not all three measures of coalition religiosity. Finally, social activism is significantly 
associated in a positive direction with organizational religiosity but inversely with service 
religiosity. This finding implies that coalitions that present a strong public persona as faith- 
based tend to get involved in social action issues while those that express their religiosity 
more in terms of interaction with and service to clients are less politically active. 

Given the size of the Betas and the directions of their signs, we can characterize 
coalitions whose policies and practices most strongly encourage religious expression, 
relative to those that don't, as: placing a very strong value on proselytizing and frequently 
comprising evangelical affiliate congregations, having a low level of reliance on 
government as a source of funding but a somewhat higher total income, and engaging in 
little social activism. The explained variance for this scale is especially high (.574). In a 
similar vein, those coalitions whose policies and practices demonstrate higher levels of 
staff religiosity also place a high value on proselytizing, more often include evangelical 
congregations, have a low level of reliance on government funding, and names likely to 
express their faith-based character. When we turn to examining the public “face” of 
coalitions, that is, the formal organizational religiosity scale, many more predictor variables 
are significantly related to it than to the other two scales. Moreover, in some cases they 
are related in different ways to this scale than to the other two. The Beta for proselytizing 
as a goal remains significant, but is far lower than for the other two religiosity scales 
(especially service religiosity), and the evangelism scale is no longer related at all. The 
negative relationship with government funding remains significant. Finally, the more 
organizationally religious coalitions engage in a higher level of social activism and are 
more likely to have a religiously explicit name. Clearly, the correlates of this dimension of 
coalition religiosity are quite different from those of the other two dimensions. 


Discussion and Conclusions 


In light of our findings, we now return to the question of what makes an organization faith- 
based. Clearly our data challenge those typologies that are based on the notion of a uni- 
dimensional continuum. Jeavons’ (1998) contention that strongly religious organizations 
score highly on all or most of seven dimensions while secular organizations score low on 
them suggests one continuum of religiosity. Likewise, the Working Group Report on 
Human Needs and Faith-Based Community Initiatives (2002) report clearly asserts a 
continuum, ranging from “faith-saturated” to “secular” with varying levels of religiosity 
between the two extremes. In contrast to these approaches, we have identified three 
discreet dimensions on which organizations vary in terms of religiosity: in the manner in 
which they relate to clients (service religiosity), in the manner in which staff are hired and 
relate to one another (staff religiosity), and in the public face that organizations present 
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(organizational religiosity). The fact that the items constituting each scale factor together 
with relatively to very high eigen values demonstrates their discreetness. In addition, 
Pearson bivariate correlations between the scales indicate that, while moderate overlap 
exists, the Betas are considerably lower than would occur if a single dimension of 
religiosity defined them as faith-based (i.e., the Betas correlating service religiosity with 
organizational = .444, and with staff religiosity = .636; the Beta for staff and 
organizational religiosity = .498). Our data, therefore, indicate that religiosity is a 
multidimensional concept when applied to faith-based organizations. The use of a single 
continuum to indicate variations in the degree of religiosity of an organization is too simple 
to convey this multidimensionality. 

Our data further indicate that the three religiosity scales relate differently to other 
organizational variables. For example, having more evangelical congregations as coalition 
members is associated with increases in both the service and staff religiosity scales within 
coalitions, but not with the level of organizational religiosity. Likewise, the more socially 
active the coalition, the less religiously expressive it is in its service delivery, but the more 
religious it is in its public face (as measured by the organizational religiosity scale). These 
varying relationships among organizational variables and religiosity measures further 
exemplify the fact that religiosity is not a uni-dimensional characteristic of faith-based 
organizations. 

There is no support for Smith and Sosin’s (2001) argument that the institutional 
coupling of an agency with the resources, authorities and cultures of particular faiths 
(what they define as a “faith-related agency”) affects the organization's structure, its 
religious Culture and its delivery of services. Their approach should result in significant 
inverse relationships between the degree of. religious heterogeneity of affiliated 
congregations and the three measures of coalition religiosity. However, no evidence of any 
relationship between the religiosity measures and religious heterogeneity was found. 

Our data offer partial support for Monsma’s (2004) typology differentiating faith- 
based/integrated and faith-based/segmented programs and/or organizations. While he 
includes in the first type both the use of religious values and motivations to encourage 
clients to change behaviors and hiring only staff with a particular religious orientation, our 
data indicate that these two sets of religious behaviors/policies operate independently. 
The first set is part of our “service religiosity” scale and the latter part of our “staff 
religiosity” scale. Monsma’s conception of faith-based/segmented programs reflects very 
closely our organizational religiosity scale. Likewise, Sider and Unruh (2004) develop a 
dichotomy of program types; the dimensions of one, “active,” is isomorphic with our 
service religiosity, but the other, “environmental,” subsumes both our organizational and 
staff measures. 

Our findings demonstrate clearly that organizational religiosity is a three-dimensional 
phenomenon and that self-defined faith-based organizations vary extensively on all three 
dimensions. In addition, each of these three dimensions is associated with other attributes 
of the coalitions in our sample, but not in uniform ways. The service and staff religiosity 
measures have relatively similar coalition correlates, but the organizational religiosity 
measure stands in somewhat different relationship to the predictor variables than the 
other two. In the case of only two predictor variables — percentage of budget government 
funded and proselytizing as a goal — are the Betas all significant and all in the same 
direction, regardless of religiosity measure. It remains for future research to see if these 
measures have utility for enhancing the understanding of the many other forms of faith- 
based, non-congregational and denominational organizations proliferate throughout the - 
United States. 
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Why Estimates of the Impact of Public Opinion on Public 
Policy are Too High: Empirical and Theoretical Implications 


Paul Burstein, University of Washington 





Abstract 

Statistical studies often show public opinion strongly affecting public policy. But the studies 
may overestimate the effect because they focus on issues — those especially important to the 
public — on which governments are most likely to be responsive. This article considers what 
the opinion-policy linkage would be if less-important issues were also considered, by 
examining a random sample of proposals addressed by the U.S. Congress. Opinion has 
considerably less impact in the random sample than in the statistical studies. But this does 
not mean that the public is being defeated by special interests. On many issues, the public 
has no meaningful opinions; organized interests, therefore, can win without the public 
losing. 


Democracy is most often defined as a set of procedures and institutions intended to make 
the holders of political power responsible to broadly inclusive electorates (Dahl 1989: ch. 15; 
Markoff 1999). Historically it has been the product of many groups’ struggles to win influence 
over government for themselves, and fairly often, for the broader public as well (e.g., Markoff 
1996; Rueschemeyer, Huber and Stephens 1992). Democracy is often evaluated by how 
much influence the public has. As Erikson, Wright and Mclver (1993:1) write, “We often 
gauge the quality of democratic government by the responsiveness of public policymakers to 
the preferences of the mass public.” (See also Manza and Cook 2002a:630.) 

A key measure of influence is the impact of public opinion on public policy. For many social 
scientists and much of the public, if democratic procedures and institutions are working well, 
public opinion will strongly affect public policy. If, however, opinion has little impact, then 
democratic procedures and institutions must not be working well; they have broken down 
or are being manipulated by powerful groups (e.g., Bok 1997; Domhoff 1998:ch. 5-6; Erikson, 
Wright and Mclver 1993). 

How strongly does opinion affect policy? Most first-rate studies of American politics find 
its impact substantial, but not everyone is convinced (Burstein 1998a, 2003; Manza and Cook 
2002a; Page 2002). Recently, those dubious about the impact of public opinion won support 
from an unexpected source. Benjamin Page, usually associated with the “strong impact” side 
of the debate, contends (2002:332-335) that statistical studies overestimate the impact of 
opinion on policy. One reason is sampling bias. Public opinion polls focus on issues important 
to the public, and it is on such issues that democratic governments are most likely to do what 
the public wants. 

Two claims follow, one empirical and one interpretive. First, if survey organizations asked 
about less-important issues, we would find opinion having less impact on policy than extant 
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work suggests. Second, if opinion does have less impact, that means democratic procedures 
and institutions are not working as well as we thought. 

This article addresses both claims, supporting the first but not the second. Page had no 
data to support his contention about sampling bias. This article does, and it shows that 
sampling bias has indeed led us to overestimate the impact of public opinion on public policy. 

What this means is another matter. The second claim rests on two assumptions: (1) that 
the public has meaningful opinions on less-important issues it is not asked about, and (2) that 
politics is zero-sum — if the public does not get what it wants, the reason must be its defeat 
by special interests. This article questions both assumptions, and contends that the public’s 
having less impact does not imply more frequent defeats. 

The article begins by reviewing arguments that sampling bias affects our estimates of the 
impact of opinion on policy. | then describe a new data set designed to overcome such bias 
— a data set that tracks the history of a stratified random sample of policy proposals 
considered by Congress, from the time first introduced until either enacted or abandoned. 
The data analysis shows how sampling bias may have affected the conclusions of several 
classic studies (Monroe 1998; Page and Shapiro 1983; to a lesser extent, Domhoff 2002a); 
other data are used to assess sampling bias in another key work (Erikson, MacKuen and 
Stimson 2002). The article then reconsiders how data on opinion and policy should be 
interpreted, and delineates the implications of the argument and findings for our 
understanding of democratic politics. 


Sampling Bias and the Impact of Public Opinion on Public Policy 


How strongly does public opinion affect public policy? Many social scientists find its impact 
powerful. Erikson, MacKuen and Stimson (1995:560), for example, conclude that at the federal 
level in the United States, there is a “strong and resilient link between public and policy 
(1995:560), “...nearly a one-to-one translation of preferences into policy, where policy and 
preferences are measured in familiar percentage liberal terms.” (2002:316) At the state level, 
according to Erikson, Wright and Mclver (1993:80), the impact of opinion on policy is “awesome.” 

Others, however, find its impact minimal. Domhoff (1998:171) contends that “public 
opinion has little or no influence on presidential actions or congressional legislation except as 
a constraint in times of unpopular wars or domestic social upheaval.” It “has little or no 
independent influence on foreign, defense, or economic policy.” (Domhoff 2002b: 124) Still 
others suggest the answer lies somewhere in between. Page and Shapiro (1983:189) 
conclude that public opinion “genuinely affects government policies” in the United States, but 
are reluctant to make strong claims about democratic responsiveness. Manza and Cook 
(2002a) highlight these disagreements by contrasting the “large effects” and “small effects” 
images of the impact of opinion on policy. 

Why do the estimates of impact vary so much? We can't be sure, because those who 
disagree do not confront each other directly to resolve their differences. Erikson et al. (2002) 
don't cite Domhoff, while Domhoff (1998) cites neither Erikson et al. (1993) nor Page and 
Shapiro (1983). Reviews may highlight the disagreements (Manza and Cook 2002a), but 
seldom try to resolve them (Burstein 2003). 

Recently, however, Page (2002) attempted to narrow the range of disagreement by 
carefully examining the evidence supporting each view. His conclusion (2002:326): the higher 
estimates of the impact of opinion on policy — Erikson, Wright and Mclver (1993; 2002) and 
even his own (Page 2002:335) — exaggerate its true impact. 

Page adduces several reasons for this, but the key one here is sampling bias. (The others 
are specification error, aggregation bias and failure to estimate the impact of policy on public 
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opinion.) “We can study the impact of public opinion, only to the extent that public opinion is 
measured.” But opinion is measured only for a “relatively small and rather vaguely formulated 
subset of policy alternatives.” (Page 2002:332) This subset disproportionately includes issues 
especially important to the public or the polltakers. It is on such issues that opinion is most 
likely to affect policy, and therefore estimates of the relationship based on such issues are 
probably higher than they would be were less-salient issues taken into account as well. 

How much higher? We don't know. Page proposes some ways to find out, particularly 
through case studies and by gauging how strongly responsiveness depends on salience. But 
he neglects the most direct way to overcome sampling bias: draw an unbiased sample. That 
is what | do here. | begin by describing the creation of what | believe is the first data set ever 
based on a (stratified) random sample of policy proposals — specifically, a sample of proposals 
addressed by Congress. | then show how the sampling procedure affects the availability of 
public opinion data and our estimates of the impact of opinion on policy. 


Research Design and Data 


The claim that the public is asked about a biased sample of issues is plausible but untested, 
probably for two main reasons. First, there is a general blindness to sampling issues in the 
study of politics. Thirty years ago Gamson (1975) demonstrated convincingly the importance 
of random sampling when analyzing the impact of social movement organizations — sampling 
greatly affects the findings. Since then only a handful of scholars have discussed how 
sampling bias could affect conclusions about policy change (Burstein 2003; Page 2002; 
Schroedel 1986; and Wittman 1995:182); no one has followed up by extending Gamson’s 
approach to other organizations or political contexts. 

Second is the problem of defining the population to be sampled. Page (2002:332) writes 
of opinion being measured for a “vaguely formulated subset of policy alternatives,” but 
doesn't define “policy alternative” or describe the set from which a less vaguely defined 
subset might be drawn. We could try to measure opinion for a sample of issues or policies 
instead, but run into the same problem. There is no standard definition of an “issue” 
(Baumgartner and Leech 1998:38) or a “policy,” and thus no way to identify a population of 
issues or policies from which to sample. To overcome the sampling bias Page refers to, we 
must precisely define the unit of analysis and the population to which it belongs. 

It would make sense for the unit of analysis to be fairly concrete, more like a policy 
alternative than an issue or policy. One possibility would be to define a policy alternative 
operationally as a bill introduced into a legislature. Bills are readily identified, and so is the 
complete set of bills, the population from which a sample would be drawn. We could 
examine the opinion-policy link by starting with an unbiased sample of bills — at the federal 
level in the United States, a sample of congressional bills. 

Unfortunately, particular bills are not satisfactory units of analysis for research on policy 
change. Support for a policy alternative (or what | will call a policy proposal) can't be limited to 
support for a particular bill. For one thing, bills die at the end of each congress; thus, no policy 
proposal could be on the political agenda more than two years. For another, within 
congresses, multiple identical bills are often on the agenda at the same time; if the sampled 
bill fails but another, identical one is enacted, it makes no sense to say that the policy proposal 
was not acted on. 

Rather than thinking of a policy proposal as a particular bill, it makes more sense to think of 
it as a set of specific legislative ideas, often manifested in multiple bills over multiple congresses 
(cf. Burstein 1998b:ch. 2). Operationally, then, a policy proposal will be defined as the content 
common to all the bills in a set of highly similar bills, both within a congress and over time. 
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To begin, | selected a stratified random sample of 60 public bills (other than appropriations 
bills) introduced in the U.S. House of Representatives during the 101% congress, 10 from 
among bills reported out of committee (to ensure that some bills had been taken seriously) 
and 50 chosen randomly. (The details are found in Burstein, Bauldry, and Froese 2005.) 

The bills varied widely by subject and likely impact, as expected in a random sample. 
Among the bills of great importance were those enacted as the Americans with Disabilities Act 
and as the Financial Institutions Reform, Recovery and Enforcement Act (popularly known as 
the “savings and loan bailout”). Less important were the bills known as the Student Right-to- 
Know and Campus Security Act, and the Amtrak Reauthorization and Improvement Act, while 
bills dealing with tanker traffic in Puget Sound and with pecan, mushroom and lime promotion 
and research programs mattered only to small groups. Other bills addressed foster care, taxes 
on capital gains, the admission of refugees on an emergency basis, the disposal of solid waste, 
and the boundaries of Rocky Mountain National Park. A complete list isin Table 1. 

After the sample was chosen, there was a search for identical (or virtually identical) bills in 
the 101* congress and in earlier and later congresses, to ensure inclusion of the entire history 
of each proposal, from first introduction to final outcome, either enactment or disappearance 
from the legislative agenda. 

A sample of 60 may not seem large, but tracing their history was an unprecedented and, 
as it turned out, labor-intensive task. | had to go back historically far enough to be sure that 
what appeared to be the first introduction of a bill manifesting the policy proposal was really 
the first; and similarly, for proposals that dropped off the legislative agenda (as opposed to 
having been enacted), | had to look for bills far enough into the future to be sure that a 
proposal was not reintroduced after a hiatus. The 60 proposals proved to have been 
manifested in close to 1,000 bills over a 28-year period. 

Because some proposals are manifested in one bill, and others in many, a random sample 
of bills is not a random sample of proposals. Identifying the population of policy proposals, 
however, would have been an overwhelming task. Beginning with a sample of bills seemed 
feasible, a dramatic improvement over the conventional approach — not sampling in a formal 
sense at all —and almost surely a decent approximation of a random sample of proposals. 

The dependent variable is whether the policy proposal was enacted into law; 20 were. The 
independent variables were to be measures of public opinion relevant to each proposal. 

But which measures? The public is seldom asked about specific policy proposals, so it is 
necessary to decide which survey questions, if any, are relevant to each one. This matching 
problem is confronted by everyone studying opinion and policy, but is rarely addressed 
explicitly; there are no widely accepted guidelines for matching poll questions and policies.’ 

Perhaps the most detailed discussion of the matching problem is Stimson’s (1999). He 
argues that it is useful to distinguish between narrower measures — opinion on particular 
issues — and broader ones, including his own global measure of the public's liberalism, which 
he calls “policy mood.” 

| make an additional distinction. Data from the General Social Survey, the American 
National Election Studies and the Roper Center at the University of Connecticut are used to 
create measures of opinion at three levels of specificity. 

A question is categorized as measuring a “specific issue preference” if it addresses a policy 
proposal relatively directly. With regard to a proposal to increase domestic oil reserves, for example, 
a question asking whether the United States currently had enough oil reserves is seen as measuring 
a specific issue preference; similarly, a proposal to reduce greenhouse gases is linked to a question 
asking whether the United States should take the international lead in fighting global warming. 

A question measures a “general issue preference” if it describes preferences in the 
domain of a policy proposal but does not address the policy proposal itself. For example, a 
proposal to provide more health information and counseling to Medicare recipients was 


Impact of Opinion * 2277 


matched with a GSS question on government spending on health care. The respondents 
were not asked about the proposal, but about whether they generally favor more government 
activity in the area. (The connection between question and policy may not appear close, but 
studies of the opinion-policy relationship are usually based on exactly these kinds of data.) 

Finally, when no measure of specific or general issues preferences is available, Stimson’s 
(1999:140-41) estimates of the public’s mood in broad policy domains (education, health, 
race, and so on) — essentially measures of liberalism — are used when appropriate. Stimson 
and others (e.g., Erikson, Wright and Mclver 1993) develop broad measures partly to permit 
analysis of the connection between opinion and policy globally without concern about 
specific issues. Using Stimson’s measures here is somewhat problematic because they are 
being linked to particular proposals — Stimson never uses them this way. 

Salience is measured with the Gallup “most important problem” question. Responses to 
the Gallup question were matched to the 60 policy proposals by two independent raters. For 
example, Gallup respondents mentioned “polluting the environment” and “the Savings and 
Loan crisis.” We matched these responses to policy proposals to reduce greenhouse gases 
and bail out the savings and loan industry, respectively. Often policy proposals did not fit any 
Gallup response category. For example, no Gallup response option encompassed a policy 
proposal to change Interstate Highway weight limitations. A proposal was defined as salient 
when at least 1 percent of the respondents said the issue was important.’ 


Data Availability, Salience and Sampling Bias 


Table 1describes the 60 policy proposals, notes which were enacted, identifies those salient 
to the public, and shows whether public opinion data were available and at what level of 
specificity. If data were available at more than one level, the most specific is identified. 

There are data on specific issue preferences for 10 policy proposals (17 percent), and on 
general issue preferences for 26 more (43 percent) — a total of 60 percent. For the other 40 
percent, no issue preference data could be found. Thus previous research on specific issues, 
focusing only on those on which opinion has been measured, must have ignored at least 40 
percent of the proposals Congress considers. 

Stimson’s measure of policy mood is potentially relevant for proposals that can be described as 
liberal or conservative; that proves to be half the remaining proposals, 20 percent of the total. The 
final 20 percent could not be described as liberal or conservative, making the policy mood measure 
irrelevant.4 Thus, data on issue preferences are available for 60 percent of the proposals, and 
mood data for 20 percent more. There are no measures of public opinion at all for 20 percent. 

As Page hypothesizes, preference data are available only for a biased sample of proposals: 
for salient issues, nearly always (24 of 26 proposals, see Table 1); for non-salient issues, barely 
more than a third (12 of 34). This adds credibility to the claim that estimates of the impact of 
- Opinion on policy are too high. 


How Strongly Does Opinion Affect Policy? 


How much does it matter if we analyze data for a random of sample of policy proposals instead 
of the usual biased sample? To get some sense of this, | analyzed data for the 60 proposals in 
ways as similar as possible to two classic studies: Monroe's (1998) on the relationship 
between opinion and policy cross-sectionally and Page and Shapiro's (1983) on the relationship 
over time. | also found data that describe sampling bias in Erikson, MacKuen and Stimson’s 
(2002) work on the global relationship over time between mood and policy change. (Both Page 
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[2002] and Manza and Cook [2002a] pay special attention to these studies in their reviews.) 
Some comparisons will also be made with Domhoff's (1998; 2002a; 2002b) work. 


Cross-Sectional Analysis: Monroe 


Monroe (1998) analyzes the relationship between public opinion and public policy, 1980-1993, 
by finding 566 survey questions on potential fedetal policies, ascertaining whether a majority 
of those with opinions wanted policy change and then seeing whether policy changed. 
Opinion and policy are seen as consistent when the public preferred the status quo and policy 
remained unchanged, and when the public wanted policy change, and policy changed. 

Monroe finds opinion and policy consistent for 55 percent of the issues (Table 2a, based 
on Monroe 1998). He also finds a substantial bias against change. When the public wants the 
status quo, policy remains unchanged 70 percent of the time, but when it wants change, 
policy changes only 45 percent of the time, producing a bias measure of 70-45, or 25. 


Table 2: Public Opinion and Public Policy 
A. Consistency between Opinion and Policy 1980-1993, from Monroe (1998:13) 





Policy Outcome Majority Preference 
Status quo Change 
Status quo 70% 55 
Change 30 45 
N 230 336 
Consistent 55% 


B. Consistency between Opinion and Policy, Proposals with Opinion Data 





Policy Outcome Majority Preference 

Favors Proposal Opposes Proposal Total 
Proposal enacted 12 (41%) 1 (14%) 13 
Proposal not enacted 17 (59%) 6 (86%) os 
N 29 7 36 
Consistent 50% 


C. Consistency between Opinion and Policy, All Proposals 





Policy Outcome Majority Preference 

Favors Proposal Opposes Proposal None Total 
Proposal enacted 12 (41%) 1 (14%) 5 (23%) 18 
Proposal not enacted 17 (59%) 6 (86%) 17 (77%) 40 
N 29 a 22 58° 
Consistent 31% 


Note: Italics identify opinion consistent with policy. 


*On two proposals, specific and general opinion data were available and contradicted each 
other; these have been left out of this analysis. 
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Like Monroe, | consider proposals for which specific or general issue preference data are 
available and gauge consistency between opinion and policy. My approach differs a bit from 
his. Instead of looking at preferences for or against change, | categorize opinion and policy 
as consistent when a proposal is enacted while a majority of the public favors it or not 
enacted when a majority of the public is opposed. 

For the 36 proposals for which specific or general issue preference data are available, opinion 
and policy are consistent for 18, or 50 percent, a figure close to Monroe's (Table 2b). The bias 
against change in my data, however, seems greater. When the public opposed a proposal, it 
died 86 percent of the time (six proposals out of seven), but when the public wanted enactment, 
Congress acted only 41 percent of the time, producing a bias measure of 86-41, or 45. 

Neither Monroe's results nor mine show public opinion having much impact. (He found 
the relationship between opinion and policy statistically significant at the .05 level: | did not.) 
Nevertheless, it is important to consider what happens when the viewpoint is shifted, from 
considering the 36 proposals on which public opinion is known (the biased sample), to 
considering all 60 (the stratified random sample). 

Assume, for the sake of simplicity, that members of Congress know what the public wants 
only if poll data exist. Then, when members of Congress know what the public wants (as it 
does on 36 proposals), it acts accordingly half the time. 

lf we consider all 60 proposals, however, our perspective changes. The 18 proposals on 
which the government responds to the public are no longer half the total, but only 31 percent 
(Table 2c). On the other 69 percent, sometimes the government is definitely unresponsive 
(when it ignores issue preferences), and sometimes (when there are no data on preferences) 
we can't say whether it is or not. 


Congruence Over Time: Page and Shapiro 


Like Monroe, Page and Shapiro (1983) analyze the relationship between opinion and policy for 
all the issues on which they find public opinion data. Their focus, however, is change over 
time. By seeing whether public opinion changed before policy or the reverse, they make 
stronger inferences about causality than in cross-sectional analysis. 

They found public opinion data documenting 357 cases of significant change in 
Americans’ policy preferences between 1935 and 1979, and focused on 231 in which policy 
also changed. Opinion and policy were described as congruent when both moved in the 
same direction, and noncongruent otherwise. They found congruence in 66 percent of the 
231 cases and noncongruence in the rest. 

Data on trends in specific or general issue preferences are available for 19 of the 60 policy 
proposals considered here. | consider opinion and policy to be congruent if a proposal is 
enacted after the public becomes more favorable (percent favorable increasing by at least 5 
percent), or not enacted if public support remains the same or declines. (Again, my approach 
is similar to Page and Shapiro's, but not identical. They focus on cases of policy change; | 
consider all policy proposals for which data are available.) 

For the 19 proposals, opinion and policy are congruent for 15, or 79 percent, a percentage 
a bit greater than Page and Shapiro's. This is a measure of what they would describe as the 
effect of opinion on policy. 

However, if we ask what the data tell us about the determinants of policy for all 60 
proposals, the results look quite different and highlight how little we can say about the impact 
of opinion on policy. The 15 proposals on which the government is definitely doing what the 
public wants (by conventional standards) are 79 percent of the proposals for which 
preference data exist, but only 25 percent of all 60. For the other 45 proposals, we know the 
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government is not doing what the public wants on four, or 7 percent of the total of 60. On the 
remaining 41, we have no way to know if the government is responsive. Public opinion data 
are available for so small a fraction of the policy proposals that the impact of sampling bias on 
the results could be very substantial. 


Policy Mood and Policy Change: Erikson, MacKuen and Stimson 


Erikson, MacKuen and Stimson (2002:368) find the impact of opinion (measured in terms of 
policy mood) on policy so powerful that they are somewhat mystified. “How,” they ask, “could 
the degree of popular control be so strong?” 

Like Monroe (1998) and Page and Shapiro (1983), they begin with data on public opinion 
and then find data on relevant policies — that is, policies likely to respond to opinion if 
democratic procedures and institutions work as they should. Unlike Monroe, and Page and 
Shapiro, they describe how they selected some policies for analysis from a larger set; this 
enables us to see if their sampling procedure is biased and, if so, how the bias might affect 
their conclusions. 

Erikson, MacKuen and Stimson (2002:328) want to examine how the public’s policy mood 
affects what they call “major shifts in American governmental policy” over the years. Their 
focus is domestic policy, and their concern is whether opinion and policy were moving 
together along a liberal-conservative dimension. To measure shifts in policy, they begin with a 
list of laws designated as especially important at the time of passage by David Mayhew 
(1991), updated through 1996. They then eliminate from their data set laws that don’t match 
their concerns (2002:330): laws dealing with foreign, defense or agriculture policy; laws with 
an impact “more local than national;” and laws neither clearly liberal nor conservative. 

Mayhew’'s list (1991:52-73) includes 189 laws. Of these, Erikson, MacKuen and Stimson 
(2002:374-80) include 107, or 57 percent. Among those left out because they pertained to 
defense, foreign policy and agriculture were reorganizations of the Defense Department 
(1958 and 1986), the nuclear test ban treaty (1963), the War Powers Act enacted at the end 
of the Vietnam War (1973), and major laws affecting foreign trade (e.g., the Trade Expansion 
Act of 1962) and subsidies to agriculture. 

Left out because they are (in Erikson, MacKuen and Stimson’s [2002] view) local rather 
than national in scope, or neither liberal nor conservative, are a number of laws recognizable 
decades later as very important. These include the Federal-Aid Highway Act of 1956 creating 
the Interstate highway system; the laws that created NASA, the Arms Control and 
Disarmament Agency, the departments of Housing and Urban Development, and 
Transportation; some antidiscrimination laws, including the Age Discrimination Act of 1967, 
the Equal Pay Act of 1963, the Equal Rights Amendment (not ratified by the states), and the 
Equal Employment Opportunity Act of 1972; the National Environmental Policy Act of 1969; 
the 1986 Tax Reform Act; the Savings and Loan Bailout (1989-1990); statehood for Alaska and 
Hawaii; and crime control laws including the Omnibus Crime Control and Safe Streets Act of 
1968, and the Organized Crime Control Act of 1970. However important the laws Erikson, 
MacKuen and Stimson include, they clearly exclude many that were important as well. 

Page (2002:329) suggests that Erikson, MacKuen and Stimson find opinion strongly 
affecting policy partly because they include in their analysis only policies especially likely to be 
affected by their measure of policy mood. The policies measure federal governmental policy 
liberalism, and policy mood the public's domestic policy liberalism. Their opinion measure 
gauges how liberal the public is, and among laws enacted, they include only those that can 
be described as liberal or conservative. If a law couldn't be described as liberal or 
conservative, they left it out of their analysis. The data presented here strongly suggest that 
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if they had chosen a random sample of policies or analyzed all the laws Mayhew considers 
important, they would have found that opinion had much less impact on policy. 


Small Effects: Domhoff 


Up to this point, | have contended that the impact of opinion on policy has been 
overestimated by researchers generalizing from biased samples. Domhoff (1998; 2002a; 
2002b), in contrast, could hardly be overestimating the impact of opinion. He sees it having 
So little: almost none on policies that elites care about — especially foreign, defense and 
economic policies — and a modest impact on others (cf. Erikson 1976). 

Domhoff provides no quantitative evidence to support his claims, but Monroe and Page 
and Shapiro present some that is relevant. Monroe (1998:14) finds consistency on defense 
and foreign policy issues perhaps slightly greater than overall consistency (61 percent and 
67 percent consistent respectively, compared to the overall 55 percent), and “economic and 

-labor" issues perhaps slightly less (51 percent). Page and Shapiro (1983:182) find 
congruence on foreign policy a bit less than the overall measure (62 percent vs. 66 percent 
overall), and “economic and welfare policies” the same. 

Of the 60 policy proposals analyzed here, 34 have data on public opinion adequate to 
gauge consistency for the different types of issues (Table 3a). In this data set, consistency 
between opinion and policy is higher for foreign, defense and economic policies than for 
other policies.’ At first glance, it appears that neither of Domhoff’s claims is supported by 
the data: public opinion does affect policy, no less so for foreign, defense and economic 
issues than for others. If we add the proposals for which no public opinion data are 
available (Table 3b), overall consistency between opinion and policy declines as expected, 
but it remains true that responsiveness on foreign, defense and economic policies is higher 
than on others. 


Table 3: Opinion and Policy by Issue 


A. Consistency between Opinion and Policy, Proposals with Opinion Data 


Foreign, Defense, 


Opinion and Policy Economic Policies Other Policies Total 
Consistent 11 (58%) 6 (40%) 50% 
Inconsistent 8 (42%) 9 (60%) 

N 19 15 34 


a 


B. Consistency between Opinion and Policy, All Proposals 
en 


Foreign, Defense, 


Opinion and Policy Economic Policies Other Policies Total 
Consistent 11 (42%) 6 (19%) 29% 
Inconsistent 8 (31%) 9 (28%) 

No opinion data 7 (27%) 17 (53%) 

N . 26 32 58 


Note: Consistency is assessed by Monroe’s (1998) criteria; see Table 2. 
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Domhoff would have a number of responses to these conclusions, including an important 
one shared with Page (2002), namely that without including other variables in the analysis, it 
is impossible to ascertain whether public opinion has an independent effect. This is true. 
Given how obvious this is, it is remarkable how few quantitative studies of the determinants 
of policy attempt to gauge the effects of elite activities and public opinion simultaneously 
(Burstein 2003:34-35). The little evidence there is, though, is that public opinion continues to 
affect policy even when elites’ influence is taken into account. 


Implications for the Study of Democratic Politics 


Summing up a recent volume on public opinion and public policy, Page (2002) entitles his 
chapter “The Semi-Sovereign Public” to remind readers of Schattschneider’s (1960) classic, 
The Semi-Sovereign People. The reminder is appropriate, Page (2002:325) argues, because 
the research “largely confirms, or is consistent with, Schattschneider’s view of American 
politics. There is considerable evidence of government responsiveness to public opinion... 
But the evidence also indicates that there is considerable room for interest groups, party 
activists, policymakers, and others to prevail against the public on many issues.” 

The quotation is accurate, but | think that developments in the social sciences since 
Schattschneider’s work reduce the value of his framework for studying opinion and policy. 
Indeed, his framework (which is widely shared) may blind us to how those developments 
should affect current work. 

Even when Schattschneider wrote, some social scientists had realized that: (1) sampling 
bias may affect our estimates of statistical relationships; (2) limits on individuals’ cognitive 
capacities mean that on most policy issues the public has no meaningful opinions; and (3) 
power is not necessarily a zero Sum phenomenon in which gains by one side must come at 
the expense of the other (Parsons 1960:ch. 6). Today the vast majority of social scientists — 
including those who write about the impact of opinion on policy (e.g., Manza and Cook 2002a) 
— would agree. But few consider the implications for assessing past work or for planning 
additional research and interpreting the likely findings. 

As noted above, for example, few social scientists consider the possible impact of 
sampling bias on our conclusions about the determinants of policy change. Yet it seems quite 
clear that they should. Page (2002) makes an especially detailed and cogent argument that 
such bias leads us to overestimate the impact of public opinion on public policy, and the data 
presented above support him. 

But how much impact does sampling bias have? How much lower than current estimates 
would we find the impact of opinion on policy to be if we were able to ask the public its 
opinion on a random sample of policy proposals? 

The answers depend in substantial measure on why public opinion data are unavailable for 
so many issues. If the public's opinions are unknown primarily because polling organizations 
do not ask about them, future surveys could do so, eliminating sampling bias by asking about 
a random sample of policy proposals. 

But what if public opinion on many issues is unknown primarily because it doesn't exist? 
On those issues the government couldn't respond to the public at all. The impact of public 
opinion would necessarily be zero. The impact of sampling bias would be very high, and 
revised estimates of the impact of opinion much lower than the current ones. If we assume 
that lack of data means public opinion does not exist, our cross-sectional estimate 
(comparable to Monroe's) of responsiveness — how often the government did what the public 
wanted — would drop from 50 percent to 31, and our over-time estimate (comparable to Page 
and Shapiro's) from 79 percent to 25. 
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It can readily be argued that the public indeed has no opinion on most issues or at least no 
stable, coherent opinion. Individuals seem to respond to many survey questions on policy 
almost randomly, changing their answers unpredictably (Converse 1964); they may be 
responding to questions by drawing on multiple, often contradictory attitudes and, depending 
on what's on their minds when the question is asked, answer with the first seemingly relevant 
response that comes to mind (Zaller 1992). 

As a practical matter, it is difficult to imagine most people having opinions about many of 
the policy proposals they weren't asked about (even tangentially), such as H.R. 5753, proposing 
to amend the Internal Revenue Code to exempt export trade corporations and foreign sales 
corporations from passive foreign investment company rules; H.R. 895, proposing to treat 
federal pay in the same manner as non-federal pay with respect to garnishment; H.R. 2799, 
proposing to amend the Agricultural Act of 1949, with respect to the 1990 crop year, to permit 
up to 20 percent of program acreage to be planted with alternative crops if: (1) such acreage is 
planted with canola, rapeseed, sunflower, safflower, flaxseed, kenaf, crambe, guayule, 
milkweed or meadowfoam; or H.R. 2136, proposing to limit the term of incarceration for civil 
contempt in child custody cases to 12 months (Table 1).? These may seem trivial, but in fact 
may be important: rules on corporate taxation may affect the distribution of the tax burden by 
income group; garnishment may affect the likelihood of divorced spouses being able to collect 
child support payments; agriculture policy may affect not only the income of farmers but the 
openness of American markets to agricultural products from less-developed countries. And 
collectively, the 40 percent of policy proposals for which no specific or general opinion data are 
available must potentially influence a tremendous amount of government activity. 

Thus, it’s likely that public opinion affects public policy even less often than Page suggests 
— partly because the public is not asked its opinions on less-salient issues, and partly because 
if it were, most people would have no meaningful opinion. 

How can we overcome sampling bias and improve our estimates of the impact of public 
opinion on public policy? First, if we want to estimate the impact of public opinion on public 
policy, we need to consider all the policy proposals for which opinion exists — as a practical 
matter, this would mean a random sample of such proposals. There would be practical 
problems to address, very importantly justifying the cost of asking about policies not likely to 
be salient. Nevertheless, it’s possible to imagine what the research design would look like, 
and there is a long tradition of making changes in the General Social Survey and American 
National Election Studies in response to advances in theory and methods. 

As we have seen, though, asking the public about a wider range of issues raises additional 
concerns. If we are to consider the population of proposals for which public opinion exists, 
we need criteria for deciding whether it exists for each issue. There is a considerable literature 
on when public opinion may be said to exist in some meaningful sense (Converse 1964; 
Shapiro 1998; Sinnott 2000; Zaller 1992), but it has never been used to decide whether 
opinion exists on particular issues in quantitative work on the impact of public opinion on 
public policy. It would be necessary to figure out how to do so. 

Because the existence of public opinion is so critical for the analysis of democratic 
responsiveness, we would have to consider what brings it into existence on particular issues. 
It is sometimes claimed that the public has no opinion on issues because education and 
public discourse are controlled by powerful institutions that keep many issues off the political 
agenda. This must be true sometimes, but surely on a great many issues people have no 
opinions because they have no reason to, and lack the time and cognitive capacities to 
develop them (Jones 1994). Congress deals with so many proposals that not even its own 
members can form independent opinions about many of them (Krehbiel 1991). 

Because salience is probably so critical to responsiveness, we need to understand the 
factors that affect it as well. Our attention must shift to the tremendous amount of political 
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conflict over winning access to the public agenda (Hilgartner and Bosk 1988; Jones 1994). 

We already know what we are likely to find: public opinion affects policy less than we 
previously believed. But improvements in our understanding of the connection between 
opinion and policy should change our interpretation of what the finding means. 

The conventional view is that a high correlation between opinion and policy (in analyses 
carefully trying to assess causality) means that democratic procedures and institutions are 
working well, and that a low correlation means they are working poorly. Indeed, a low 
correlation means that special interests have prevailed over the public (Erikson, Wright and 
Mclver 1993:81; Manza and Cook 2002b:23; Page 2002:325). This implicit zero-sum view of 
the political process partly explains why researchers hopeful about democratic government 
are so pleased when they find a high correlation. As Erikson et al. (1993:81) write, for 
example, “with an imputed correlation of about .91, state opinion explains over 80 percent of 
the variance in state policy liberalism. This leaves little variance to be accounted for by other 
variables. Thus, state opinion is virtually the on/y cause of the net ideological tendency of 
policy in the state.” (Emphasis in original.) And the zero-sum view also explains why those 
who find special interests getting what they want so often assume that this must mean the 
public has been defeated. Page seems to accept this logic when he weighs the impact of 
public opinion against the ability of special interests to prevail. 

But politics is not necessarily zero-sum. If we find that special interests often get what they 
want, that does not mean they have defeated the public. When they want the same policies as 
a majority of the public — a possibility considered surprisingly seldom in quantitative analyses 
of policy change (Burstein 2003) — both can win. And when the public has no opinion — no doubt 
the case on many issues — forces other than public opinion can win without the public losing. 
(For example, the proposal that became the Americans with Disabilities Act was on the 
congressional agenda for 17 years; it was never asked about in a public opinion poll nor did it 
receive any press attention until close to enactment in 1990. But the absence of public opinion 
does not mean that the public was somehow defeated by lobbyists for the disabled.) A relatively 
low correlation between opinion and policy does not necessarily mean the public is being 
defeated. If we want to see how often other forces defeat the public, we need to be sure we are 
studying actual struggles, with the public on one side and special interests on the other. 


Conclusions 


This paper examines two claims about the impact of public opinion on public policy. First, that 
sampling bias has led us to overestimate the magnitude of the impact; and second, that if the 
true impact is less than we have thought, that means democratic institutions have not been 
working as well as many believe. 

Data from a unique data set and from elsewhere support the first claim. And it should be noted 
that the claim may very well apply to social movement organizations (SMOs) as well. Gamson 
(1975) showed that sampling bias affected our conclusions about SMO impact, and recent work 
(Burstein and Sausner 2005) concludes that SMOs have less impact than many believe. 

The second claim is another matter. Analyses of the impact of opinion on policy generally 
adopt a zero-sum view of the political process. The more often the government responds to 
public opinion, the better the democratic procedures and institutions are working; when the 
government responds less often, special interests must be prevailing over the public and 
democratic institutions must be failing. But the latter is not necessarily true. The government 
may respond strongly to the public when the public has meaningful opinions and to other - 
forces otherwise. Zero-sum battles between the public and special interests are only a part of 
the political process. How large a part? How often does the public prevail over special 
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interests, and how often the reverse? At this point, we don't know. What we do know is that 
current estimates of the impact of opinion on policy are probably too high; this does not 
necessarily mean democratic procedures and institutions are not working well. 


Notes 


I 


Among those who analyze substantial numbers of issues, for example, Page and Shapiro 
(1983:177) have not published their procedures for doing so; Brettschneider (1996) and 
Brooks (1985; 1987) are vague. Monroe (1998:11) writes that this task was not especially 
problematic “because most of the survey items dealt with what proved to be very specific 
policy proposals that were under consideration.” This was not my experience. 


Survey organizations seldom ask about support for particular policy proposals. 
Researchers, therefore, often use indicators of opinion about an issue as indicators of 
support for policy proposals, even though people often favor some social outcome but 
oppose government action. For example, responses to a question as to whether the 
United States made a mistake in sending troops to Vietnam are interpreted as gauging 
preferences for ending American involvement (McAdam and Su 2002:709); support for 
women having roles equal to men in business, industry and government is taken to signify 
support for the Equal Rights Amendment (Soule and Olzak 2004:484); and support for 
blacks having “as good a chance” as whites to get jobs is seen as support for equal 
employment opportunity legislation (Burstein 1998b:50; for a general discussion, see 
Burstein 1998b:ch. 3). 


There are multiple ways to gauge salience, including the extent of media coverage and using 
different cutoff points in the Gallup data. | tried using other Gallup cutoff points, measures of 
issue coverage in the New York Times, and combinations of the two. The results were the 
same regardless, so | use the 1 percent Gallup cutoff for the sake of simplicity. 


All proposals were coded independently by three coders using Stimson’s criteria; 29 (48 
percent) did not fit a liberal-conservative dimension. This is in line with Page and Shapiro's 
(1983: 183) finding that 40 percent of their issues “Were not easily locatable on the liberal- 
conservative dimension.” 


The majority-minority criterion may seem arbitrary, given how sensitive distributions of 
responses are to question wording; analyses identical to those reported were conducted 
with a division between favorable and unfavorable at 62 percent, the mean of all the 
distributions, rather than 50 percent. The results were not affected by this change. 


Alternatively, they could explicitly limit their conclusions to the range of policies they study, 
but they state their conclusions globally instead. 


It should be noted that Domhoff, Monroe and Page and Shapiro all treat the 
classification of issues as unproblematic, but they shouldn't have — much political 
debate is about how issues should be classified and framed (Baumgartner and Jones 
1993). | classified the Americans with Disabilities Act as an economic policy, for 
example, because much of the testimony at hearings and the congressional debate 
revolved around the costs of accommodating the needs of the disabled, but many 
people view the Act as a civil rights law. 
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8. Interviewers could provide paragraph-long summaries of each proposal, but then the 
distribution of responses would depend heavily on how the summary frames the proposal. 
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Did Falling Wages and Employment Increase U.S. 
Imprisonment? 


Bruce Western, Princeton University 
Meredith Kleykamp, Princeton University 
Jake Rosenfeld, Princeton University 





Abstract 

This paper studies the effects of wages and employment on men’s prison admission rates 
in the United States from 1983 to 2001. Research on the effects of the labor market on 
incarceration usually examines national- or state-level data, but our analysis studies 
prison admission among black and white men in specific age-education groups. We find a 
significant increase in educational inequality in incarceration; nearly all the growth in the 
risk of imprisonment was confined to non-college men. Regression analysis of prison 
admission rates shows the negative effects of wages and employment on black men’s 
incarceration, and the negative effects of wages on white men’s imprisonment. If 1980s’ 
wage and employment levels had persisted through the late 1990s, the estimates suggest 
that prison admission rates would be 15 to 25 percent lower for all non-college men. 


Two major social trends reduced the living standards of young low-education American men 
over the past 30 years. Earnings and employment among those with just a high school 
education were eroded by the tide of rising income inequality. While the labor market faltered, 
growth in the American penal system turned prison time into a common life event for low-skill 
and minority men. The growth in U.S. economic inequality and the prison boom both date 
from the mid-1970s, and both trends continued through the 1990s. Have falling wages and 
employment among young, low-skill, men contributed to the growth in American 
imprisonment? 

Labor market trends might influence the scale of imprisonment in two main ways. Falling 
wages and employment may increase crime at the bottom of the economic ladder, 
generating more arrests, convictions and prison admissions. In one account, young black 
men turned to drug dealing and other crime in response declining job opportunities through 
the 1980s and 1990s (Freeman 1996; Duster 1997). Against this view, sociologists of 
punishment see the criminal justice system not just as an instrument for crime control; it also 
operates to contain marginal populations that elites and voters perceive as threatening. The 
direct link between contemporary economic inequality and punishment was forcefully 
claimed by Wacquant (2000) who argues that the criminal justice system has thoroughly 
penetrated poor inner-city neighborhoods, giving rise to a novel form of racial domination. 

We examine the impact of the labor market on men’s prison admission between 1983 and 
2001. Earlier research associated aggregate labor market indicators with aggregate 
incarceration rates, either in samples of U.S. states or national time series (e.g., Jacobs and 
Helms 1996, 2001; Greenberg and West 2001). While the analysis of aggregate incarceration 
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rates has produced valuable estimates of political and cultural effects, it misses a central 
implication of labor market theories of punishment: that carceral efforts target economic 
losers. For the labor market theory of punishment, increasing economic disadvantage doesn't 
just raise the general level of incarceration; it also increases inequality in incarceration. We 
revisit the labor market theory of punishment with a novel approach that calculates the risk 
of imprisonment for white and black men at different ages and levels of education. We then 
relate these disaggregated risks to disaggregated measures of wages and employment. In 
contrast to previous aggregate-level research, our design produces a tighter link between the 
economic status of the disadvantaged and their involvement in the criminal justice system. 


The Prison Boom 


About two-thirds of the American correctional population is housed in state or federal prisons 
serving sentences for felony convictions of a year or longer. Between 1920 and 1970, the 
imprisonment rate averaged about 100 per 100,000 of the U.S. population. The 1970 
imprisonment rate, at 96 per 100,000, stood near its historic average. By 2001, there were 470 
prisoners per 100,000. Growth in the prison population was driven by an increase in the 
numbers admitted to prisons and an increase in the time served by those admitted. Figure 1 
shows that between 1977 and 1998 the state and federal prison population grew more than 
fourfold to include about 1.3 million inmates by the end of the 1990s. Annual prison 
admissions expanded by a similar margin. Around 650,000 people are now annually 
sentenced to a year or more in state or federal custody. 


Figure 1: Prison Admissions and Populations, 1977-1998 
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Our analysis examines the prison admission rate because it is likely to be most directly 
related to labor market conditions. A rising gap between rich and poor may affect admissions 
by increasing crime rates among low-income men or by increasing the rates of arrest and 
court commitment to prison. Time served in prison, on the other hand, is only observed when 
a prisoner is released. In this case, the effects of the economy will only be seen with a long 
and indeterminate lag. We might also follow other research by analyzing incarceration rates 
(e.g., Jacobs and Helms 1996; Greenberg and West 2001), but this bundles together the 
immediate effect of the labor market on admissions and the lagged effect on time served. 

Aggregate figures on imprisonment conceal large educational and racial disparities. 
According to the 1997 Survey of Inmates of State Prisons, prisoners average less than 11 
years of schooling compared to more than 13 years of schooling among men under age 40 in 
the non-institutional population (Western and Pettit 2005). Censuses of correctional facilities 
find that the imprisonment rate of African Americans is about seven times higher than that 
of whites. Disaggregated prison admission rates that capture these inequalities can be 
calculated using data from the National Corrections Reporting Program (NCRP). The NCRP 
data provide an annual census of all prison admissions and releases in 38 reporting states, 
covering 80 to 90 percent of the total prison population. The NCRP data record a prisoner's 
age, race and ethnicity, education and offense. Combining the NCRP data with aggregate 
figures on total prison admissions yields estimates of the numbers annually entering prison 
for white and black men at different ages and levels of education. 


Table 1: Percentage of Non-institutional Men, Ages 20-39, Annually Entering Prison, 
by Education and Race 


1983-1988 1989-1994 1995-2001 


Whites 
Less than High School ‘B55 2.45 3.37 
High School or GED 32 39 56 
Some College .06 .06 .06 
Blacks 
Less than High School 7.09 14.59 16.33 
High School or GED 1.42 2.43 2.87 
Some College 49 2 54 


Source: Figures are calculated with data from the National Correctional 
Reporting Program (1983-2001), and the Outgoing Rotation Group files of 
the Current Population Survey (1983-2001). 


Table 1 reports prison admission rates for young black and white men from 1983 and 2001." 
Unlike most studies of imprisonment, we distinguish admission rates for those without high 
school diplomas or GEDs, high school graduates and those with at least some college. For blacks 
and whites, high school dropouts are about five times more likely to go to prison in a given year 
than men who have completed high school. The prison admission rate rose sharply for low- 
education men but little among the college-educated. Racial disparities are also pronounced. In 
line with aggregate statistics on incarceration rates, the NCRP data indicate that rates of prison 
admission for African Americans are five to 10 times higher than for whites. The combination of 
racial and educational inequality strikingly affects young black male dropouts. We estimate that 
one in six black male dropouts went to prison every year in the late 1990s. The protective effects 
of college education are also shown; fewer than 1 percent of college-educated black men were 
admitted to prison in 2001. These patterns are similar to the lifetime risks of incarceration 
calculated by Pettit and Western (2004), whose estimates are based on different data sources. 
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Disaggregated prison admission rates show that race and class disparities are extremely 
large, far larger for example than inter-state variation in incarceration rates. Whereas Louisiana's 
incarceration rate of 801 per 100,000 exceeds Maine's (149) by a factor of five (Maguire and 
Pastore 2005, Table 6.23), the relative risk of imprisonment of a black male dropout is more 
than 250 times higher than for a college-educated white man (16.33 compared to .06 percent). 
Understanding the vast race and class disparities in imprisonment thus seems central to 
explaining the prison boom. High rates of imprisonment among disadvantaged men are usually 
attributed to their high rates of offending and to the social control efforts of authorities. We 
next review theories of race and class inequalities in crime and social control, and then propose 
an empirical test. This test isolates the mounting risk of imprisonment among young 
disadvantaged men that is independent criminal behavior, but linked to economic status. 


The Labor Market and Crime 


Are high rates of prison admission among low-education men due to their high rates of crime? 
Merton's (1968, 223) social strain theory provides the seminal justification, “The moral mandate 
to achieve success... exerts pressure to succeed by fair means if possible and by foul means if 
necessary.” Frustration at blocked opportunities might drive the poor to crime so they might 
access the material success enjoyed legally by the middle class. In these cases, the crime- 
involved poor may have “withdrawn their attribution of legitimacy” to norms of law-abiding 
behavior (Cloward and Olin 1960, 19). Blau and Blau (1982) go further, arguing that ascriptive 
inequality (such as racial inequality), more than inequality based on achievement, appears 
particularly illegitimate. High levels of racial inequality, rather than economic disadvantage, may 
fan the frustrations that trigger crime. 

Theories of informal social control likewise anticipate more crime among the economically 
disadvantaged. The routines of steady employment, independent of its economic attractions, 
reduce opportunities for offending. Researchers also find that men in primary sector jobs — where 
work is consistent, routinized and monitored — commit less crime than men in the secondary 
labor market where employment is irregular (Sullivan 1989; Crutchfield and Pitchford 1997). Low 
wages and unemployment may also foster crime indirectly by undermining the bonds of family 
and neighborhood. Stable two-parent families can monitor children’s activities and divert them 
from the peer networks that provide a familiar context for delinquency (Sampson 1987). Poor 
families, which are more likely to be headed by a single parent, have fewer resources to restrain 
delinquency (Hagan 1993). Consequently, poor neighborhoods with many single-parent families 
have been found to have high levels of violence (Morenoff et al. 2001). 

The class-crime connection is unevenly supported by empirical studies. The relationship is 
weakest in research on self-reported offending (Tittle et al. 1978). Still, official records from police 
and courts consistently show that lower class men and male youth are more involved in crime and 
delinquency than the middle class. Individuals in poor and minority neighborhoods are also likely to 
self-report crime at relatively high rates, regardless of their own social class (Braithwaite 1980). 

Ecological time series and panel studies of states and counties offer stronger support for labor 
market effects on crime. (Chiricos 1987 and Freeman 1995 provide reviews.) Ecological studies 
often distinguished the effects of absolute deprivation (measured by unemployment rates or 
income levels) and relative deprivation (measured by inequality statistics); LaFree and Drass 
(1996), for example, find that race-specific arrest rates increased during periods of rising intra- 
racial income inequality, but an absolute measure of economic well-being was not consistently 
associated with arrest rates. Broad trends, however, are difficult to reconcile — crime rates 
escalated in the decade from 1965 while economic inequality declined. Land, McCall and Cohen 
(1990) are skeptical that the effects of absolute and relative deprivation can be convincingly 
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separated. Their analysis of city- and state-level data from three census years estimates the 
effects of an omnibus measure of economic deprivation that includes a weighted sum of the 
poverty rate, a Gini index of family income inequality and median family income. The estimated 
effect of economic deprivation on homicide rates increases at the city and state levels between 
1950 and 1980. (Liska and Bellair 1995 report similar results.) Ecological studies reporting a 
connection between crime and the labor market capitalize on the sharp increase in crime rates 
between the mid-1960s and mid-1970s. 

Our interest focuses on the link between economic conditions and crime after 1980 when the 
labor market situation of low-skill men significantly deteriorated. Freeman (1996) reflects on these 
later decades arguing that “the depressed labor market for less skilled men in the 1980s and 
1990s has contributed to the rise in criminal activity by less skilled men.” Despite Freeman's 
claim, quantitative studies provide little evidence of a consistent relationship between labor 
market conditions and crime rates from 1980 through 1999. Fowles and Merva (1996) analyze 
28 metropolitan areas between 1975 and 1990 and find a stable association between household 
income inequality for several categories of violent crime. However, Doyle and her colleagues 
(1999) could not reproduce this result in their study of crime rates in 48 states from 1984 to 1993. 
Trends in a national series of youth homicide arrests, 1967-1998, were also weakly related to 
income inequality and reliably related only to unemployment rates among whites but not blacks 
(Messner, Raffalovich and McMillan 2001). These ecological studies suggest that analyses of 
states and national time series have not consistently linked levels of offending to either incomes 
or employment in the period when low-skilled men’s economic opportunities deteriorated most 
—the 1980s and 1990s. 

The ecological regressions are limited by their inference of crime among the poor from 
aggregate crime rates for the whole population. Ethnographic research helps fill the gap by 
studying crime in poor urban neighborhoods. Ethnographers have identified entrepreneurial gangs 
as a key source of economic opportunity for young men in urban communities characterized by 
chronically high rates of joblessness. Bourgois’ (1996) research on Hispanic drug gangs views the 
sale and distribution of crack cocaine as a response to depleted economic opportunities in inner 
cities. Venkatesh and Levitt (2000) find that drug gangs have become important economic 
organizations in poor urban areas. Their research on Chicago's “outlaw capitalism” shows that drug 
gangs have a well-defined organizational hierarchy in which incomes are steeply graduated from 
the street sellers at the bottom to the managers at the top. Other researchers more generally claim 
that drug dealing in inner cities proliferated as legitimate, low-skill, employment opportunities 
diminished in the 1980s and 1990s (Duster 1997; Tonry 1995; Anderson 1999). 

In sum, theories of social strain and informal social contro! lead us to expect that the 
declining economic situation of low-skilled men increased their criminal activity in the 1980s 
and 1990s. This hypothesis receives uneven support from quantitative studies that are 
hamstrung by a highly aggregated research design. Contextual and disaggregated 
ethnographic observation directly observes crime among the poor. This work provides clearer 
evidence that drug dealing and other crime became more common among disadvantaged 
young men in response to the collapse of low-skill urban labor markets in the 1980s. 
Increasing crime in poor urban neighborhoods thus provides one explanation of rising 
incarceration rates among young minority men with little schooling. 


The Labor Market and Social Control 
In the sociology of punishment, criminal behavior in the population is only weakly related to 


the scale of imprisonment (Garland 1991). Instead, for one theoretical approach, criminal 
punishment expresses authorities’ reaction to the perceived menace of marginal populations. 
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Inspiration for this argument is often traced to Georg Rusche (1978 [1933]; Rusche and 
Kirchheimer 2003 [1939]) who viewed historic variation in forms of punishment — fines, 
torture, imprisonment — as products of the economic situation of the dispossessed. The 
unemployed, representing the most desperate and crime-prone workers, occupy a special 
place in this theory. Elites would stem the threat of rising crime by intensifying punishment as 
the surplus population expanded. Punishment would then become less wasteful of labor, less 
intense, under conditions of labor scarcity. 

For the contemporary descendants of Rusche, the criminal justice system embodies a 
social conflict between authorities and marginal populations. While Rusche viewed 
punishment chiefly as means to deter crime, modern proponents see punishment as 
controlling a broad array of threats to social order posed by troublesome populations. The 
level of punishment is expected to vary with the size of the troublesome group. Empirical 
studies defined threatening populations in terms of their employment status, (e.g., Box and 
Hale 1982), race and ethnicity (Hall 1978), or some combination of the two (Melossi 1989; 
Spohn and Holleran 2000). If not through crime, how do marginal populations endanger social 
order? Some claim that troublesome populations are viewed by authorities as jeopardizing not 
just public safety, but the economic order in general (Quinney 1974; Spitzer 1975). In addition 
to stealing from the rich, the able-bodied poor may refuse to work, reject the dominant values 
of hard work and achievement, and advocate revolutionary change (Spitzer 1975). The 
destabilizing potential of low-income young men at the bottom of the social structure is well 
captured by Spitzer's (1975, 645) term, “social dynamite,” evoking volatility more than durable 
disadvantage. In this analysis, threats to public safety and social order become identified with 
marginal groups that become the focus of social control efforts. Wacquant (2000) placed the 
sociological analysis of criminal punishment in the modern context of the jobless ghetto. Like 
Freeman and others, Wacquant (2000) sees growth in the penal system as closely connected 
to the decline of urban labor markets in the later postwar period. While Freeman emphasizes 
the intervening role of crime, Wacquant’s analysis describes a “prisonization of the ghetto” 
that represents the latest form of institutionalized white supremacy — a political response to 
the demise of the ghetto as an economically viable, yet controlling, institution in the lives of 
African Americans. From this perspective, the growth in imprisonment is rooted not in rising 
urban crime, but the failure of the ghetto as a form of institutionalized social control over 
young black men. 

In the abstract, social threat accounts of punishment sound conspiratorial. By what 
concrete process do dominant groups actively use the state's legitimate violence against 
those who are relatively powerless? Socio-economic disadvantage is linked to criminal 
justice supervision in three main ways. First, legislators perceiving poor and marginal 
populations as dangerous or unruly may write criminal law to contain the threat. The 
legislative offensive against the disadvantaged has been claimed most forcefully in the area 
of drug policy. Over the past three decades, Congress and state legislatures widely adopted 
mandatory prison sentences for drug possession or trafficking (Bureau of Justice Assistance 
1998, 7). Consequently, the risks of imprisonment given a drug arrest and the proportion of 
drug offenders in state prison increased sharply from 1980 and 1996 (Blumstein and Beck 
1999). In the course of the prison boom, the passage of tough sentences for drug 
sentencing was thus associated with intensified criminalization of poor urban minorities 
(Tonry 1995; Dubber 2001). 

Second, police may surveil and arrest the poor more frequently than the affluent. Police 
partly focus their efforts in poor urban communities because more of daily life and illegal 
activity transpire in public space. Ethnographers suggest that in urban areas, the purchase 
and consumption of drugs, drunkenness and domestic disturbances are more likely to 
take place in public. In the suburbs, drug use, drunkenness, and domestic disturbances 
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are more likely to take place in private homes. Consequently, poor urban residents are 
more exposed to police scrutiny and risk arrest more than their suburban counterparts. 
(e.g., Duneier 1999; Anderson 1999; Bourgois 1996). Police also tend to view poor 
minorities as more involved in crime, treating them with greater suspicion (Wilson 1968, 
ch. 2; Chambliss 2000). 

Third, judges may treat poor defendants harshly in court. Judges can view. poor 
defendants as more culpable with less potential for rehabilitation (Kluegel 1990; 
Steffensmeier, Ulmer and Kramer 1998, 770; Greenberg 1977; Albonetti 1991). Thus, 
controlling for offense characteristics and criminal history, sentencing research finds the 
highest probability of incarceration among low-status unemployed defendants — either 
minorities or those living in high unemployment areas (Spohn and Holleran 2000; D'Alessio 
and Stolzenberg 2002). 

Social threat accounts of punishment suggest that rising inequality may increase 
incarceration among the disadvantaged. Most research on the economic determinants of 
incarceration follow Rusche and Kirchheimer by studying the effects of unemployment 
and other measures of surplus population. Chiricos and Delone (1992) report that national 
time series analyses consistently show a positive effect of unemployment on prison 
admissions. However, this pattern does not hold in the 1990s when historically low 
unemployment rates were associated with high levels of incarceration (Michalowski and 
Carlson 1999). Indeed, recent studies by David Jacobs and his colleagues find no evidence 
for the effects of unemployment on incarceration in a time series (1950-1990) or a panel 
study of 50 states in 1970, 1980 and 1990 (Jacobs and Helms 1996; Jacobs and 
Carmichael 2001). 

Although economic analysis of criminal punishment focuses on unemployment effects, 
several studies examine the impact of incomes. Strongest results are reported by Jacobs and 
Helms (1996, 2001) who find several positive and significant effects of income inequality on 
imprisonment rates (1953-1998) and prison admission rates (1950-1990). Income inequality 
effects are much weaker in samples of states or counties where cross-sectional variation 
predominates. Panel studies of states show that Gini indexes on family incomes are only 
weakly related to prison incarceration rates, after controlling for crime rates and the racial 
composition of the population (Greenberg and West 2001). Jacobs and Carmichael (2001) fit 
a fixed-effect model to panel data, showing that states with the largest increases in inequality 
did not experience the largest increases in incarceration rates. 

Previous research unevenly indicates the effects of the labor market on the scale of 
incarceration, but aggregated measurement and inefficient research designs may have 
prevented strong findings. Theory suggests that unemployment and falling wages increases 
inearceration among the disadvantaged. Still, most studies of labor market effects analyze 
aggregate crime and incarceration rates. These rates have been defined on the whole U.S. 
population, or for states or metropolitan areas. A few studies separate the involvement of 
blacks and whites in the criminal justice system (LaFree and Drass 1996; Bridges and 
Crutchfield 1988). Even here race is used as a proxy for economic status and the possibility 
that affluent blacks are treated differently from poor blacks is obscured by the aggregated 
research design. A strong test of the effects of labor market conditions on incarceration 
requires a more disaggregated analysis where incarceration risks are observed for those who 
are most economically marginal. 

Incarceration research on economic effects also applies inefficient designs that use 
relatively little information in statistical analysis. Time series studies, which offer the strongest 
support for inequality effects on incarceration, rely on data sets of 30 or 40 time points (e.g., 
Jacobs and Helms 1996). In these analyses there are often large numbers of plausible models 
and long lists of correlated covariates. Results are thus highly sensitive to the choice of model. 
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Panel studies of states use more information, but time series tend to be sparse consisting of 
just two or three time points (Jacobs and Carmichael 2001; Greenberg and West 2001; Beckett 
and Western 2001). Growth in economic inequality has been uneven, accelerating in the early 
1980s and flattening out at a high level through the 1990s. This variation is missed in panel 
studies. A stronger design examines annual variations in a cross-section of the population, 
capturing the ebb and flow of the gap between rich and poor households. 


A Disaggregated Analysis of Imprisonment 


To address the limits of previous research we develop a disaggregated approach to 
estimating the effects of labor market conditions on incarceration. Using annual data for 1983 
to 2001, men’s probability of going to prison is calculated for age-race-education groups by 
combining data from the NCRP and data on the non-institutional population. These admission 
rates are estimated for four age groups, (1) 20-24, (2) 25-29, (3) 30-34, (4) 35-39; three levels of 
schooling, (1) high school dropouts, (2) those with high school diplomas or equivalency but no 
college, and (3) those with at least some college; and two race groups, (1) non-Hispanic 
whites, and (2) non-Hispanic blacks. The dependent variable thus gives the prison admission 
rate in a given year for black dropouts ages 20 to 24, 25 to 29, 30 to 34, and 35 to 39. 
Admission rates are then recorded for black high school graduates in these four age groups 
and again for black men with some college. Admission rates are then calculated for whites 
in all these age-education categories. This coding scheme provides a 4 x 3 x 2 table for each 
of 19 years, yielding a sample of 456 admission rates for analysis. Admission rates calculated 
from the NCRP are largely consistent with those from other data sources (see Appendix), and 
the large samples of the NCRP allow sharp estimates of admission rates for specific 
subgroups. 

Our key predictors are disaggregated measures of employment and weekly earnings 
calculated from the merged outgoing rotation group files of the Current Population Survey. 
Like the prison admission rates, weekly earnings and employment figures are calculated for 
men’s age-race-education subgroups. So, for example, weekly earnings are calculated for 
black male dropouts ages 20 through 24, 25 through 29, and so on. Median weekly earnings 
data indicate a clear race gap that has persisted among young men throughout the 1980s and 
1990s (Table 2). From 1983 through 2001, the weekly earnings of young white men exceeded 
those of young blacks at the same level of education by 25 to 30 percent. The table also 
indicates increasing earnings inequality by education among white men that is well 
documented in other research (e.g., Bernhardt et al. 2002). In the 1980s, young white men 
with at least some college education earned 47 percent more than young white male 
dropouts. By the late 1990s, this earnings gap had grown to 62 percent. Inequality did not 
grow among black men due to the large decline in weekly earnings for blacks who had 
attended college. The earnings of black male high school dropouts were also more robust 
than the earnings of their white counterparts. 

Table 3 reports trends in employment by education for young black and white men. 
Employment is measured by the percentage of the non-institutional population who are in full- 
time or part-time work. Employment rates for whites at different levels of education remained 
stable throughout the 1980s and 1990s. About three-quarters of young white dropouts were 
employed in comparison to about 90 percent of young whites with at least a high school 
education. In contrast, employment declined among black dropouts, from 58.6 to 51.1 
percent but grew slightly among the college-educated. In short, earnings inequality grew 
among young white men in the 1980s and 1990s, but employment inequality grew among 
young black men. 
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Table 2: Median Weekly Earnings (1999 dollars) of Male Workers, Ages 20-39, by 
Education, Race and Ethnicity 


Less Than High High School Some College Dropout 


School orEquivalency College _Earnings Ratio 
Whites 


1983-1988 491.24 589.74 722.46 1.47 

1995-2001 437.79 539.61 707.59 1.62 
Blacks ; 

1983-1988 368.33 463.13 590.38 1.61 

1995-2001 338.46 419.29 535i12 1.58 


Note: Figures are calculated from the Outgoing Rotation Group files of the Current 
Population Survey (1983-2001). 


Table 3: Average Percentage of Men Employed, Ages 20-39, by Education, Race and Ethnicity 


Less Than High School Some College Dropout 


High School _or Equivalency College _ Earnings Ratio 
Whites 


1983-1988 76.6 90.1 86.0 
1995-2001 76.6 89.6 87.3 
Blacks 
1983-1988 58.6 78.9 79.2 135: 
1995-2001 51S 76.2 82.2 1.60 


Note: Figures are calculated from the Outgoing Rotation Group files of the Current 
Population Survey (1983-2001). 


To study the effects of earnings and employment on imprisonment we write a regression for 
the prison admission rate pyr, the proportion of the non-institutional population going to prison 
in year t = 1983, ..., 2001, es men in race / (/ = black or white) at education level / (/ = < HS, 
HS/GED, or > HS), in age-group k (k = 20-24, 25-29, 30-34 or 35-39). The effects of earnings and 
employment on the risk of prison admission can be written with the regression equation: 


log (pri = Bo + Bi Wee BoE pin X tie > Ergns 


where Wyiz is the median weekly earnings for a specific race-age-education group in year t; 
Eik is the subgroup employment rate; Xp, is a vector of other covariates like crime rates or 
dummy variables for demographic characteristics; and ez is an error term. With this 
regression, the labor market effects describe how wages and employment are associated 
with rates of prison admission across the categories of age, race and education. 

This analysis focuses just on the labor market determinants of incarceration and neglects 
many of the institutional and political variables that are often studied in research on state-level 
incarceration rates. An alternative approach could calculate disaggregated age-race-education 
prison admission rates for each state. Such an approach would take advantage of the state- 
level variation in the NCRP and the demographic disaggregation used here. A state-level 
analysis could also introduce the political and institutional variables that were found to be 
important studies of incarceration in the states (e.g., Jacobs and Carmichael 2001; Greenberg 
and West 2001). Unfortunately, such an approach is infeasible because some groups are too 
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small to accurately describe at the state level. For example, in 1999 black male dropouts, ages 
20 to 24, constituted just 1 percent of the male population under 40. With groups this small, 
admission rates, employment rates, wages and crime rates can be estimated at the national 
level with the large samples of the CPS and the NCVS, but not at the level of individual states. 

Although the disaggregated imprisonment rates provide more detailed measurement, 
estimates of wage and employment effects with this design are subject to two offsetting 
biases. First, by taking many low-skilled, crime-involved men out the workforce, imprisonment 
will tend to raise observed levels of employment and earnings. Imprisonment effectively 
truncates the lower tail of the earnings and employment distribution (Western and Pettit 
2005). The positive effect of imprisonment on employment and earnings will tend to bias the 
expected negative effects of employment and earnings on prison admissions towards zero. 
On the other hand, large numbers of entering prisoners accompany large numbers of prison 
releases. The re-entry of ex-prisoners into the labor market will tend to lower employment 
rates and lower wages. This negatively biases the estimated effects of earnings and 
employment on admission. Under the prison boom, prison admissions have slightly 
exceeded prison releases so the net bias, at least in the short-term, will tend to be small. 

Bias may also result from the strong upward trend in prison admission rates. A non- 
stationary time series, in which the mean of the series depends on time, can bias least 
squares estimates. However, tests of unit autocorrelations — unit root tests — and other 
diagnotsics show little evidence of non-stationarity in prison admission rates after adjusting 
for year fixed effects and covariates. There is evidence of serial correlation in the analyses for 
white men (with autoregression coefficients of .4 to .6) although adjusting for time 
dependence yields results similar to those reported below. 


Distinguishing the Effects of Crime and Punishment 


The labor market may affect prison admissions through the criminal behavior of the 
disadvantaged or the social control efforts directed at them by the authorities. Studies of 
national time series or panels of states have tried to identify the social control effect by 
statistically adjusting for crime rates (Jacobs and Helms 1996; Chiricos and Delone 1992). We 
extend this approach in several ways. 

First, national time series and state-level analyses use aggregated measurements of 
crime. In our analysis, we control for criminal behavior using disaggregated data for age-race- 
education groups. Because violent crime usually involves victims and perpetrators with similar 
social status, we tap crime among blacks and whites at different levels of age and education 
with victimization data from the National Crime Victimization Survey (NCVS). The NCVS 
annually asks respondents about their exposure to violence over the past year. The data can 
be used to construct violent victimization rates — the number of victims of violence divided 
by the population — for different offenses and for different subgroups. As we would hope, the 
victimization rates, like criminal offending, are strongly patterned by age, race and education. 
The variation can be described by regressing the annual rate of violent victimizations per 1000, 
V, on dummies for education (HS for high school graduates and SC for those with some 
college), race (B for blacks), and age (Ay for 25-29 year olds, Ag for 30-34 year olds, and Ag for 
35-39 year olds): 


EV)= 5175 -45HS -40SC +.15B -.41A, -.64A9 -.60A, 
(06) (05) (05) ~— (04) (06) ~— (06) -~—(.06) 


R? = .35, N = 456 
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where OLS standard errors are in parentheses. These descriptive results show the relatively high 
risks of violent victimization among young, poorly educated, black men. Although disaggregated 
victimization rates provide an indicator of crime for specific age-race-education groups, they 
obviously do not directly measure crime by perpetrators or homicide victimization. Direct measures 
of crime that include homicide are usually taken from the aggregate crime statistics compiled from 
police reports in Uniform Crime Reports (UCR). We also analyzed prison admission, controlling for 
UCR age-race specific homicide rates and obtained results substantively identical to those below. 

Second, research on the deterrent and incapacitation effects of imprisonment indicate that 
increased imprisonment reduces crime rates (Levitt 1996; Rosenfeld 2000). The dependence 
of crime rates on imprisonment may be a significant source of bias in the 1990s as crime fell 
while incarceration rates increased. Past analyses of crime rates used instrumental variables 
to identify exogenous variation in imprisonment that does not depend on the level of crime 
(Levitt 1996). Instead of trying to identify exogenous variation in crime that might drive 
imprisonment, we fit fixed effects that capture unmeasured variables that place people at risk 
of criminal behavior or incarceration. In the most detailed specifications below, we introduce 
fixed effects for every race-age-year cell in our tabular data. Variation in criminal offending by 
race, age and year is well-established in studies of official statistics and victimization data 
(e.g., Hindelang 1978, 1981; Gottfredson and Hirschi 1990; Land, McCall and Cohen 1990). 
Fixed effects — estimated with dummy variables for all race-age-year subgroups — account for 
all variables that vary by race, age or year and the interactions among them. For example, the 
effects of UCR homicide rates which are available annually for blacks and whites above and 
below age 24 are accounted for by the fixed effects model. We can also think of the fixed 
effects as tapping the criminal propensity of the population that is not fully reflected in 
observed crime rates. Involvement in drug and property crime, which are unmeasured here, 
may also be correlated with age and race and to that extent will be captured by the fixed 
effects. In the early 1990s, imprisonment rates increased but crime rates held steady leading 
some to argue that the population's propensity to commit crime had increased (Freeman 
1996). We can treat this propensity as an unobserved variable absorbed by the fixed effects. 

Although the current modeling strategy tries to distinguish the effects of crime from social 
control, there are certainly limitations. Treating fixed effects as capturing only the propensity to 
commit crime neglects social control processes that mostly affect certain race-age groups, 
particularly young black men. If drug-war policing and criminal sentencing largely affected young 
black men across the educational distribution, the social control effect will be captured by the 
fixed effects. Criminal sentencing also became more punitive in the 1980s and 1990s and this 
trend will be captured by yearly fixed effects. Similarly, variation over time in presidential 
partisanship or urbanization will also be captured by the fixed effects. In this way, the large 
number of determinants of imprisonment, besides labor market conditions and levels of crime, 
can be absorbed by the fixed effects and bracketed from the analysis. On the other hand, if 
there are processes that operate differently across levels of education or at different points in 
time, such effects will be confounded with the wage, employment, and crime variables. 

Despite these limitations, the current analysis introduces far more detailed information 
about the risks of incarceration than earlier research. The robustness of estimated earnings 
and employment effects can be examined by introducing age-race fixed effects and 
victimization rates which are known to be correlated with criminal offending. 


Results 


Before examining the effects of employment and wages, we use a regression analysis to 
describe educational inequality in prison admission. Education is measured by dummy 
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variables for high school graduates or equivalents, and those with at least some college. 
Controlling for race, the level of violent crime, age and the upward linear trend in 
imprisonment rates, the risk of imprisonment among the college educated is only 4 percent 
(e2% ~ 04) as high as the imprisonment risk among high school dropouts (Table 4, column 
1). High school education is also associated with a reduced risk of incarceration. High school 
graduates are only 20 percent (e" ~ .20) as likely to be sent to prison as dropouts. 


Table 4: Results from Regression of Men’s Log Annual Prison Admission Rates, by Age, Race and 
Education 


All Men Whites Blacks 
(1) (2) (3) (4) 

Intercept -.13 -.11 -.39 1.47 
(1.67) (.69) (2.76) (8.64) 

High School -1.63 -1.55 -1.43 -1.58 
(47.11) (21.39) (20.14) (14.71) 

College -3.33 -2.88 -3.11 -2.58 
(75.95) (33.63) (34.53) (19.62) 

Violent Crime x 10 19 11 38 .08 
(5.60) (1.15) (3.61) (.83) 

Black 1.81 1.72 a "= 

(59.80) (26.31) 

Year .04 04 .08 .06 
(14.11) (3.01) (7.86) (4.34) 

Year X High School 2 -.01 -.02 -.01 
(1.51) (3.30) (.65) 

Year X College a -.05 -.06 -.05 
(6.85) (6.72) (4.61) 

Year X Violent Crime Ke 01 #02 01 
(.82) (2.16) (1.18) 

Year X Black _ 01 ~ a 

(1.84) 
R? 97 97 .98 .96 





Notes: Statistics for 1983-2001. High school indicates high school graduation or equivalency, 
college indicates at least some college. Sample size for all men is 456; sample sizes for whites and 
blacks is 228. Regressions also control for the effect of age. All standard errors are calculated 
using a sandwich estimator to account for non-constant error variance (Wu 1986). Absolute t 
statistics are in parentheses. 


The remaining models in Table 4 examine whether education and racial inequality in 
imprisonment changed over time. Changes in the race and education effects are modeled with 
interactions with the linear time trend (Year). The interaction effect indicates that educational 
inequality in incarceration steadily increased. In 1983, prison admission rates among the 
college educated were only 4 percent as high as those for high school dropouts. By 2001, this - 
relative risk had shrunk to just 2.2 percent. In contrast to the increase in educational inequality 
in imprisonment, there is no evidence of deepening racial inequality. The race-year interaction 
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effect is negligibly small, showing that blacks’ relative risk of incarceration remained about 6 
times higher than whites’ (e'*' = 6.1) throughout the 1980s and 1990s. 

The final two columns of Table 4 report results separately for black and white men. The 
large intercept for black men indicates they are at much greater risk of imprisonment than 
whites (1.47 compared to -.39). The smaller education coefficients suggest that the risk of 
imprisonment is less class-stratified among blacks; increasing education reduces 
imprisonment more for whites than blacks. Still, blacks and whites both experienced 
increasing educational inequality in imprisonment (Table 4, columns 3 and 4). Among whites, 
the effects of high school and college became more negative with time, indicating that the 
growth in white imprisonment was concentrated among high school dropouts. Among black 
men, only the college-by-year interaction is significant, showing that the growing likelihood of 
imprisonment was broadly experienced by all those with just a high school education. Only 
college-educated black men were spared the increasing risk of incarceration. Although racial 
disparities in imprisonment have not increased in the past 20 years, prison time has become 
more widely distributed across the population of young black men without college education. 

Table 5 reports estimates of the effects of earnings and employment on prison admission 
among young black and white men. Only variation in admission rates across levels of 
education contributes to the estimates, after violent crime and year-race-age fixed effects are 
controlled (Table 5, column 1). In this model, a $100 increase in median weekly earnings- 
roughly the earnings gap between dropouts and high school graduates-is estimated to halve 
the risk of prison admission (e"” = .48). A 10 percent gap in employment rates — roughly equal 
to the dropout-graduate employment gap among whites — is associated with a 20 percent 
reduction in the risk of imprisonment. 


Table 5: Results from Regression of Men’s Log Annual Prison Admission Rates on 
Earnings, Employment and Violent Crime 


All Men Whites Blacks 
(1) (2) (3) (4) 
Intercept 2.96 Seis Cor 4.04 
(2.26) (6.97) (2.35) (12.03) 
Earnings ($100s) -.73 -.38 -.53 -.30 
(16.07) (7.44) (8.34) (4.25) 
Employment (10%s) -.34 -.14 16 -.17 
(5.09) (1.86) (1.66) (2.52) 
Violent Crime 87 01 -.06 03 
(5.16) (.19) (1.00) (.58) 
Fixed Effects 
Year X Race x Age Yes No No No 
Education x Race x Age No Yes No No 
Education x Age No No Yes Yes 
Within-group R’ 10 16 26 14 
Parameters 155 Zh 12 15 


Notes: Statistics for 1983-2001. Absolute t statistics are in parentheses. All standard errors are 
calculated using a sandwich estimator to account for non-constant error variance (Wu 1986). 
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An alternative model fits fixed effects to every education-race-age cell (Table 5, column 2). 
This specification is similar to fixed effects models for panel data (Hsiao 1985, ch. 3) in which 
only longitudinal variation contributes to the analysis. In this model, the earnings and 
employment effects are again significant, but they have declined by half to two-thirds. The 
estimates indicate that a $100 increase in weekly earnings is associated with a 32 percent 
reduction in the risk of imprisonment. A 10 percent fall in the employment rate is associated 
with an 11 percent increase in the risk of imprisonment. 

The regression analysis can be extended by studying how the effects of employment and 
earnings on imprisonment differ for blacks and whites (Table 5, columns 3 and 4). A test for 
racial heterogeneity is motivated by claims that the economic decline of young black men 
with little schooling had severe effects on their risks of incarceration (Tonry 1995; Duster 
1997; Wacquant 2000). Regression analysis for whites indicates that the risk of imprisonment 
is linked only to earnings, not employment. Because employment among young white men is 
not steeply stratified by education, nearly all the variation in prison admissions is accounted 
for by variation in weekly earnings. Real weekly earnings among white high school dropouts 
fell by about $60 between the early 1980s and the late 1990s, associated with a 37 percent 
increase in the risk of prison admission. Imprisonment among blacks, however, is sensitive to 
both employment and earnings. This suggests that the growth in incarceration is tied as 
closely to declining employment among non-college black men throughout the 1980s and 
1990s as it is to the growth in income inequality. This result is consistent with other research 
suggesting that the idleness of low-skilled black men in poor urban neighborhoods exposes 
them to greater scrutiny from police and heightens perceptions of dangerousness in the 
courts (Chambliss 2000; Steffensmeier et al. 1998). The estimated employment effect for 
black men suggests that the 7 percentage point fall in employment among black dropouts 
between 1983 and 2001 was associated with a 13 percent increase in the risk of 
imprisonment. The same risk was associated with a drop of $44 in earnings among black high 
school graduates over the same period. 

How might trends in prison admission be changed if inequality in earnings and 
employment had not increased? We address this question by comparing predicted levels of 
prison admissions for the period 1995 to 2001 under two scenarios. In the first, we predict 
admissions using the observed levels of employment and earnings. In the second, we fix 
employment and earnings at the average level for the 1983 to 1987 period. To predict prison 
admissions under our two scenarios we use estimates from the fixed effects models (3) and 
(4) for whites and blacks (Table 5). Because earnings and employment inequality increased 
from the 1980s to the 1990s and earnings and employment deteriorated for low-skilled men, 
we expect that imprisonment rates would be significantly lower if 1980s labor market 
conditions persisted through the 1990s. 

The observed levels of employment and earnings reproduce the observed patterns of 
educational and racial inequality in imprisonment. Among blacks and whites, high school 
dropouts are more than 10 times as likely to go to prison than men with some college 
education. Black men are five to eight times more likely to go to prison than white men. If 
earnings and employment were fixed at their 1980s’ level, the estimates indicate that prison 
admission rates would be between 7 and 25 percent lower by the late 1990s. The estimated 
reduction in admission rates is largest among poorly educated whites. These results are 
driven largely by the decline in real earnings among white men without a college education. If 
weekly earnings among young white high school dropouts had remained at 1980s levels, 
estimates indicate that prison admission rates by the end of the 1990s would be 25 percent 
lower than they were. The effects of wage and employment trends are estimated to be 
smaller for black men without a college education. If earnings and employment levels for 
young black dropouts remained at 1980s levels through the 1990s, estimates indicate that 
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Table 6: Predicted Average Prison Admission Rates for Men, Ages 20-39 





Employment and Earnings 


Observed 1983-87 Ratio (S.E.) 
(1) (2) (1)/(2) 

Whites 
Less than H.S. 2.60 1.96 tO 
(.05) 
High School or GED 46 34 14 
(.05) 
Some College .06 .06 .93 
(.01) 

Blacks 
Less than H.S. 12.79 10.77 84 
(.06) 
High School or GED 2.26 1.90 84 
(.05) 
Some College 59 Ro 88 
.09 


Notes: Findings based on observed and 1983-1987 levels of earnings and employment, 1995- 
2001. All predictions are based on model (3) and (4) from Table 5. Standard errors for the ratio 
of differences are calculated with simulation from the posterior predictive distribution. 


prison admission rates would be 10.8, not 13 percent. The proportionate change in admission 
rates is nearly identical for black high school graduates. The decline in earnings and 
employment among blacks who finished high school is estimated to increase prison 
admission rates by one-sixth, from 1.9 to 2.3 percent. 


Discussion 


A large research literature claims that a slack labor market intensifies criminal punishment. In 
some accounts, rising poverty lifts crime rates and, ultimately, the scale of imprisonment. 
Alternatively, economic dislocations may produce anxieties among the rich about threats to 
social order posed by the poor. These anxieties produce a punitive trend in criminal justice 
policy that draws those those on the margins of the labor market into the penal system. 
Despite these claims, empirical research unevenly supports a link between labor market 
conditions and imprisonment. Weak empirical support for the labor market theory of 
punishment results from weaknesses in research design. Although theory expects rising 
economic disadvantage among poorly educated men to increase imprisonment rates, 
previous research focused on the association between aggregate unemployment and income 
statistics and incarceration rates. 

We presented a disaggregated analysis in which the earnings and employment of men at 
different ages, races and levels of education were linked to their risk of imprisonment. These 
detailed admission rates showed that incarceration had become common at the end of the 
1990s among men with little schooling. About 16 percent of black male dropouts, ages 20 to 
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39, were estimated to enter prison each year by 2001. Regression analysis of detailed 
admission rates found that imprisonment risks shifted across levels of education, not across 
age or race groups. Most of the growth in the risks of incarceration were concentrated among 
men, particularly black men, with just a high school education. Despite the economic 
expansion of the 1990s, these men suffered the largest losses in earnings and employment 
and experienced the largest growth in imprisonment rates. We thus found that weekly 
earnings and employment rates were strongly related to prison admission for young black 
men. The empirical analysis suggests that if levels of economic inequality that were observed 
in the mid-1980s prevailed through the 1990s, prison admission rates among black and white 
men without college education would be reduced by 16 to 25 percent. 

Was the growth in incarceration due chiefly to changes in the criminal justice regime or to 
increasing crime among low-education men? Supporting the social control account of 
imprisonment, these results were robust to a detailed specification of fixed effects and 
victimization rates to control for criminality among black and white men in different age groups at 
different levels of education. Of course, crime rates may vary across levels of education in ways 
not captured by our models. Still, the race and year effects also tap social control processes that 
are correlated with earnings and employment. Social control efforts fluctuate with cyclical shifts 
in unemployment (Chiricos and Delone 1992); and criminal sentencing increasingly resorted to 
imprisonment as real wages fell through the 1980s and 1990s. African Americans are at relatively 
high risk of incarceration even controlling for criminal offending (Blumstein 1993). Over-estimates 
of the social-control component of economic effects are balanced in part by biases in the model 
for crime in the population. Given these offsetting biases, intensified punishment — not increasing 
crime — seems the likely mechanism for the economic effects. 

Controlling for crime across levels of education, also helps counter the concern that these 
results reflect the changing effects of selection as educational attainment increases. With 
increased levels of schooling in the population, dropouts in the late 1990s may be more crime 
prone than dropouts in early 1980s. If changes in selectivity contribute to educational inequality 
in imprisonment we would also expect to observe increasing imprisonment rates among the 
college-educated who accounted for a larger proportion of the population in the late 1990s 
than the early 1980s. There is no evidence for this effect, however. Indeed a striking feature of 
the prison admission trends is the stability of imprisonment rates among the college-educated. 

Finally, this analysis has shown that rising economic inequality is not only associated with 
higher rates of imprisonment, it is also associated with increasing inequality in imprisonment. 
The growth in economic inequality was accompanied by the emergence of two kinds of 
collective experience: one among college-educated whites who were largely unaffected by 
the prison boom, the other among non-college educated blacks, for whom imprisonment 
became a common life event. Because the official criminality conveyed by a prison record 
affects a variety of life chances and citizenship rights, and imprisonment is increasingly 
concentrated at the margins of economic life, the prison boom reinforces lines of social 
disadvantage. More than just a social control institution, the prison contributes to the 
formation of poorly educated black men as a discrete social group with a distinctive life 
experience that differs from the mainstream. From this perspective, the progressive 
normalization of incarceration among poor black men represents an expansion, rather than 
just a by-product, of the new American inequality. 


Note 


1. Hispanics are not included in the analysis because they are not recorded in a consistent 
way across states in the NCRP data. 
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Appendix 


Constructing the Data 


Prison Admission Rates 

The prison admission rate is defined as the number of people annually entering the custody 
of state or federal prisons as a percentage of the non-institutional civilian and military 
population. Annual age-race-education cell proportions were calculated from the NCRP 
(Bureau of Justice Statistics 2002). These cell proportions were then multiplied by aggregate 
counts of male admissions obtained from the National Prisoner Statistics Series (NPS-1) of the 
Bureau of Justice Statistics. The NCRP data yield similar age-race distributions to the Survey 
of Inmates of State and Federal Correctional Facilities. However, levels of schooling in the 
NCRP tended to be lower than in the inmate survey. The denominator of the admission rate 
— the population at risk of going to prison — was calculated from the Outgoing Rotation 
Groups files of the CPS (Feenberg and Roth 2002), and counts of military personnel obtained 
from the Department of Defense. 


Earnings 

Earnings are measured annually by the median weekly earnings of each age-race-education 
cell for all male workers, deflated by the CPI-U. Earnings are earnings-weighted figures from 
the Outgoing Rotation Group files of the CPS. Additional analysis examined earnings for full- 
time full-year workers and measures of earnings relative to different percentiles of the 
earnings distribution, but these alternative specifications yield results identical to those 
reported in the paper. 


Employment 

Employment is measured by the employment to population ratio of each age-race-education 
cell for the male non-institutional and civilian and military population. Employment rates are 
calculated from survey-weighted data in the Outgoing Rotation Group files of the CPS and 
counts of military personnel from the Department of Defense. 


Violent Crime 

Violent crime is measured by the total number of personal crimes suffered as a proportion of 

the civilian non-institutional population. The number of criminal victimizations is given by the 

incident-based files of National Crime Victimization Survey, 1983-2001 (Bureau of Justice 

Statistics 1997, 2004, 2005). Victimizations are calculated separately for blacks and whites, 

ages 20-50, at different levels of education. Denominators for the victimization rates were 
taken from the Outgoing Rotation Groups files of the CPS. 
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Exit and Voice: 
Organizational Loyalty and Dispute Resolution Strategies 


Elizabeth A. Hoffmann, Purdue University 





Abstract 

This study compares workplace dispute resolution strategies (exit, voice and toleration) in 
matched pairs of conventional and worker-owned cooperative organizations operating in 
three industries — coal mining, taxicab driving and organic food distribution. Building on 
Hirschman’s classic exit, voice and loyalty thesis, this research demonstrates how the degree 
of loyalty that workers hold affects how they approach workplace problems. I find that 
workers with greater loyalty are more likely to embrace “voice” as a way to address their 
problems. Although the “exit” patterns do not mirror the classic “exit-voice framework,” the 
data support Hirschman’s broader thesis, which incorporates examination of emotional 
involvement, and entry and exit costs. 


While previous scholars have studied Hirschman’s exit, voice and loyalty thesis in the 
workplace, few have explored workplace loyalty as an orientation that affects voice and exit 
as Hirschman envisioned it (Dowding, John, Mergoupis, and Vugt 2000). Hirschman 
asserted that, when confronting workplace problems, people’s two courses of action were 
to (1) leave the organization, “exit,” or (2) stay and express their displeasure, “voice.” Some 
scholars read Hirschman as offering “loyalty” as a third option, while others see loyalty as a 
contingency that shapes whether people will exit or voice. This paper takes the latter 
definition. Whereas both exit and voice behaviors can send a similar message to the 
organization, causing it to improve, voice is the more difficult option (Hirschman 1970). 
When will people stay and “fight,” and when will they “cut bait” and leave? Hirschman says 
the key to understanding this decision is the loyalty of the members. Those with greater 
loyalty are more likely to stay and try to change the organization from within. This is 
particularly true if they believe that their efforts have the power to influence the organization 
(Hodson 2001). Entry costs further heighten the likelihood of choosing voice rather than exit; 
those who have endured difficulty in joining the organization will be less likely to discard their 
membership lightly (Hirschman 1970). 

Some research suggests that members of worker cooperatives — businesses that are co- 
managed and co-owned by their workers — will have greater loyalty than workers in 
conventional businesses (e.g., Cornforth, Thomas, Lewis and Spear 1988; Rothschild and 
Whitt 1986). Members of worker cooperatives often have ideological attachments to their 
workplace. Sometimes they specifically sought jobs in worker cooperatives; other times they 
simply happened upon these jobs and became converted to the cooperative workplace 
ideology. Moreover, some worker cooperatives require that their members pay an amount of 
money to buy into the cooperative before they may begin working. Thus, members of worker 
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cooperatives — because of self-selection reasons or factors created by the structure of these 
businesses — may have greater loyalty. 

To investigate the impact of loyalty on the dispute resolution strategies of exit and voice, | 
compare conventional organizations and collectively-run organizations. | examine three very 
different industries, comparing one conventional and one cooperative business in each. By 
studying matched sets of conventional businesses and worker cooperatives, this paper 
explores how loyalty affects workers’ choice of dispute resolution strategies. 


Theoretical Context and Previous Research 


This article draws on the classic writings of Albert O. Hirschman as well as research on worker 
cooperatives. In his influential 1970's book, Exit Voice, and Loyalty, Hirschman asserted that 
two “contrasting, but not mutually exclusive” (1970: 15) behavior options exist for employees 
who are dissatisfied with a firm or a product: (1) exit: they can leave the firm or stop buying 
the product, or (2) voice: they can express their dissatisfaction to the relevant authority (1970: 
4). Additionally, dissatisfied people could also engage in “acquiescence” rather than either 
“voice” or “exit,” meaning that they would remain “dumbly faithful” to the firm or product 
without leaving or voicing their discontent (1970: 31). (In this article, | operationalize 
“acquiescence” as the category “toleration,” meaning that the problem is perceived but no 
action is taken.) However, only voice and exit provide the feedback that enables organizations 
to improve. When the exit option is unavailable, the only way dissatisfied people can 
communicate their dissatisfaction is with voice, so that “the role of voice would increase as 
the opportunities for exit decline.” (Hirschman 1970: 34) Thus, in situations where one cannot 
exit or the cost of exit is particularly high (such as membership in one’s family, state or 
church), the voice option is the only way to express displeasure. 

When both exit and voice are options, the decision to exit or remain will be affected by 
how effective voice would be. If workers “are sufficiently convinced that voice will be 
effective, then they may well postpone exit... [T]herefore, exit can also be viewed as 
depending on the ability and willingness of the [workers] to take up the voice option... 
[because] once you have exited, you have lost the opportunity for voice, but not vice versa.” 
(Hirschman 1970: 37) Other research, such as that on whistle-blowers, demonstrates that 
sometimes workers go outside their organization and whistle-blow only after their voices have 
been ignored by the supervisors they turned to (Rothschild and Miethe 1999). This supports 
Hirschman‘s argument that exit is closely linked to voice. 

Loyalty is the key to understanding which conditions favor exit vs. voice. Sometimes 
loyalty is defined as a possible action similar to acquiescence, while others read Hirschman 
as defining loyalty as a contingency that shapes outcomes, rather than an outcome itself 
(Dowding, John, Mergoupis and Vugt 2000: 447). This paper embraces this second, and more 
frequently used, definition of Hirschman’s loyalty. This study defines loyalty as allegiance to 
a concept outside of the self, such as an organization. The presence or absence of loyalty 
affects what courses of action individuals will choose, such as whether to continue in an 
unpleasant circumstance but voice their unhappiness or to exit the unpleasant circumstances. 

When Hirschman defines loyalty as loyalty to the organization, he posits that people who 
have an exit option will stay and, instead, engage in voice if (1) they're willing to forgo the 
certainty of exit and embrace the uncertainty of possible improvement and (2) they believe 
they can influence the organization. In this way, “loyalty holds exit at bay and activates voice” 
in that it lessens the attractive ease of exiting by raising the cost of exit, making the exertion 
needed for voice less unappealing (1970: 78). Yet, voice is most effective when “backed up by 
the threat of exit.” (1970: 82) 
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High entry costs can affect the timing of voice and exit, although Hirschman emphasizes 
that high entry costs do not cause acquiescence but, instead, promote greater voice. Those 
who have paid dearly to enter will be disinclined to exit readily or to stay and remain passive, 
but rather will embrace voice to address the problems they see. High exit costs might have 
a similar effect: those for whom exit demands great sacrifice will be hesitant to exit and will, 
instead, consider the option of voice (Hirschman 1970). 

Hirschman’s loyalty is often only applied within the context of the organization, i.e., loyalty 
to the firm itself. Other times, however, loyalty is examined both with regard to loyalty to the 
organization as well as to loyalty to the industry or to industry-related ideals or goals. For 
example, Perrucci et al. examined whistle-blowing and found that sometimes employees may 
have less loyalty to the specific firm, but more loyalty to the industry or to certain ideals 
(Perrucci, Anderson, Schendel and Trachtman 1980). 

Other researchers have explored concepts similar to loyalty, such as allegiance, 
commitment and attachment. This study's specific definition of loyalty is somewhat similar to 
the definition of organizational commitment suggested by Meyer and Allen (1997), which 
describes commitment as having three components: affective, normative and continuance. 
They define affective commitment as an employee's emotional attachment to, and 
identification with, the workplace. Normative commitment refers to commitment out of 
obligation or feelings of moral responsibility. Continuance commitment — the more rational, 
cost-benefit component of commitment — is defined as commitment to the organization 
because the employee needs the income of that job (Meyer and Allen 1997). 

The definition of organizational loyalty this project uses somewhat parallels this definition 
of commitment. Loyalty, as an allegiance to a concept outside the self (as mentioned above) 
has two components: an emotional/affective component and a goal allegiance component. 
The emotional/affective component mirrors the affective component of Meyer and Allen: it 
involves workers’ identification with, and emotional attachment to, their workplaces. The goal 
allegiance component addresses the loyalty employees feel toward an organization when that 
organization exemplifies or agrees with their own broader goals; this is similar to Meyer and 
Allen’s normative commitment. An example of this would be the worker who feels strongly 
about the environment and recycling; he is committed to his recycling job because it’s in 
accordance with his goal to help the environment. This project's theory of organizational 
loyalty does not have a component that parallels continuance commitment because loyalty in 
this context is conceptualized as allegiance which surpasses more self-focused concerns 
such as personal income. 


Worker Cooperatives’ Possibilities 


Worker cooperatives, organizations in which all employees collectively manage and own the 
business, pose an intriguing arena for exploration of Hirschman’s thesis. Some researchers 
assert that the benefits of co-ownership and greater participation result in stronger loyalty to 
the organization as well as heightened worker satisfaction and greater productivity (e.g., 
Hodson 2001; Hyde 1991; Kanter 1972; Linehan and Tucker 1983). Members of worker 
cooperatives may have greater loyalty to their workplaces than employees of conventional 
businesses due to the members’ ideological commitment to this particular type of 
organization, the buy-in cost of putting forward one’s own capital to join the business, or both. 
Indeed, the extant literature demonstrates that simply owning a share of the business (e.g., 
Employee Stock Option Plans) may not be enough to produce greater loyalty. Collective 
management and worker participation also appear to be necessary (Cornforth, Thomas, Lewis 
and Spear 1988). Indeed, some studies indicate that the opportunity to participate in how 
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one’s workplace is run can significantly affect how employees feel towards the organization 
(Hodson 2001; Hoffmann 2001). However, the flip side of this greater loyalty is that members 
of worker cooperatives may also have higher expectations for the organization, thereby 
increasing the probability of greater dissatisfaction (Rothschild and Whitt 1986). 

Usually at least some portion of the coop members self-select into the organizatian 
(Rothschild and Whitt 1986; Hoffmann 2005). This decision to actively find a cooperative 
workplace is an example of loyalty before actual employment began. Even a worker, who may 
have entered the cooperative workplace with little loyalty and commitment, may develop 
such sentiments. The new experience of greater worker input and freedom to participate in 
managerial decisions can transform a new member into a strongly committed coop loyalist 
(Jackall and Levin 1984). The new experiences with cooperative management, including 
respect and appreciation from worker-managers, can have a profound effect on how the 
worker perceives the organization (Hodson 2001). 


Sampling and Methods 


| used the comparative case method to explore the effect of loyalty on “toleration” (similar to 
Hirschman’s acquiescence), exit and voice, particularly voice as seen in workplace grievance 
resolution (Ragin 1987). | interviewed and observed workers in three industries: coal mining, 
taxicab driving and organic food distribution. Within each industry, | studied a worker 
cooperative and a matched conventional business. The industries in this study offered a range 
of workplace cultures, gender balances and business objectives. | visited each business 
twice, observing and interviewing workers and achieving variation in interviewees on many 
dimensions. The duration of the visits ranged from a few days to two weeks. 


Sample 


| conducted a total of 128 interviews: 18 at HealthBite Distributors, 35 at Organix Coop, 14 at 
Private Taxi, 20 at Coop Cab, and 41 at Coal Cooperative/Valley Colliery. (Coal Cooperative and 
Valley Colliery were the same physical mine, but under different ownership and management 
systems.) For each site, Table 1 provides summary statistics on the interviewees as well as on 
the organizations, themselves. | did not identify a specific group of workers whom | knew to 
have had disputes, but spoke to all interviewees about their workplace experiences generally. 
| included a wide variety of interviewees to maximize the range of problems and experiences 
as well as the variety of solutions and expectations to be included in this study. My sample 
included present and former employees as well as managers and worker-managers. 
Interviewees also differed in terms of length of employment, sex, race, age, level of 
education, socioeconomic status and section of the particular business. Through careful 
sampling and the repetition of responses | encountered as interviewees spoke of similar 
themes, | have become confident that my findings are well triangulated and valid. Although 
these interviewees are not statistically representative of all the workers at their individual 
organizations, the diversity of this sample is helpful in developing conceptual models. 


Selection of Industries and Organizations 

The interviewees were drawn from six worksites in three industries: (1) coal mining, (2) 
taxicab driving, and (3) organic food distribution. The industries ranged from the coal mining 
industry where the workers are very pro-union, to the non-conformist- or loner-oriented 
taxicab industry, and the progressively-oriented organic food industry. The industries also 
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Table 1: Summary of Sites and Interviewees 


Type of No. of No. of 
Industry _ Organization _ Location Workers__ Interviews —% 
Valley Wales 
Colliery coalmine ——_ conventional (U.K.) 252 38™* 15 
worker Wales 
Coal Coop coal mine cooperative (U.K.) 239 41** 17 
taxi Wisconsin 
Private Taxi —_ service conventional (U.S.) 120 14 12 
taxi worker Wisconsin 
Coop Cab service cooperative (U.S.) 150 20 13 
HealthBite organic London 
Distrib food conventional (U.K.) 32 18 56 
Organix organic worker Halifax 
Coop food cooperative (U.K.) 50 oO 70 





ranged from having a predominantly male workplace culture, such as coal mining, to being 
less explicitly gendered, such as the organic food industry. These differences in workplace 
culture are somewhat reflected in the industries’ different gender balances: 50:1 men to 
women at the coal mines, 5:1 in taxi driving, and 1:1 in organic food. Table 1 provides a 
summary of the organizational attributes of each business. 

All businesses in this study met several key criteria. First, the company needed to have a 
formal system for grievance resolution. Second, it had to be sufficiently large that a formal 
grievance system was necessary. For this study, the minimum size of an organization was 30 
workers. Third, each business had to be a stable organization with established procedures; 
none was less than two years old. Fourth, no organization could be part of a larger 
organization. Additionally, each cooperative included in the study had to be a true worker 
cooperative — with all employees being equal shareholders and no outside shareholders — not 
merely an Employee Stock Option Plan (ESOP) company. 

Within each industry, | compared a worker cooperative (non-hierarchical workplace in which 
all workers are co-managers and co-owners) to a conventional, hierarchical business matched in 
size and gender ratios. The coal mining and organic food distribution were studied in the United 
Kingdom; the taxicab industry was studied in the United States. Because these two cultures are 
sufficiently similar, no cross-cultural comparison is included in this study. Admittedly, people are 
more class-conscious in the United Kingdom, but fundamental disputing culture, as seen in the 
two legal systems, is sufficiently similar (Wheeler, Klaas, and Rojot 1994). 

The businesses are summarized in Table 1. | looked at two organic food distributors: 
Organix Coop, a worker cooperative located in the mid-North of England, and HealthBite, a 
conventional business located near London. As organic food distributors only, they do not 
produce any of their products. While some workers in the industry describe the attraction of 
these jobs as simply the need for a paycheck, others spoke of their dedication to the organic 
and whole food movement and saw the jobs as a type of activism. Organix Coop was begun 
more than 20 years ago by college students who wanted to create a better, healthier, more 
egalitarian work environment. This consciousness of the worker cooperative ideology still 
permeates the business. Workers at Organix Coop became members after completing a 
probationary period and being voted into membership by the current members. After they 
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became members, they received their part of the company’s profits as well as wages, and 
became “vested” in the company, with each worker owning a single share of stock, 
regardless of tenure. When they left the cooperative, they must sell their share back to the 
company, generating a type of severance pay. 

Both the conventional taxicab company (Private Taxi) and the cooperative taxicab company 
(Coop Cab) are located in the same Midwestern town, whose alias is Prairieville. Coop Cab 
was begun more than 20 years ago by cab drivers who were out of work due to strikes at two 
of the city’s main taxicab companies. Possibly.affected by its location in a Big 10 university 
town known for its progressive politics, Coop Cab embraced the worker cooperative ideology 
in trying to create a better workplace, although not as strongly, uniformly or dogmatically as 
Organix Coop. (Both cab companies reputably had “overly-educated” drivers, often with 
advanced, even doctoral, degrees (Langway 1997). Workers at the cooperative, Coop Cab, 
became members after they had successfully completed a probationary period as 
determined by the membership committee. 

Finally, Valley Colliery and Coal Cooperative were deep-pit mines, meaning deep 
underground mining as opposed to strip mining. The two coal mines in this study were actually 
the same physical mine under two different systems of ownership and management. The alias 
“Valley Colliery” refers to the mine when it was nationally owned by the British Coal Board, 
while “Coal Cooperative” refers to the mine after it became a worker cooperative. All interviews 
for both Valley Colliery and Coal Cooperative were conducted several years after the re-opening 
of the mine as a cooperative. This mine was the last deep pit in Wales and one of the few left 
in the U.K. As such, employment at the mine — both when it was still part of British Coal and 
after it became a worker cooperative — held important cultural significance for the miners, who 
deeply identified with the mining occupation. During the period between the closing of the 
mine by the Coal Board and its reopening as a cooperative, some out-of-work miners took 
factory jobs, the only other jobs in the area. They described them with much contempt, often 
saying that they would rather go on government assistance than work there again. After the 
mine re-opened as a cooperative, workers had to become members before they could begin 
working. Each worker had to buy a single share of the cooperative at approximately $13,000. 
As with the other two cooperatives, this share entitled the member to profit sharing as well as 
wages. When the worker left the coop, this share would be bought back by the company. 


Methods 


One of the key benefits of qualitative studies is the high validity possible: the researcher can 
understand the greater context, obtain a large overview and triangulate the accounts of 
differently situated interviewees with various bases of knowledge. | employed a qualitative 
comparative case method to study three very different industries, each with one cooperative 
and one conventional organization. In gathering data for this study, | interviewed workers, 
observed behavior, read related documents and articles, attended companies’ business 
meetings, went to grievance hearings, and participated in aspects of some businesses (e.g., 
went down into the coal pit, rode in the taxicabs). 

The interviews ranged from 20 minutes to more than five hours, with most lasting 
between 30 and 90 minutes. At least one especially lengthy interview occurred at each 
business. All interviews and most site observations were tape-recorded and transcribed, so 
all quotes used here are direct quotes. These data were analyzed using the qualitative data 
software NVivo. Each interviewee is identified by a three-digit number. 

The main focus of the interviews was how the interviewee would handle potentially 
grievable circumstances. Interviewees were asked mostly general, open-ended questions, 
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but with some direct questions, especially as follow-up inquiries. In discussing grievance 
resolution strategies respondents would often draw on examples from their past experiences. 

The transcribed interviews were coded, using NVivo, for various themes. Some of these 
themes were responses to explicit questions (e.g., “In what ways is your job difficult?”). 
However, many others were extracted from the responses of interviewees to broader 
questions (e.g., “How would you describe your job?” “How would you recommend/criticize 
your job to another worker in the same industry?” “What would you change about your job if 
you could just snap your fingers and it would be different?”) or to follow up questions to other 
responses. Thus, many codes, such as loyalty, were not the result of a direct question or set 
of questions intended to measure loyalty, but were produced by careful analysis of 
interviewees’ responses to various questions. 


Results and Discussion 


One key voice mechanism that Hirschman mentions is grievance raising (1970). In this article, 
| focus on how this specific voice mechanism of grievance-raising works differently in 
organizations with greater and lesser workplace loyalty. | operationalize grievance-raising as 
bringing forward both formal and informal grievances, and define strikes as a type of formal 
grievance. 

After assessing that cooperative members truly did hold greater loyalty to their 
organizations than did their counterparts in conventional organizations, | compared how these 
two groups of workers addressed workplace problems. They mentioned three dispute 
resolution strategies: raising a grievance (voice), learning to cope with the problem (toleration, 
a rough equivalent of Hirschman’s acquiescence), and quitting their job altogether (exit). 
Although exit and toleration are not actually ways to resolve disputes, they are strategies 
workers use to end the conflict in their work lives. 


Loyalty 


| did not simply assume that members of cooperatives would have greater loyalty to their 
organizations, although the literature strongly indicated this. Rather, | compared the 
statements of the coop members with the employees in the other conventional businesses. 
The coop members consistently made statements about their loyalty to the organization (92 
percent). While the conventional business employees spoke infrequently about loyalty, when 
they did express such sentiments, they spoke of loyalty to the industry as a whole. 

For example, a member of the taxicab coop described the high level of loyalty to the 
cooperative which was echoed by many other coop members. 


Yet they choose to stick with [Coop Cab] because they believe in the 
cooperative and they feel a loyalty to the cooperative. | think there’s 
something that's very good about that... Because they believe that the 
way the coops are structured is a lot more beneficial to the workers. 
And the coops tend to deal with issues that help the community and 
they're more community-oriented than profit-making. So having a 
loyalty to that motivation is, in my mind, a better motivation than going 
someplace where your wages might be higher, but you're not 
necessarily supporting something that's community-oriented, 
community-based. [052] 
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This statement provides a good example of the goal allegiance component of loyalty. 
The members at Coal Coop spoke of both loyalty to the organization as well as loyalty to 
each other. The quotation below offers an example of the affective component of loyalty. 


They're very protective... You can tease yourself and laugh about each 
other, you know, as a part of the system, but if anybody else tries to 
hurt [a co-worker], he’s one of the pack then and you look after him 
and defend him before anybody [whether] he is young, he is new in 
the industry, [regardless, you've] got to protect him... | think the bond 
is definitely stronger underground. But | would say the washery down 
there, them men there, have also got a bond, a bond to friendship 
and loyalty with themselves. [113] 


Similarly, one of the workers at Organix Coop described her loyalty to the organization as 
appreciating the uniqueness of the cooperative work environment. 


You're part of a special organization... Sometimes it’s about this 
special feeling because [for example] you're working individually, but 
you don't finish until everyone finished. There were so many things 
that were going to go out [for shipment]... The work was always done 
at the end of the day. And, ideally, it was done as a team at the end 
of the day. You run around like mad and helped everybody else... In 
a [conventional] workplace, you're expected to just do your own job 
and go home. No one cares [in those businessesj. [138] 


Thus, the distinctiveness of being a member of a cooperative and the bonds forged with 
other members created a great deal of loyalty. 

This was not found in the sentiments of their peer workers employed by matched 
conventional businesses. For example, one worker from HealthBite provides a concise, if 
dramatic, articulation of the sentiment of being simply a hired hand. 


/f you have a problem, good luck! We've got a complaint procedure 
we go to if we got a complaint about something or anything like that. 
They try and solve it. But, really, we're on our own. [086] 


Although the conventional business employees mentioned loyalty to the goals of industry 
(such as HealthBite workers’ commitment to organic and whole foods), these workers 
generally did not mention loyalty to the organization or to co-workers. 


Voice and Loyalty 


The absence of organizational loyalty among these conventional-organization employees 
made them less likely to include voice in their dispute resolution strategies. In contrast, the 
presence of loyalty made the worker cooperative members more likely to anticipate engaging 
in voice. Comparing each percentage within each industry, the cooperative members are 
more likely to engage in dispute resolution (how this article is operationalizing voice) than their 
same-industry conventional counterparts, as Table 2 shows. 

Table 3 presents the frequencies of the employees in each type of organization, showing 
how many expressed loyalty and how many in each category indicated voice, toleration or 
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exit. Table 3 presents only the raw numbers, rather than the percentages, to enable the reader 
to make comparisons across the table. For example, using Table 3, the reader can compare 
employees who expressed loyalty in each industry, can contrast employees who expressed 
loyalty with employees who did not express loyalty within a single industry, or can compare 
the responses of employees in hierarchical and cooperative businesses across the industries. 
Table 3 clearly demonstrates that workers who expressed loyalty were more likely to engage 
in voice than workers who did not express loyalty and that cooperatives were more likely to 
have members who expressed loyalty. 


Table 2: Dispute Resolution Strategies: Percentages of Employees Who Mentioned each Dispute 
Resolution Strategy 


Taxicab Industry Whole Foods Industry Coal Mining Industry 
Private Coop HealthBite Organix Valley 
Taxi Cab Distrib Coop Colliery Coal Coop 
Voice 57 100 56 91 90 100 
Toleration 36 5 61 29 11 2 
Exit 36 20 qT 23 0 0 


Percentages sum to greater than 100% in some cases because the categories are not exclusive; 
some interviewees mentioned more than one dispute resolution strategy. 

Note: A “no response” category does not exist. All interviewees provided at least one strategy 
(voice, toleration or exit). 


Taxicab Industry 

The responses of workers in the taxicab industry offer strong support for Hirschman’s exit, 
voice and loyalty thesis. The members of Coop Cab were much more likely (100 percent) to 
describe a type of voice (formal or informal grievances) when explaining their dispute 
resolution strategies, in contrast to those in Private Taxi who less frequently mentioned voice 
strategies (57 percent). For example, one woman from Coop Cab explained that sometimes 
one had to speak up and bring a grievance if a decision happened that harmed the 
cooperative in some way, even if one did not experience that harm directly oneself. 


/ felt that [the worker manager] had really screwed up... [That policy 
decision] totally went against what a cooperative should do. Now, / 
could have just left it at that... In a way, it wasn't going to really affect 
me. But | had a duty to say something [and bring a grievance}... Just 
as [the worker manager] is a co-owner, I'm a co-owner and its my 
coop, too. [062/ 


This quotation illustrates both the affective and goal allegiance components of organizational 
loyalty. 

In contrast, workers at Private Taxi were more likely to take a more informal route when they 
tried to resolve their grievances. A Private Taxi driver explained that people try whatever means 
they can, with whomever they can, when they attempt to resolve grievances informally. 


Everyone goes to whoever they think they're gonna get some 
satisfaction from. If someone has a good relationship with [the 
owner], then they'll probably go to [him] and see what they can do. 


Se 
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/f someone has a good relationship with [the manager], at this point in 
time, they'll go to [him] to see what they can do. In some cases, 
people will go to the dispatcher and say, ‘Hey, you know, listen, | got 
this problem with this guy that’s doing this other shift, and | feel like 
he’s screwed me over. Or whatever.’ [063] 


Given the above pattern, it is not surprising that Private Taxi employees were more likely to 
have toleration among their dispute resolution strategies (36 percent as opposed to Coop 
Cab’'s 5 percent). Aman from Private Taxi, for example, described how he has come to quietly 
accept injustices on the road: 


First couple of years | worked here, I'd tel! the dispatcher [if another 
driver cheated me]... [Now] how do | handle it? | used to complain to the 
dispatcher. He said, ‘ll get you another ride.’ / said, ‘That doesn't settle 
nothing!’ | told management about it. They didn’t do nothing about It. 
They didn’t reprimand the driver or anything like that. So after a couple of 
years, | just ignored it. If somebody's stealing my ride, Id just say, ‘Well 
somebody else got the ride.’/ just stay real calm because | know it ain't 
gonna do no good to complain on it. You can't get uptight about it. [107] 


Thus, for some Private Taxi workers, learning to tolerate problematic situations was preferred 
to actively trying to resolve the problem. 

The pattern for exit dispute resolution strategies was the reverse of the patterns seen with 
voice, as Hirschman would have predicted. Thirty-six percent of Private Taxi employees 
included quitting in their dispute resolution strategies, while only 20 percent of Coop Cab 
workers did. This 24-year-old taxicab driver explained that the lack of long-term career 
potential meant that one could always exit. 


The ultimate thing is, we can always quit. It’s just cab driving. It’s a 
nice job, but it’s just a job. It’s just cab driving. It’s not like it’s a 
profession. You can always quit. [108] 


However, it should be noted that the lack of long-term career potential and minimal exit and 
entry costs were also true for the workers at Coop Cab, yet they did not embrace this “we can 
always quit” approach. In fact, when Coop Cab workers did say that they would quit, they 
often stated doing so because they were loyal to the cooperative and its ideals. For example, 
some said that if wrongful policies were adopted or poor decisions were made that they felt 
compromised the cooperative’s integrity, they would quit as a form of protest. A similar 
attitude about exit was voiced by the members of the organic food cooperative, Below, | 
- quote a member of Organix Coop to illustrate this approach. 


Organic Food Industry 

Workers at Organix Coop would talk about how they had heard about the worker cooperative 
from friends and intentionally sought out Organix because they wanted a cooperative work 
environment. Some described how they had left more highly paid jobs in the private sector to 
take a job at Organix to be part of a worker cooperative. In contrast, HealthBite workers often 
described their motivation for their job at HealthBite as focusing on their paycheck. Although 
some were motivated by a commitment to the organics movement, others saw their jobs 
simply as an easily accessible, low-skill job which provided a decent wage. Most stumbled 
into their current jobs through word-of-mouth or through answering ads in the newspaper. 
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Members of the worker cooperative were much more likely (91 percent) to mention voice 
(formal or informal grievances) as one of their dispute resolution strategies than their 
counterparts in the conventional organic food company (56 percent). Sometimes Organix 
Coop members preferred informal routes: 


/ tend not to go to the meetings. | like to chat things up in the pub 
[with the people involved]. That’s just the way | like to do it. [007] 


Other times they preferred formal grievance resolution, such as this Organix Coop member 
who described a time when he felt frustrated with some aspects of the organization. 


/t was just that the department wasn't acting in the way | wanted it to 
work. My relationship with the department was suffering and / 
thought that | couldn’t get on with my job, so | took the whole of the 
department to grievance. /t turned out that there was something 
available to me in terms of information or the way my job could be 
done to work better. So we worked it out quite well... | [had been] 
frustrated; | [had] thought about just leaving. Find something else... 
But | stayed and | saw it out. That wasn’t the easy path, but stayed 
because | care [about Organix Coop]. [145] 


Rather than quitting or suffering in silence, he raised a formal grievance against his entire 
department. His description provides an example of the affective component of organizational 
loyalty. 

The employees at HealthBite were less likely (56 percent) to attempt to actually resolve 
their workplace problems, but when they did, they were more likely to rely on resolving 
grievance informally (not reflected in Table 2, but see (Hoffmann 2003) for a discussion of 
formal versus informal dispute resolution in the taxicab industry). For example, one senior 
woman from HealthBite explained how she engaged in informal grievance resolution on 
behalf of other workers as well as herself. 


/'m the strong character, so | pretty much stand up for myself... There've 
been occasions when there've been female members here on some 
occasions when they felt that there’ve been injustices and they've 
come to me. And Ive sort of pushed on their behalf because I've 
worked more closely with [the two owners] and maybe | know them 
better than some of the other staff, And maybe the staff feel they can't 
approach the directors or they're not sure how they'll take something or 
whatever. So I'll quite happily go in there with them or go in there on 
their behalf and say ‘So-and-so is not happy with this,’ [or] They feel that 
they should be on a higher rate because of that.’ [143] 


Although this quote shows a conventional organization worker who does include voice 
among her strategies, it also highlights how her co-workers do not feel able to engage in the 
informal grievance resolution that she does. These co-workers were not able to engage in 
voice and eventually exited. 

Understandably, given the voice pattern above, toleration strategies were more 
frequently mentioned by workers at HealthBite (61 percent) than at Organix Coop (29 


percent). A representative employee at HealthBite explained that often his preferred path 
is to do nothing. 
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/f someone else isn’t doing their work and I’m doing it all, what I've 
learned in the past, is to just shut your mouth and keep doing it. 
‘Cause that’s how it works. | just shut up and keep doing it. [059] 


These employees had developed various ways to cope with problems they could not or 
would not resolve. Their ability to tolerate allowed them to avoid any sort of grievance 
resolution and to remain in their jobs. 

Both the voice and toleration patterns in the organic food industry clearly support 
Hirschman’s model. The exit patterns, however, do not follow the basic “exit-voice 
framework” but do support Hirschman’s broader thesis. In the organic food industry, a 
somewhat higher percentage of worker cooperative members mentioned quitting their jobs 
as a way to resolve workplace problems (23 percent) than did their counterparts in the 
conventional company (17 percent). 

Workers at HealthBite who mentioned exit as a strategy felt that quitting was an easy 
option if they found themselves unhappy at their jobs because such jobs are easily replaced 
and, therefore, quite disposable. 


What I’m getting here, | know | could get in another job. | can go to 
an agency and get another job. So I’m not fighting for a career here. 
It’s just a casual job and I’m not gonna fight to keep it. If there was 
something | really couldn't stand, then | would move on. [159] 


However, workers at Organix Coop explained that they would quit — not because they could 
find another similar job easily — but if they felt betrayed by the cooperative. For example, one 
woman said that if she felt sufficiently upset to bring a grievance, she would probably leave 
rather than raising the formal grievance, illustrating irony of the goal allegiance component 
of loyalty: 


/ think if | felt | was in a position where | was wanting to take another 
member of Organix to grievance, then | probably would leave. I'd just 
feel that I'd been let down too badly [at that point]. [010] 


Thus, although the percentages (23 percent at Organix Coop; 17 percent at HealthBite) do not 
reflect what Hirschman might have predicted with regard to exit, the reasons for exit fit well 
into Hirschman’s thesis. Hirschman asserts that “those [workers] who care the most [sic] 
about the quality of the product and who, therefore, are those who would be the most active, 
reliable, and creative agents of voice are, for that very reason, also those who are apparently 
likely to exit first in cases of deterioration.” (1970: 47) Therefore, it is interesting to note that 
Organix Coop, the coop with the greatest cooperative ideology zeal, is also the coop where 
its members report more exit strategies. Indeed, the anticipated use of exit by the Organix 
Coop members in the event of frustration with or betrayal by the cooperative supports 
Hirschman’s thesis in that it is these people, who most care about the organization as a 
concept, who are also more likely to consider exit. 


Coal Industry 
The percentages of various interviewee responses within the coal industry offer the least 
support for Hirschman’s thesis in some ways. Yet, in other ways, the industry acts much as 
Hirschman would predict. 

Here, the percentages of workers who mentioned raising a grievance either formally or 
informally (voice) was very similar for both the cooperative (100 percent) and the conventional 


2326 © Social Forces Volume 84, Number 4 * June 2006 


business (90 percent). However, despite this numerical similarity, the range of topics that 
workers could raise was much greater in the cooperative. One electrician, for example, 
recounted a formal grievance brought soon after the mine reopened as a cooperative: a 
grievance about the toilet paper. 


Another thing they wanted changed when we came back as a 
cooperative was the toilet paper. The toilet paper [the miners used], 
they were the old government bloody thick paper. A simple thing like 
that. And the managers, under British Coal, their toilets up there, they 
had the soft, bloody soft, pink paper. The things like that. Silly little 
things. But it matters. It says, ‘I’m no better than that manager over 
there and he’s no better than me.’ [129] 


For this miner, raising this grievance, despite its somewhat trivial focus, was important; it 
demonstrated loyalty (goal allegiance, in particular) to his cooperative’s ideal of equality 
among all members. 

The variation between the company’s workers regarding mentioning toleration as a 
strategy does support Hirschman’s framework. Workers at Valley Colliery were five times 
more likely to mention developing coping mechanisms as a way for dealing with problems 
at work as compared to those at Coal Cooperative. However, the union chairman described 
how occasionally this toleration would build at Valley Colliery and grow into resentment, and 
eventually explode into formal action (voice). 


The resentment would build up and build up. Not just one person, 
dozens and dozens would build, up until one day something minor 
might happen. And, bang! There's war. It’s strike! What the bloody 
hell?! It's been building and boiling for months and months because 
of all those little incidents that have been happening for weeks and 
weeks and all of a sudden, it just blows up, bang! [129] 


Other times, the employees would continue silently tolerating disputes, without these leading 
to any formal action. 

The numbers of workers who mentioned leaving as one of their dispute resolution 
strategies (exit) were identical: none. No one at either Coal Cooperative or Valley Colliery 
mentioned quitting as a possible way to resolve workplace grievances. This could be tied to 
the workers’ expressed concern in preserving their jobs and keeping the mine open, knowing 
that many deep-pit mines in the U.K. had already been closed. These miners expressed 
concern not with simply keeping a job, but in keeping these jobs. Many of them had fathers 
and grandfathers who had been deep-pit coal miners. For them, deep-pit mining was part of 
their identity and their heritage. 


/ think coal miners universally are very much the same sort of people. 
[in] these communities here, probably [for] anybody [who] works in 
[Valley Colliery], either their father or grandfather or great-grandfather, 
uncle or great uncle, was constantly working [in the coal industry]. 
They been brought up not only on coal mining when they are in the 
mine, but probably the conversation at their home has been about 
coal mining and in the pub on Saturday, and the club on Saturday, are 
predominantly coal talk. People say their fathers talk with their uncles. 
So what are they going to talk about? They talk about what's 


Exit and Voice * 2327 


happening at work: ‘What's happening in your pit?’ ‘In my pit?’ So 
predominantly we come from a coal culture. [113] 


Since the workers saw the jobs at the coal mines as precious commodities, it is not surprising 
that no one mentioned leaving. 

In fact, Hirschman says that the amount of loyalty often depehtis on the person's 
identification with the organization or group. Dowding et al. (2000) elaborate on this aspect of 
Hirschman’s thesis: 


A Londoner may feel loyalty to his country because he is ‘British’, and 
that is one of the ways in which he identifies himself, He may retain 
that loyalty to his country no matter how much he disagrees with the 
policies of the government representing the country. To exit from such 
a group just because he disagrees with the policies of the government 
is an exceedingly costly and painful process. Similarly a person may 
feel great loyalty to the firm for which she works, despite considering 
its current senior management to be fools and buffoons ruining the 
organization (Dowding, John, Mergoupis, and Vugt 2000: 447). 


Indeed, the workers at both of the coal mines — both Valley Colliery and Coal Cooperative — 
strongly identify with their particular mine. To leave that organization would mean to have to 
change their identity, to reconceptualize who they are. Thus, the exit costs are extremely high 
for both groups of miners, resulting in no one considering exit as a possible dispute resolution 
strategy. 

Additionally, as mentioned in the site descriptions of the Sampling and Methods section 
above, those miners who were forced to take factory jobs (the only other jobs available) 
between the time of the mine closing and its re-opening as a cooperative, detested the factory 
work. They stated that they would rather have no job than to have to return to factory work. 
This factory work was small-scale assembly of computer parts, what more than one miner 
referred to as “dainty women’s work.” In many ways, the only way to maintain their identity, 
particularly their masculine identity (Cheng 1996; Connell 1995), was to work in the mines. 

Moreover, the mining coop also had the highest entry costs, demanding a $13,000 buy-in 
by each worker. Therefore, one might speculate that the members of the cooperative would 
be even more reluctant than their conventional organization counterparts to exit, although one 
can't have a rate lower than zero to actually express this. 


Conclusion 


This study examined the impact of loyalty on worker dispute resolution strategies. Specifically, 
it compared high-loyalty workplaces (worker cooperatives) with low-loyalty workplaces 
(conventional companies). Workers at the high-loyalty workplaces were more likely to include 
voice among the ways they would resolve workplace problems, while workers at low-loyalty 
companies were less likely. Additionally, workers at low-loyalty companies were more likely to 
engage in toleration strategies (similar to Hirschman’s acquiescence), a behavior that involved 
neither voice nor exit. These voice and toleration patterns neatly followed Hirschman’s exit- 
voice framework. 

The exit patterns, however, were more complicated. In the taxicab industry, the exit pattern 
did follow the exit-voice framework. In the organic food industry, the worker cooperative 
members, not the conventional organization's employees, were more likely to include exit in 


2328 « Social Forces Volume 84, Number 4 * June 2006 


their anticipated dispute resolution strategies. Initially, this seemed contrary to Hirschman’s 
thesis. However, the reasons why the two groups of workers considered leaving were very 
different and support Hirschman’s thesis. The worker cooperative members’ greater 
ideological commitment made them, ironically, more loyal and more likely to become 
frustrated and consider leaving (Hirschman 1970; Rothschild and Whitt 1986). 

In the coal industry, workers at both the conventional and the cooperative company did 
not include exit among their dispute resolution strategies. While these zero percentages do 
not coincide with the exit-voice framework, a further exploration of this industry shows that 
these workers’ dispute strategies support Hirschman’s thesis. This industry had such 
significant exit costs that the workers in both businesses were extremely reluctant to leave. 
Working with coal and in the mines was a central part of these workers’ identities. To leave 
these jobs would be to remove a part of themselves. This level of identification keenly 
affects loyalty (Hirschman 1970). 


Reflections on Hirschman 


This analysis suggests certain reformulations of Hirschman’s model. Therefore, | want to 
leave the reader with some questions that require additional discussion of the exit-voice 
typology. 

First, although numerous scholars have debated the question of exactly “what is loyalty, 
a contingency or an outcome,” several important questions about loyalty remain. Most 
importantly is an analytic question. From where does “loyalty” come? How should 
researchers disentangle loyalty that is linked with the organization from loyalty that is linked 
to self-selection, such as the family socialization found in the coal industry? The mining 
industry does not fit the classic exit-voice theory and also is the most “socialized” 
occupation, perhaps giving loyalty a different meaning — e.g., father-son transmission of 
cultural expectations. Thus, one must ask, how does self-selection into a company or 
industry — such as the coal miners in this study — coincide with the concept of loyalty? Are 
these workers pre-socialized to be so committed even before joining the company? If so, 
does that present a qualitatively different type of loyalty such that Hirschman’s model 
needs to re-interpret how loyalty could be in place before a worker joins the company? 
Alternatively, can loyalty be created within an organization (such as in the cooperatives)? 
The work of Hodson, for example, suggests that how managers treat workers — e.g., 
whether or not the managers show them respect — can create greater commitment and 
loyalty from workers. In this way, organizations may be able to create loyalty by how their 
managers choose to act towards their workers. For example, can the opportunity for voice 
create loyalty, just as loyalty might create the workers’ desire to voice their concerns? If 
loyalty can be created, is this created loyalty different from the loyalty that workers may 
have before joining the company? For example, do the relationships and commitment of 
the workers at Coop Cab and Organix Coop create a different type of loyalty from the loyalty 
of the coal miners? 

Second, how should scholars take into account the salience of the disputes? How does 
the degree of salience have an impact on Hirschman’s exit-voice thesis? Does the salience 
of the dispute interact with loyalty in ways that affect the workers’ exit-voice options? The 
whistle-blowing literature says that when employees resort to external authorities because 
their voice was ignored by their superiors (Rothschild and Miethe 1999), their concerns 
often involve high-salience disputes (see e.g., Perrucci, Anderson, Schendel and Trachtman 
1980). However other disputes, such as the example in this article of a cab driver stealing 
another driver's fare, are of less salience. These disputes are important to the individuals 
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involved, but they are not so salient as to transcend further. The statements by worker 
cooperative members, such as that by the woman at Organix Cooperative, that they would 
leave rather than raise a grievance that would destroy their confidence in the cooperative, 
indicate grievances with great salience. Such disputes with greater salience may speak to 
the workers’ loyalty to the movement or industry ideals rather than to the company itself. 
An important expansion of Hirschman’s thesis might address this issue of salience and its 
relationship to loyalty, as well as to the exit-voice choices. 

Third, how do the differences in exit and entry costs — in either a particular industry or 
company — affect decisions to exit vs. voice? How does the type of exit and entry cost 
matter? For example, how do psychological exit costs, such as the disinclination to leave a 
job that is a significant part of one’s identity, differ from monetary exit costs, such as the 
situation in a cooperative in which someone who left would have to forfeit any equity she 
or he had acquired in the company during her or his tenure? A reformulation of Hirschman’s 
model might develop a closer analysis of exit and entry costs that takes these differences 
into account. 


Implications for Application and Future Research 


Implications of these findings include some support for Hirschman’s exit-voice framework. 
The data demonstrate that workers with greater organizational loyalty are more likely to use 
voice when confronted with workplace problems. However, when workers ideological 
loyalty surpasses their organizational loyalty, and when these two loyalties are in conflict — 
such as if a cooperative no longer meets its egalitarian goals - even members with high 
loyalty might resort to exit. On an applied level, the findings imply that simply creating or 
finding employees with a high level of loyalty is not sufficient to ensure low turnover. 
Companies that use motivational techniques to pull employee support behind the 
ideological goals of the company may experience greater exit than companies who simply 
try to generate high employee enthusiasm for the company itself. 

Future research should explore how employees might attempt radical reform in 
organizations by using a whistle-blowing form of voice. By holding greater loyalty to their 
industry or to an ethic, than they may have to their organization, some employees might 
embrace whistle-blowing grievances. In doing so, they might significantly alter some 
organizations’ objectionable, yet customary, practices, thereby significantly changing how 
both their own organization and their industry do business. ; 

The research findings also imply that high exit costs, such as those found in the coal 
industry, will affect not only the decision to exit, but also the decision to voice. On a 
practical level, this underscores that no company or industry can be examined in isolation. 
The other employment options that are available to workers will have an impact on their 
dispute resolution strategies. If unemployment rates are high or the available job 
alternatives are unattractive, workers may be disinclined to choose exit. 

Future research might explore the questions posed above in industries that demand or 
create different types of loyalty. For example, some scholars have found that certain 
industries have an almost self-sacrificial type of loyalty, such as nursing and teaching 
professions (e.g., Abel and Nelson 1990; Grimwood and Popplestone 1993). This type of 
loyalty in the caring industries might or might not reflect the same patterns as those in the 
three industries studied here. 
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Abstract 

Inherent to all social exchange relations are elements of both cooperation and competition. 
We develop and test a theoretical model which proposes that the relative salience of the 
competitive, conflictual elements of exchange mediate and explain the negative effects of 
negotiated exchange, as compared with reciprocal exchange, on actors’ evaluations of 
fairness. By creating inequality within rather than across transactions, and by making 
relations between one actor’s gain and another’s cost more transparent, negotiated 
exchanges alter the relational context of exchange to one of competition and conflict and 
heighten actors’ sense of unfairness. Results of experimental tests show that (1) the salience 
of conflict increases and perceived fairness decreases as we make reciprocal exchanges more 
like negotiated exchanges on dimensions of conflict, (2) the salience of conflict mediates 
and explains the relation between the form of exchange and perceived fairness, (3) conflict 
affects fairness directly rather than through self-serving attributions, and (4) regardless of 
the relative conflict in reciprocal exchange, actors reciprocally exchange with unfair 
partners far more often than they negotiate agreements with them. 


The mutual dependence that underlies social exchange relations has two distinct faces: a 
cooperative face based on the mutual benefits that can be obtained from exchange and a 
competitive face that reflects the conflict of interests inherent in direct exchange (Blau 1964). 
This duality is common to all mixed-motive structures, including exchange, in which actors’ 
interests partially correspond and partially conflict. Both actors are better off with exchange 
than they would be without it, but at the same time, actors benefit in inverse proportion to 
what each gives the other. 

The relative emphasis on the cooperative or competitive aspects of social exchange can 
vary with both the objective structure of exchange and the subjective perception of exchange. 
We propose that the form of exchange - reciprocal or negotiated — affects both of these. 
Objectively, the joint negotiation of binding agreements makes the structure of negotiated 
exchange more cooperative than the structure of reciprocal exchange, in which actors 
individually give benefits to each other without negotiation. Subjectively, however, the process 
of negotiation can increase the salience of the conflictual, competitive face of exchange. 
These two consequences have been the basis for alternative predictions of the effects of the 
form of exchange on such integrative outcomes as commitment, solidarity, trust and affective 
ties (Lawler 2001; Molm, Takahashi and Peterson 2000, 2003). 

Here, we examine how the relative salience of the conflict in direct exchange relations 
affects actors’ perceptions of the fairness of their partner's behavior, the fairness of the 
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distribution of benefits and the fairness of the exchange process. While most sociological 
research on the fairness of exchange has focused on the cognitive processes of justice 
judgments (comparing inputs and outcomes, evaluating actual benefits against just benefits, 
weighting under-reward and over-reward), we propose that the relational context of exchange 
sets the stage for these judgments, affecting whether actors are concerned with justice and 
whether they respond to objective inequalities as unfair. More specifically, we propose that the 
more conflictual and competitive actors perceive their relationship, the more likely they are to 
perceive inequalities as unjust and their partners as unfair. And that perception is directly tied to 
whether actors’ exchanges have key characteristics of reciprocal or negotiated exchange. 

Previous work shows that actors in negotiated exchanges perceive their partner's behavior as 
more unfair than actors in reciprocal exchanges, even when the network structure, the partner's 
behavior and the inequality of exchange are equivalent (Molm et al. 2003). We develop and test 
an explanation of that finding, which links the form of exchange to perceived fairness through two 
sets of mediating variables: (1) the salience of conflict, and (2) the attribution of responsibility and 
intent, with the former affecting the latter. If the salience of conflict does indeed explain the 
differences between the two forms of exchange in perceived fairness, then /ncreasing the 
salience of conflict in reciprocal exchange should reduce the perception of fairness in reciprocal 
exchanges and reduce the differences between the two forms of exchange. 

We test this logic in an experiment that directly manipulates dimensions of reciprocal 
exchange to increase the salience of conflict in these exchanges, thus systematically 
increasing their comparability to negotiated exchange on these dimensions. We then 
measure intervening perceptions of conflict and attributions of responsibility and intent, and 
examine the extent to which they mediate, and thus explain, effects of the variations in 
exchange form on three targets of perceived fairness: the partner's behavior, the distribution 
of exchange outcomes and the exchange process. 


Theoretical Background 


Social Exchange, Inequality and Fairness 


Our analysis addresses the perception of fairness in direct exchange relations between two 
actors, A and B, each of whom controls resources that the other values. We assume A and B 
are individual persons, acting alone or representing a group, embedded in a larger network that 
provides each of them with alternative partners for obtaining valued resources. We also 
assume that A and B interact repeatedly over time. Examples are co-workers in an office and 
friendship networks. ; 

The exchanges between A and B may produce either equal or unequal benefits for the two 
actors. Unequal benefits usually result from unequal power, produced by differences in actors’ 
structural positions in exchange networks (Emerson 1972). Our interest here is not in structural 
determinants of inequality, however, but in the classic question of how actors evaluate the 
fairness or justice of their exchanges. As the literature on justice has evolved, scholars have 
identified three main targets of justice evaluations: outcomes (distributive justice), processes 
(procedural justice) and interaction partners (interactional justice). Our study examines all three. 

Distributive justice refers to how people evaluate the fairness of the reward distributions 
that result from exchange or allocations. This is the form of justice that most sociological 
theory and research have addressed. Evaluations of distributive justice are based on the 
justice expectations that actors bring to the relation from past experience or social norms 
(such as equity or equality) and on the socia/ comparisons that they make with others. Early 
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exchange theorists assumed that actors compare their outcomes (and inputs) to those of 
specific others (Adams 1965; Homans 1974); later theories made the target of comparison a 
more general “justice standard” and focused on the cognitive process of comparing actual 
outcomes to just outcomes (Berger et al. 1972; Jasso 1980). Attribution theorists argue that 
departures from just outcomes prompt further cognitive work, encouraging individuals to 
analyze the cause of injustice and to assign responsibility and intent (Utne and Kidd 1980). 

Theories of procedural justice, which developed later, address the fairness of the process 
or procedures through which outcomes are obtained (Leventhal, Karuza and Fry 1980; Lind 
and Tyler 1988; Thibaut and Walker 1975). Some procedural justice theorists argue that fair 
procedures are more fundamental than fair outcomes, and that procedural justice influences 
the perception of distributive justice. As the tradition has evolved, procedural criteria have 
expanded to include elements of both formal decision-making procedures (e.g., consistency, 
neutrality and voice) and interpersonal treatment (which conveys information about respect 
and status). Tyler and Lind’s (1992) relational model of procedural justice encompasses both. 

Recently, scholars of organizational justice have argued for a distinction between the 
fairness of formal decision procedures and the fairness of interpersonal treatment (Bies 2001; 
Cropanzano et al. 2001). These researchers use the term /nteractional justice to refer to the 
latter — fairness of the treatment that an actor receives from others. In organizational contexts, 
the target of interactional justice is often the supervisor; in the direct exchange processes that 
we study, the target is the exchange partner. 


Forms of Exchange 


We study how perceptions of fairness are affected by two forms of direct exchange: 
negotiated exchange and reciprocal exchange (Blau 1964; Emerson 1981; Molm, Peterson 
and Takahashi 1999). In negotiated exchange, actors jointly negotiate the terms of an 
agreement that gives both parties benefits of equal or unequal value. In the negotiated 
exchanges studied by exchange researchers, these agreements are also strictly binding; i.e., 
they automatically produce the benefits agreed upon (Cook and Emerson 1978; Markovsky, 
Willer and Patton 1988). In reciprocal exchange, actors individually perform beneficial acts for 
another (such as giving assistance or advice) without knowing whether or when the other will 
reciprocate (Molm et al. 1999). Exchange relations evolve gradually, as beneficial acts prompt 
reciprocal benefit (A assists B with work, B subsequently does a favor for A, and so on). 

These two forms of exchange vary on several theoretical dimensions, two of which are 
particularly important for fairness evaluations. The first is the contingency of actors’ outcomes 
on either joint action or individual action. In reciprocal exchange, each actor's outcomes are 
contingent solely on another's individual actions; i.e., As behavior produces benefits for B, 
and vice versa. Consequently, benefits can flow unilaterally: actors can initiate exchanges that 
are not reciprocated, and they can receive benefit from another or multiple others at the same 
time — without giving in return. An influential politician, for example, might enjoy favors from 
numerous supporters who may or may not benefit from the politician's patronage in the 
future. In contrast, when exchanges are negotiated, each actor's outcomes depend on the 
joint actions of self and other, and the flow of benefits is bilateral. 

Second, the two forms of exchange vary on the timing with which equality or inequality 
emerges; this difference follows directly from the first. In negotiated exchanges, each 
bilateral transaction provides either equal or unequal outcomes for actors. Thus, inequality 
occurs within transactions and results from differences in the va/ue of benefits exchanged (or 
the profit received), not from variations in the reciprocity of exchange (a given in bilateral 
transactions). When a buyer and seller negotiate the purchase of a used car, for example, both 
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presumably profit, to varying degrees, from the single bilateral transaction. In reciprocal 
exchanges, equality or inequality instead develops over time not on discrete transactions, and 
it is potentially determined by differences in both the rates at which actors reciprocate each 
other's giving and the va/ue of that reciprocity. Two co-workers, for example, might regularly 
help each other with both technical assistance and career advice, with the value and 
frequency of assistance and advice varying over time. 

These distinctions are the basis for the theoretical logic linking the form of exchange to 
perceived fairness. We develop that logic below and test it in a laboratory experiment. 


The Causal Model 


Figure 1 shows the basic logic of our theoretical argument. The causal sequence links the 
form of exchange to perceived fairness through two intervening mechanisms: the salience of 
conflict and attributions of causality. Below, we describe the logic underlying each of the 
paths in the model, and the objectives of our research. 


Figure 1. The Causal Model 


Form of Exchange ae Salience of Conflict —— oe of Fairness 
(Negotiated) 
ve 
Attributions 


The Form of Exchange and the Salience of Conflict 


The first path in the model is the most critical and our primary focus. The major theoretical 
issue is how and why the form of exchange affects the relative salience of the cooperative 
or competitive aspects of the two actors’ interests. Two theories predict opposing effects of 
the form of exchange on perceptions of relational conflict, based on the dimensions of form 
we identified above: the contingency of actors’ outcomes and the timing of inequality. The 
first dimension, the contingency of outcomes on joint or individual action, makes negotiated 
exchanges structurally more cooperative than reciprocal exchanges (Lawler 2001; Molm et al. 
1999). Joint action — negotiating an agreement — is required before either actor benefits, and 
after an agreement is reached, outcomes are also joint: exchanges are always bilateral with 
both actors obtaining some benefit (equal or unequal) from the agreement. Lawler and Yoon’'s 
(1993, 1996) studies of relational cohesion in negotiated exchanges and Lawler’s (2001) 
recent affect theory of exchange make a strong case for the greater perceived 
“cooperativeness” of negotiated exchange, based on this structural difference. Lawler and 
Yoon argue that actors’ repeated, successful negotiations produce positive emotions that are 
attributed, in part, to the relationship itself. Lawler’s (2001) affect theory of exchange further 
proposes that attributions to the relationship are more likely when individuals share 
responsibility for their outcomes, and that shared responsibility increases with “jointness of 
task” — the degree to which individual contributions to task outcomes are inseparable. 
Because jointness of task is greater in negotiated than in reciprocal exchange, Lawler predicts 
stronger perception of shared responsibility, stronger global emotions and stronger attribution 
of those emotions to the relationship in negotiated than in reciprocal exchange. 
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Molm and colleagues, on the other hand, have argued that negotiation heightens the 
salience of conflict in exchange relations (Molm et al. 2003). They propose that the same 
features that make negotiated exchange more structurally cooperative than reciprocal 
exchange — joint decision-making, the bilateral flow of benefits and the two-party unit created 
by the task of negotiation — also bring into sharp relief the inherent conflict of interests between 
actors in exchange relations. They do so by affecting the temporal emergence of inequality in 
the two forms of exchange, producing three differences that contribute to our prediction (in 
Figure 1) of greater salience of conflict in negotiated than in reciprocal exchanges. 

First, actors’ outcomes are more easily compared in the bilateral transactions of 
negotiated exchange than in the sequentially contingent, unilateral acts of reciprocal 
exchange. Because each negotiated transaction produces either equal or unequal outcomes 
for actors, success or failure is more obvious. In reciprocal exchanges, it is more difficult for 
actors to keep track of who fares better, or who owes whom, because equality or inequality 
develops over time. When outcome comparisons are easier and more obvious, actors’ 
awareness of the objective inequalities and their subjective reactions to them should 
increase, heightening feelings of competition and contflict. 

Second, in negotiated exchanges, actions of the two parties (offering less, bargaining 
harder, refusing to make concessions) are, by definition, acts of commission explicitly 
directed at the other person with the intent of obtaining a more favorable deal for the actor 
at the other's expense. In reciprocal exchanges, inequality can instead result from acts of 
omission — from not acting — rather than from tough bargaining. One friend regularly takes 
another out to dinner, for example, while the other reciprocates only occasionally. The 
interpretation of such acts of omission is often more ambiguous: If A reciprocates B's giving 
less often, is A deliberately paying less for B's favors or is the inequality between A and B an 
unanticipated side effect of As exchange with another actor? If the latter, B may be less likely 
to perceive As behavior as competitive and their relation as conflictual. 

Third, the relative costs of exchange are more transparent and the relation of one actor's 
gain to another's loss more direct in negotiated than in reciprocal exchange. In negotiated 
exchange, bilateral agreements favor one actor at another's expense; i.e., the more A offers 
B, the less A gains from the exchange. !n reciprocal exchanges, a comparable inequality is 
produced when actors reciprocate each other's giving at different rates. If disadvantaged 
actors must give more frequently to maintain their powerful partner's intermittent reciprocity, 
they pay more for the benefits they receive and their advantaged partner pays less. This 
difference in relative costs affects actors’ abilities to obtain benefits from other relations. If, for 
example, one spouse does most of the household work, the other will have more time to 
pursue career advancement or leisure activities. But because the relation between one actor's 
cost and the other's gain is more indirect in reciprocal exchanges, and operates through the 
relative opportunities that each has to invest in other rewarding activities, the “zero-sum” 
aspect of exchange — the more you get, the less | get — is likely to be less salient in reciprocal 
exchange than in negotiated exchange. 


The Salience of Conflict and Perception of Fairness: Direct and Indirect Paths 


The model in Figure 1 proposes that the salience of conflict affects perceived fairness both 
directly and indirectly. The indirect path links conflict to fairness through its effect on self- 
serving attributions; i.e., conflict increases the tendency to assign responsibility and intent to 
the partner for unsatisfactory outcomes, which in turn decreases perceived fairness. The 
direct path proposes that the salience of conflict decreases perceived fairness without the 
intervening role of attributions. 
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The indirect path, through attributions of responsibility and intent, is the route proposed by 
attribution theories of justice (Cohen 1982; Utne and Kidd 1980). These theories begin with 
the assumption that individuals search for attributional information to help explain 
unsatisfactory outcomes by inferring causality, responsibility and intent. These attributions are 
often self-serving; i.e., individuals are more likely to take credit for good outcomes while 
blaming their partner for poor outcomes (Nisbett and Ross 1980). The salience of conflict 
increases this tendency. The more actors perceive their relation as conflictual, the more likely 
they are to hold the other responsible for unsatisfactory outcomes and to perceive the other's 
behavior as intentional (Hegtvedt and Killian 1999; Thompson and Lowenstein 1992). On the 
other hand, the more cooperative actors perceive their relation to be, and the more mutual 
their interests, the less likely they are to make self-serving attributions (Lawler 2001). As the 
path linking attributions to fairness indicates, attributing responsibility and intent to another 
heightens the severity of felt injustices and increases the likelihood that disadvantaged actors 
will protest or take action to restore equity. Thus, attributions are linked theoretically to both 
the perception of fairness and behavioral reactions to perceived unfairness. 

A direct effect of conflict on fairness should occur to the extent that relational conflict 
increases actors’ sense of injustice (Deutsch 2000), i.e., actors’ tendency to perceive objective 
inequalities as unfair without mediating cognitions. Both Deutsch (1983) and Scher and Heise 
(1993) have criticized social psychological theories of justice for their strong emphasis on the 
individual, cognitive, information-processing aspects of justice evaluations. Deutsch (1983) 
criticizes the theories for focusing too much attention on the individual rather than the social 
interaction from which justice emerges, and for ignoring the relation between conflict and 
justice. Scher and Heise (1993) argue that emotional reactions, rather than cognitive processes, 
lead to judgments about fairness. These emotions arise from the social interaction itself in 
situations where justice concerns are at issue. Together, these arguments suggest a direct link 
between the salience of conflict and perception of injustice. By increasing an actor's sensitivity 
to inequality and injustice, the salience of conflict in the relation increases the likelihood that an 
actor will perceive an unequal exchange as unfair without necessarily attributing responsibility. 

We test these arguments on the three justice targets identified earlier: the exchange 
outcomes (distributive justice), exchange process (procedural justice) and exchange partner 
(interactional justice). Interactional justice bears the most direct relation to the logic of our 
causal model, particularly the indirect path through attributions: if actors perceive their 
partner's interests as conflicting with their own, and blame the partner for unequal 
exchanges, then the partner is the most logical target of feelings of unfairness. However, 
interactional justice and distributive justice are often closely linked; actors who perceive 
interactional injustice are also likely to believe their outcomes are unfair. Procedural justice 
bears the most direct connection to the form of exchange itself, that is, to the process by 
which actors exchange benefits. Its relation to the salience of conflict, however, is less 
obvious than for the other two forms of justice. 


The Research 


We test the causal model in Figure 1 in a controlled laboratory experiment that allows us to 
manipulate dimensions of reciprocal exchange that should affect the salience of conflict while 
holding constant the network structure, resources and context of exchange. In essence, we 
increasingly make reciprocal exchanges more like negotiated exchanges on the dimensions 
of conflict we have identified, and examine whether differences between the two forms of - 
exchange in perceived fairness decrease as we do so and whether those differences are 
explained by the intervening variables in the model. To create comparable distributions of 
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benefits across the forms of exchange, we use computer-simulated actors who are 
programmed to exchange with our real subjects in particular ways. 


Experimental Design 


The single factor in our experimental design is the form of exchange: negotiated exchange and 
three modes of reciprocal exchange. The three modes of reciprocal exchange systematically 
increase factors that should increase the salience of conflict, with conflict lowest when 
reciprocal exchanges of fixed value create inequality across (but not within) transactions, at 
indirect cost to the actor, and highest when reciprocal exchanges — like negotiated exchanges 
— create inequality within transactions of variable value at direct cost to the actor. Reciprocal 
exchanges of variable value and indirect cost should be intermediate in conflict. We study all 
four variations in the form of exchange under the condition of subject disadvantage to which 
our causal model theoretically applies.1 Forty undergraduate subjects were randomly assigned 
to one of the four conditions, with 10 subjects (five males and five females) in each condition. 


Experimental Procedures 


Subjects (S) engaged in either reciprocal or negotiated exchanges with two alternative 
partners (A and B) to whom they were connected in three-actor networks (Figure 2). The two 
partners were computer-simulated, allowing us to make subjects’ absolute and relative 
outcomes comparable across the different forms of exchange. The focal relation between S 
and A offered high but unequal exchange value; the alternative relation between S and B 
offered equal exchanges of low value. Subjects remained in the same network positions 
throughout the experiment and interacted repeatedly with the same two partners. Relations 
in the networks were negatively connected and operationalized in the traditional way (Cook 
and Emerson 1978): in negotiated exchanges, an agreement with one actor precluded an 
agreement with another on that exchange opportunity; in reciprocal exchanges, initiating an 
exchange with one actor precluded initiating exchange with another on that opportunity. 


Figure 2. The Exchange Network 


Note: $ = Subject; A and B are simulated actors. The solid line between S and A indicates a high-value 
relation; the dashed line between S and B, a low-value relation; and the dotted line between A and B, a 
hypothetical relation. 
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Subjects were seated in isolated rooms to maintain the deception that they were 
interacting with real persons. Following detailed instructions and practice trials, subjects 
participated in a series of exchange opportunities: 100 opportunities in the negotiated 
exchange experiment (the maximum that could be completed in a two-hour session), and 200 
opportunities in the reciprocal exchange experiment. At the end of each opportunity, they 
were informed about the source and amount of any points gained, and their total earnings 
were cumulated and shown on their computer screens. Negotiated and reciprocal exchange 
sessions took roughly the same amount of time (thus controlling for effort), and subjects were 
not informed of the number of opportunities in either. The monetary value of points was 
adjusted so that subjects in the reciprocal exchanges earned the same amount as subjects in 
the negotiated exchanges for comparable behaviors (i.e., 1 point = 1 cent in the reciprocal 
conditions and 2 cents in the negotiated conditions). 

In all conditions, subjects had full information about the size and shape of the network, the 
potential value of exchange in each of their relations, the number of points that they and their 
partners received from their exchanges, and who exchanged with whom on each opportunity. 
This information is necessary for making fairness judgments. Subjects did not know the 
potential value of exchanges between the two simulated actors or the outcomes of their 
(hypothetical) exchanges. By omitting this information, both the equal and unequal exchanges 
between S and S's two partners were made plausible. 

At the end of the exchange period, subjects responded to a post-experimental 
questionnaire that included measures of the dependent variables, and to an open-ended 
question asking them to describe any strategies used during the experiment. They were then 
paid the amount they had earned and debriefed. 


Manipulating the Form of Exchange 


Subjects engaged in one of four forms of exchange: reciprocal exchange with fixed value and 
indirect cost (our baseline reciprocal condition), reciprocal exchange with variable value and 
indirect cost, reciprocal exchange with variable value and direct cost, and negotiated 
exchange (which, by definition, has variable value and direct cost). The settings for the four 
forms were designed to be as comparable as possible on all dimensions other than their 
distinguishing differences. 

In the negotiated exchange setting, actors (real and simulated) negotiated the relative benefits 
that each would obtain from an agreement over a series of exchange opportunities. In the high- 
value relations, S and A could divide 24 points on each exchange; in the low-value relations, S and 
B could divide 4 points. Each opportunity consisted of up to five rounds of negotiation. On each 
round, all actors simultaneously made offers to both partners. Actors made offers by requesting 
the number of points they wanted to receive; an actor's request for points was then converted, 
by the computer, into an offer of the remaining points for the other, After the first round, actors 
could accept another's offer, repeat their last offer or make a counteroffer. Negotiations continued 
until two actors reached an agreement or the five rounds were up. As soon as an agreement was 
reached, both actors received the amounts agreed upon (agreements were binding) and were 
told how much the other actor received. If no agreement was reached, no points were received. 

In the three reciprocal exchange settings, each actor simultaneously gave points to one of 
his or her alternative partners, without communication, on each exchange opportunity. The 
subject was then informed that each of the others either added n points to the subject's 
earnings or did not act toward the subject, and the subject's cumulative points were updated. © 
The three versions of this form of exchange vary (1) whether the va/ue of benefits that an 
actor can give to an exchange partner on a single opportunity is fixed or variable (and thus 
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under the control of the actor), and (2) whether the relation between the partner's gain and the 
cost of exchange to the actor is direct or indirect. 

The first reciprocal exchange setting, reciprocal exchange with fixed value and indirect 
cost, is a replication of the form originally studied by Molm et al. (2003). In this baseline 
condition, the va/ue of the benefits that each actor can give another is fixed and equal to half 
the total points that actors in the negotiated setting can divide on each opportunity. The cost 
of exchange is indirect; i.e., giving to one partner reduces an actor's opportunity to invest in 
a potentially rewarding exchange with the other partner, but incurs no direct costs to the actor. 

In the second reciprocal exchange setting, variable reciprocal exchange with indirect cost, 
actors choose not only which partner to reward, but how much benefit to give from a range 
of points. Giving actors in reciprocal exchanges control over the value they give makes it 
possible for sequential acts of giving to produce unequal benefits for exchange partners, and 
this changes the partner's acts of inequality to acts of commission rather than merely 
omission. In the third reciprocal exchange setting, variable reciprocal exchange with direct 
cost, actors not only choose how many points to give to another from a range of points, but 
can keep any points not given to the partner. Points given to another triple in value for the 
other, while points kept for self remain the same. Thus, as in negotiated exchange, actors 
incur a direct cost when they give more to the partner (and keep less for self), which should 
make the zero-sum aspect of exchanges more transparent. 


Programming the Simulated Actors 


Unequal Exchanges with Partner A 

In all four conditions, As behavior was programmed to create unequal exchanges between 
Aand S in which S was disadvantaged. For every 14 points that A received from S, S received 
an average of 10 points from A.2 While this level of inequality was equivalent across the 
different forms of exchange, it was produced in different ways. 

In the negotiated exchanges, inequality was created by programming A to make an initial 
request of 14 to 16 points from S. After the first round, A decreased its request by 1 point (if 
its initial offer was not accepted) and then repeated that request on any subsequent rounds. 
Thus, after the first round, A requested an average of 14 points, thus offering 10 points to S. 

In the fixed reciprocal exchanges with indirect cost, inequality was created by manipulating 
the rate of A‘s reciprocity of S's giving, while holding constant the value of A's reciprocity. S 
and A could give 14 points to each other on each opportunity, and A was programmed to 
reciprocate S's prior giving only 10 of every 14 times.’ 

In both of the variable reciprocal exchange conditions, the opposite was true. Inequality was 
created by manipulating the va/ue of As reciprocity — the number of points with which A 
reciprocated S's giving — while holding constant the rate of A's reciprocity. A used a modified 
titfor-tat strategy, reciprocating S's prior giving 90 percent of the time and initiating giving 10 
percent of the time. This manipulation makes reciprocal exchanges more similar to negotiated 
exchanges, in which the rate of reciprocity is always 1.0 (all negotiated exchanges are bilateral) 
and inequality results from the relative value obtained within transactions. In the variable 
reciprocal exchanges with indirect cost, S and A could give each other 10 to 18 points on each 
opportunity (a mid-point of 14), and A always gave 4 points fewer to S than S gave to A on the 
previous opportunity within the constraints of range. This meant that the actua/ range of As 
giving was controlled to vary from 10 to 14, as in the negotiated conditions, while S could give 
Aup to 18 points.‘ In the variable reciprocal exchanges with direct cost, S and A could give each 
other up to 6 points per opportunity (with points tripled in value when given), and A always 
reciprocated S's giving with one point fewer than S gave on the previous opportunity. Because 
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points given were tripled in value for the other, this pattern meant that exchanges between S 
and A always produced 4 points fewer for S than for A (with no constraints of range).° 


Equal Exchanges with Partner B 

B's behavior was programmed to create equal, low-value exchanges between S and B in all 
conditions. In the negotiated exchanges, B requested 2 to 4 points on the first round, lowered 
its request by 1 point on the second round, and repeated that request on any subsequent 
rounds. In the reciprocal conditions, B responded to S’s prior behavior with the modified tit- 
for-tat strategy that A used in the variable-value conditions. In the fixed-value exchanges, S 
and B could give each other 2 points each opportunity; in the variable-value exchanges with 
indirect cost, 1 to 3 points (a mid-point of 2); and in the variable-value exchanges with direct 
cost, each could give or keep 2 points (with points tripled if given). In the variable-value 
conditions, B matched S's giving on the previous opportunity. 


Measures 


At the conclusion of the experimental session subjects responded to a series of seven-point 
bipolar semantic differential scales, measuring evaluations of their exchange partners, the 
distribution of exchange benefits and the process of exchange. Subjects’ evaluations of their 
focal partner, A, and their relationship with A, provided the primary data for developing 
measures of perceived fairness and mediating perceptions of conflict and attribution. 


Perceptions of Fairness 
We derived a measure of our primary dependent variable — perception of interactional fairness 
—from three items asking subjects to evaluate the fairness of As behavior toward the subject 
as unfair/fair, unjust/just, and inequitable/equitable. Subjects’ scores on the three items were 
averaged to form a scale with a potential range of 1 (unfair) to 7 (fair) and a neutral midpoint 
of 4 (Cronbach's a = .89). 

Perception of distributive fairness was measured by asking subjects, “Over the course of 
the experiment, do you think the distribution of points between you and Person A was fair or 
unfair?” Responses could range from 1 (unfair) to 7 (fair) with a neutral midpoint of 4. 

Perception of procedural fairness was measured by subjects’ responses to the following 
item (with wording varied for negotiated or reciprocal exchange): “In the experiment you 
earned money [by receiving points from other participants, who in turn earned money by 
receiving points from you/ by negotiating agreements with other participants that determined 
how many points each of you received]. How fair or unfair do you think that process for 
earning money was?” Scores again ranged from 1 (unfair) to 7 (fair). 

We also analyze exchange frequency — the proportion of exchange opportunities on which 
subjects made agreements with A (negotiated exchange) or gave points to A (reciprocal 
exchange) - as a behavioral indicator of perceived fairness. Because exchange with A was far 
more profitable for S than exchange with B, even in the disadvantaged conditions, subjects 
who refused to exchange with A incurred significant costs. We interpret this refusal as both 
a behavioral indicator of subjects’ negative feelings toward A and as an effort to restore 
fairness in one of the only ways available — by not engaging in unfair exchanges. 


Mediating Variables 
Two items assessed the salience of conflict to the subjects. One item, on relational conflict, 
asked subjects to describe their relationship with A as harmonious or conflictual. The second 
item, On partner competitiveness, asked subjects to describe A’s behavior toward them as 
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cooperative or competitive. High scores on either item indicate greater salience of the relation 
or the partner as conflictual and competitive. 

A series of three successive items provided data for measures of attributions about 
responsibility and intent. The series began by asking subjects’ perception of the inequality of 
their exchange: “How equal or unequal were the total points that you and A received from each 
other: very unequal in your favor, very unequal in A's favor or somewhere in between? (If you 
think the points were equal, mark 4 on the scale.) A score of 1 indicated inequality in S's favor; 
a score of 7, in As favor. The second item asked subjects for their attribution of responsibility for 
the inequality: “Who would you say was more responsible for how equal or unequal the points 
were that you and A received from [your agreements/each other]: you, A or both of you? (If you 
think you were both equally responsible, mark 4 on the scale.)” A score of 1 indicated primary 
attribution to self; a score of 7, to A. The third item assessed subjects’ attribution of intent to As 
behavior: “To the extent that A's actions affected the equality or inequality of the points you 
received, how intentional or unintentional do you think Person A‘s actions were?” A score of 1 
indicated that A's actions were unintentional; a score of 7, intentional. 


Results 


Replication and Extension of Original Findings 


Table 1 reports the means and standard deviations for all four dependent variables by 
experimental condition. We first compare the means for the forms of exchange originally 
studied by Molm et al. (2003) to see if we replicate the finding that actors engaged in 
reciprocal exchanges with fixed value and indirect cost perceive their partners as significantly 
more unfair than actors engaged in negotiated exchanges. The means for these conditions 
are shown in the first and fourth columns of Table 1. The results of t-tests comparing mean 
interactional fairness confirm the findings of Molm et al. (2003): Actors engaged in unequal 
fixed-value reciprocal exchanges perceive their partners as significantly fairer than actors 
engaged in unequal negotiated exchanges (t = 3.32, d.f. = 18, p < .005). The mean difference 
is even larger than that originally found, close to 2 pts. (1.93) on the seven-point scale. The 
form of exchange also has a significant, although somewhat smaller, effect on distributive 
fairness (t = 2.23, d.f. = 18, p < .05), but no effect on procedural fairness (t = .80). Our results 
also show the marked differences in exchange frequency that were originally observed for the 
two forms of exchange (t = 4.27, d.f. = 18, p < .001). Again, the differences are even greater 
than those reported in Molm et al. (2003); subjects exchanged with their partner twice as 
often in the reciprocal as in the negotiated exchanges. 


Increasing the Salience of Conflict in Reciprocal Exchange 


Our two new conditions — reciprocal exchange with variable value and indirect cost, and 
reciprocal exchange with variable value and direct cost — were designed to increase the 
salience of conflict and reduce the differences in perceived fairness between negotiated and 
reciprocal exchange. We examine the effects of each of the new conditions, in turn, and then 
analyze the relation between all four forms of exchange and perceived fairness. 

Comparing the means in Table 1 shows that making the value of reciprocal exchange 
variable and under the control of the actor while costs remain indirect (column 2) decreases 
the original differences between negotiated exchange (column 4) and reciprocal exchange 
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Table 1: Means and Standard Deviations for Subjects’ Perceived Fairness and Exchange 
Frequency, by Form of Exchange 





Reciprocal Exchange 
Fixed Value/ Variable Value/ Variable Value/ = Negotiated 


Subjects’ Responses _— Indirect Cost _Indirect Cost Direct Cost Exchange 


Perceived Fairness 


Interactional Fairness 4.73 3.83 3.50 2.80 
(1.13) (1.56) (1.91) (1.45) 

Distributive Fairness 4.00 3.40 3.10 2.40 
(1.89) (1.78) (1.91) (1.26) 

Procedural Fairness 4.70 3.40 4.50 4.30 
(1.25) (1.43) (2.17) (.95) 

Exchange Frequency ° 69 60 de .34 
(.17) (.15) (.13) (.19) 





Note: Numbers in parentheses are standard deviations. 
“Mean proportion of opportunities on which S exchanged with A. 


with fixed value (column 1) by 47 percent for interactional fairness and 38 percent for 
distributive fairness. Consequently, the mean differences between negotiated exchange and 
this new reciprocal condition in interactional and distributive fairness are reduced to 
nonsignificance. Procedural fairness continues to be unaffected by the form of exchange. 
While introducing variable value to reciprocal exchange reduces perceived interactional and 
distributive fairness, however, it does not reduce the frequency with which subjects 
exchanged with A, which is still .60 in the new reciprocal condition. Consequently, the form of 
exchange (negotiated versus reciprocal with variable value and indirect cost) still has a 
significant effect on exchange frequency (t = 3.28, d.f. = 18, p < .01). 

Our third mode of reciprocal exchange adds a direct cost of giving more benefits to the 
partner. As the means in Table 1 show, adding direct cost to the variable-value reciprocal 
exchanges (column 3) further lowers evaluations of interactional fairness and distributive 
fairness, but by a smaller amount. The original differences between negotiated and reciprocal 
exchange are reduced by an additional 17 to 18 percent for interactional and distributive 
fairness, respectively. Increasing the salience of conflict continues to produce no systematic 
pattern for procedural justice and no effect on the frequency of reciprocal exchange, which 
remains quite high at .72. 

The graphs in Figure 3 summarize the relations between the form of exchange and our 
four dependent variables for all four conditions. The graphs for interactional and distributive 
fairness (Figures 3a and 3b) show consistently ordered relationships with perceived fairness 
decreasing as features are added that should increase the salience of conflict. Polynomial 
contrasts confirm significant linear relations between form and interactional fairness (F[1,36] 
= 7.95, p < .01) and between form and distributive fairness (F[1,36] = 4.35, p < .05), with no 
deviation from linearity. In contrast, no pattern is evident for procedural fairness (Figure 3c); 
reciprocal exchange with variable points and indirect cost appears to be perceived as less 
procedurally fair than the other three conditions, but none of the differences between 
conditions are significant. Figure 3d shows a nonlinear relation between form and exchange 
frequency; none of the reciprocal conditions differ from one another, but all three differ 
significantly from negotiated exchange (op < .01 on Scheffe’s test). 
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Tests of the Causal Mechanisms 


Finally, we test whether the salience of conflict and attributions of responsibility and intent 
mediate the significant linear relations shown in Figure 3a and 3b between exchange form 
and two of our four dependent variables: interactional fairness and distributive fairness.® 

Table 2 shows subjects’ mean responses to the potential mediating variables: 
perception of relational conflict, perception of partner's competitiveness, perception of 
inequality in points, attributing responsibility for inequality to the partner and judging the 
partner's actions as intentional. All of the means increase as we move from the original 
reciprocal exchange condition with fixed value to negotiated exchange with one exception: 
Subjects’ perception of inequality does not vary with the form of exchange (F[3,36] = .67, p 
= .42). Across all four conditions, subjects correctly perceived that their exchanges with A 
were unequal in A's favor. 


Table 2: Means and Standard Deviations for Subjects’ Salience of Conflict and Self-Serving 
Attributions, by Form of Exchange 


Reciprocal Exchange 
Fixed Value/ Variable Value/ Variable Value/ Negotiated 


Subjects’ Responses — Indirect Cost _Indirect Cost Direct Cost Exchange 
Salience of Conflict 
Relation Conflictual 3.20 3.50 3.70 4.70 
(.63) (1.35) (1.89) (1.06) 
Partner Competitive 4.00 4.30 5.10 6.50 
(1.76) (1.77) (2.64) (.71) 
Self-Serving Attributions 
Points Unequal 5.50 5.80 5.40 6.10 
(1.18) (1.03) (1.65) (.88) 
Partner Responsible 4.10 4.20 4.50 5.30 
(2.08) (2.62) (2.07) (1.34) 
Partner Intentional 5.30 5.80 5.90 6.10 


(1.06) (.92) (1.66) (1.20) 


Note: Numbers in parentheses are standard deviations. 





Within conditions, however, subjects’ perceptions of inequality did vary. Theoretically, 
attributing responsibility and intent to the partner should have a negative effect on fairness 
only to the extent that the partner is perceived as the cause of unfavorable outcomes; 
consequently, we give greater weight to these attributions by multiplying them by subjects’ 
perception of the extent to which the point distribution favored A. The weighted attributions 
have a potential range of 1 (1 X 1) to 49(7 X 7). 

To test the causal model shown in Figure 1, we make the simplifying assumption that 
our manipulations of the form of exchange represent four points on an underlying, 
continuous variable (with reciprocal exchange with fixed value anchoring the low end, and 
negotiated exchange the high end, of the variable). This assumption allows us to use 
ordinary least squares analysis to test each of the paths in the model. The assumption of 
interval data is supported by the significant linear relations between our four variations in 
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Figure 3. Relations between the Form of Exchange and Four Dependent Variables 
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(a) 


Distributive Fairness 


Reciprocal Fixed Reciprocal Variable — Reciprocal with Cost Negotiated 


(b) 


Conflict and Fairness * 2345 


Figure 3. Relations between the Form of Exchange and Four Dependent Variables (continued) 
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form and perceptions of interactional and distributive fairness. Polynomial contrasts also 
show significant linear relations between form and relational conflict (F[1,36] = 6.38, p < 
.02) and between form and partner competitiveness (F[1,36] = 10.04, p < .005), the conflict 
variables that are proximal to exchange form in our causal model. 

Table 3 shows the regression coefficients for the first path in the model, the effects of 
the form of exchange on the salience of conflict. As the form of exchange acquires more of 
the characteristics of negotiated exchange, our model predicts that the salience of conflict 
should increase. The results show that the farm of exchange significantly affects both 
conflict measures, with effects on partner competitiveness the stronger of the two. Giving 
actors the ability to reciprocate with varying value and adding a direct cost to giving greater 
value to the other heighten the salience of conflict as predicted. 


Table 3: Unstandardized OLS Coefficients from the Regression of Salience of Conflict on the 
Form of Exchange 


Salience of Conflict 
Independent Variable Relation Conflictual Partner Competitive 


Form of Exchange ae Oo 
(.18) (.26) 
[.38] [.46] 
R KD 21 


Note: Numbers in parentheses are standard errors; numbers in brackets are 
standardized coefficients; N = 40. 
*p <.01 ***p < .001 (one-tailed tests) 


We next examine the paths leading to self-serving attributions. Our causal model 
predicts stronger attribution of responsibility and intent to the partner as the form of 
exchange takes on more characteristics of negotiated exchange, with this relation 
mediated by the increasing salience of conflict produced by the form of exchange. To test 
these predictions, Table 4 shows the result of regressing our two weighted attributional 
measures (attribution of responsibility and intent) on the form of exchange (Model 1) and on 
the form of exchange and the salience of conflict (Model 2). Both effects of form are in the 
predicted direction in Model 1, but only the effect on attribution of intent to the partner is 
significant (p < .05); the effect on responsibility is borderline (o = .07). With both conflict 
measures included in Model 2, these modest effects of form disappear completely, 
indicating that the effect of form on self-serving attributions is mediated by the salience of 
conflict, as our model predicts. Only perception of the partner's competitiveness is 
significant, but its relation to the two attributions is quite strong. Perception of the partner 
as competitive increases the attribution of intent and responsibility to the partner for the 
inequality of exchange outcomes. 

Finally, Table 5 shows the result of regressing our two dependent variables — evaluations 
of interactional fairness and distributive fairness — on all prior variables in the causal model. 
Successively adding the variables in each path of the causal model shows that (1) the form 
of exchange significantly affects perceptions of both interactional fairness and distributive 
fairness (Model 1); (2) with the salience of relational conflict and partner competitiveness 
controlled, the significance of the form of exchange disappears, indicating that the form of 
exchange affects perceived fairness through its effects on the salience of conflict (Model 2): 
and (3) adding controls for the attribution of responsibility and intent does not significantly 
alter these relationships (Model 3). Thus, this analysis strongly supports the mediating effects 
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Table 4: Unstandardized OLS Coefficients from the Regression of Self-Serving Attributions on 
the Form of Exchange and the Salience of Conflict 


Weighted Attributions* 


Partner Responsible Partner Intentional 
Independent 
Variables Model 1 Model 2 Model 1 Model 2 
Form of Exchange 2.90 42 peiey a) 
(1.96) (2.13) (1.45) (1.43) 
[.23] [-.00] [.29] [.04] 
Relation Conflictual _— -.02 — -.66 
(1.77) (1.19) 
[-.00] [-.09] 
Partner Competitive — 3.00** — Sor 
(1.26) (.85) 
[.43] [.62] 
R .06 el) .09 36 


Note: Numbers in parentheses are standard errors; numbers in brackets are 


standardized coefficients; N = 40. 
“Weighted by subject’s perception of the extent to which the inequality of points 
favored the partner. 


Tp<.10 *p<.05 **p<.01 ***p < .001 (one-tailed tests) 


Table 5: Unstandardized OLS Coefficients from the Regression of Interactional and Distributive 
Perceived Fairness on All Independent Variables 





Interactional Fairness Distributive Fairness 
Independent Variables Model1 Model2 Model3 Model’ Model2 Model 3 
Form of Exchange -.61** -.06 -.05 -.51* ny ead 


Bazi ror) zane ree een 
ey CA ee pat sap pe ey 


Relation Conflictual _ 440 =40° _ -.36** -.38"* 
(.14) (.14) (.14) (.14) 
[-.37] [-.38] [-.26] [-.30] 
Partner Competitive “= AL = 0 — Ol ce Oo. 
(.10) (.12) (.10) (.12) 
[-.52] [-.38] [-.70] [-.64] 
Partner Responsible — — -.01 _ — .00 
(weighted) (.01) (.01) 
[-.08] [.04] 
Partner Intentional — — -.03 — _— -.02 
(weighted) (.02) (.01) 
[-.17] [.04] 
R’ 18 63 66 11 .69 10 


Note: Numbers in parentheses are standard errors; numbers in brackets are standardized 


coefficients; N = 40. 
*p<.05 *p <.01 ***p < .001 (one-tailed tests) 
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of the salience of conflict on the relation between the form of exchange and perceptions of 
interactional and distributive fairness, but shows that the effect of conflict on fairness Is direct 
and not mediated by self-serving attributions. Attributions of responsibility and intent affect 
perceptions of fairness (zero-order correlations between both variables and perceptions of 
fairness range from .36 to .54, p < .05 to < .001) as attributional theorists predict, but they do 
not mediate the effects of either the form of exchange or the salience of conflict on fairness.’ 


Conclusions 


Four main conclusions emerge from our study. First, the conflict that is inherent in all 
exchange relations becomes more salient as the form of exchange acquires more of the 
characteristics of negotiated exchange — characteristics that bring actors face to face with the 
inverse relation between their own interests and those of their partner. Second, the salience 
of conflict mediates and explains the relation between the form of exchange and perceptions 
of both the fairness of the partner's behavior and the fairness of the distribution of benefits. 
Reciprocal exchanges are perceived as more interactionally and distributively fair than 
negotiated exchanges because they are perceived as less conflictual and competitive. Third, 
the salience of conflict affects fairness directly rather than indirectly through individuals’ 
tendencies toward self-serving attributions. And fourth, increasing the salience of conflict in 
reciprocal exchange does not affect the difference in actors’ behavioral responses to 
inequality in reciprocal and negotiated forms of exchange. Actors are far less willing to enter 
into unequal negotiated agreements than to participate in unequal acts of reciprocal 
exchange, however that inequality is produced. 

These findings have important implications for theories of exchange, theories of justice and 
exchanges in natural settings. First, our study clearly speaks to current debates over the relative 
cooperativeness or competitiveness of different forms of exchange and their effects on bonds of 
affect and solidarity in exchange relations (Lawler 2001; Molm et al. 2000, 2003). We show that 
despite the greater structural cooperativeness of negotiated exchanges, they are perceived as 
less cooperative and more conflictual than reciprocal exchanges. We identified and tested three 
features of negotiated exchange that potentially contribute to this perception: the establishment 
of inequality within bilateral transactions rather than across sequential transactions, the partner's 
more active and unambiguous role in producing inequality, and a more direct and transparent 
relation between one actor's cost and another's benefit. Our research shows that altering 
reciprocal exchanges to have more of these characteristics — while maintaining the fundamental 
distinction between the individual, non-negotiated acts of reciprocal exchange and the jointly 
bargained agreements of negotiated exchange — does indeed increase actors’ sense of unequal 
exchanges as conflictual and competitive and, consequently, unfair. 

Our manipulations did not affect the greater structural cooperativeness of negotiated 
exchanges; all of our comparisons preserved that difference. But rather than reducing the 
sense of conflict in negotiated exchanges, we propose that the cooperative structure of 
negotiation may instead have the opposite effect. By providing a stronger moral framework 
for caring about injustice, the cooperative elements of negotiated exchanges may actually 
increase the likelihood that actors will take action to restore justice even at high cost to self. 
If so, the more cooperative structure of negotiated exchange may account for the high rates 
at which disadvantaged actors refused to negotiate agreements with an unfair partner. 
Making an agreement with another is a cooperative act and one that is fundamentally 
inconsistent with perception of another as competitive and unfair. Participants in negotiated 
exchanges were willing to accept significant cost to self to avoid exchanging with an unfair 
partner, while participants in reciprocal exchanges continued to engage in such exchanges 
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at far higher rates. Second, our results have potential implications for theories of justice, 
particularly attributional theories. In contrast to theories that argue that the relative 
competitiveness or cooperativeness of the relation should affect perceived fairness through 
its effect on self-serving attributions (Thompson and Lowenstein 1992), we find no evidence 
of mediation by attribution of responsibility or intent. Furthermore, the salience of conflict 
does not affect cognitive perceptions of objective inequality. Disadvantaged subjects in all 
forms of exchange correctly perceived the distribution of benefit as unequal in their partner's 
favor, but perceptions of inequality translated into perceptions of unfairness only as the 
conflict in the relation became more salient. Attributing responsibility and intent for unequal 
outcomes to the partner does increase perceptions of the partner and the outcome 
distribution as unfair (Cohen 1982), but these attributions do not mediate the relation between 
the salience of conflict and fairness. This finding suggests that the features of the exchange 
process that enhance the conflictual elements of exchange affect perceived fairness through 
a more direct process, not by prompting an attributional search for the causes of inequality. 
Heightened conflict creates a relational context in which inequality is more likely to be 
perceived as unfair, regardless of attributions of responsibility or intent. 

Our findings also support Bies’ (2001) call for distinguishing between the fairness of decision 
procedures and the fairness of interpersonal treatment; clearly, our subjects perceived the two 
quite differently, with the form of exchange strongly affecting the latter but not the former. The 
lack of patterned results for procedural fairness may reflect the conflict that subjects faced in 
evaluating the procedures in the different forms of exchange. On the one hand, negotiated 
exchanges meet more of the criteria of procedural fairness such as joint decision-making, 
communication of offers and counteroffers, and advance knowledge of terms (Molm et al. 
2003). On the other hand, the same procedures that ostensibly make negotiated exchanges 
fairer also increase the salience of conflict and actors’ negative affect toward one another. The 
different responses may also reflect the different loci of control involved in the two types of 
justice (i.e., the process by which subjects exchanged benefits was dictated by the 
experimenter, while interpersonal treatment was under the partner's control). 

In natural settings, exchanges vary on the same kinds of characteristics that we studied in the 
laboratory. The prototypical form of reciprocal exchange that served as our baseline, in which actors 
exchange benefits of fixed value and inequality emerges primarily from different rates of reciprocity 
that impose different opportunity costs on participants, describes many of our everyday exchanges: 
helping a neighbor with a discrete task, giving a co-worker advice on a problem, accompanying our 
spouse to his or her office party. Exchanges tend to have these characteristics when the resources 
exchanged are not divisible on some continuous scale and the primary cost is the time invested in 
the act of benefitting another. As our research shows, inequalities in these kinds of exchanges, 
produced by differences in the frequencies with which individuals perform acts that benefit one 
another, are less likely to be evaluated as unfair even if they are perceived as objectively unequal. 

When the resources exchanged become divisible and quantifiable so that individuals can 
_ easily compare and actively control the value of their reciprocity, fairness becomes more of an 
issue. To the extent that inequality occurs on discrete, identifiable transactions, inequality is 
more likely to be perceived as unfair. And when those characteristics are combined with the 
cooperative structure of formally negotiated exchanges, concerns with the fairness of the 
transaction can actually dominate behavior and lead to actions that help neither party. This has 
important implications for the widespread use of negotiation in economic and political 
transactions. While the process of negotiation is objectively fair, unintended effects on 
subjective feelings of injustice may affect transactions and relationships in undesirable ways. 
Our research suggests that one way to counter these negative effects is to embed negotiated 
transactions in social relationships that include reciprocal exchanges on other dimensions 
(Granovetter 1985). Another alternative is to conduct negotiations with third-party mediators 
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who may reduce the salience of conflict by making bargaining less direct (Lind et al. 1993). 
Both of these options have received considerable attention in the organizational literature, and 
both could improve exchange outcomes by reducing the salience of relational conflict. 


Notes 


1. We also studied two of our four forms of exchange — negotiated exchange and reciprocal 
exchange with variable value and indirect cost — under conditions in which the subject was 
equal or advantaged in exchange. Differences between negotiated and reciprocal 
exchange on perceived fairness tended to decrease as exchanges became more favorable 
to the subject, as we expected, but none of the interactions between form and level of 
inequality reached statistical significance. In the interests of space, we have omitted these 
conditions from the results reported here. 


2. This fairly moderate level of inequality was based on previous research by Molm et al. 
(2003). Higher levels of inequality produce substantially lower frequencies of exchange in 
the negotiated conditions, but have little effect on perceived fairness in either reciprocal 
or negotiated exchange. 


3. To create a ratio of 10 gives from A for every 14 gives from S, the probability of A’s giving 
if S gave on the previous opportunity was .64, and the probability of A's giving if S did not 
give was .07. These conditional probabilities produce an expected outcome of 
approximately 10 points per opportunity [(14 x .64) + (14 x .07) = 9.94]. 


4. Analyses were conducted with and without controls for the number of points that S gave 
to Ain this condition; results were essentially the same. 


5. For example, if S gave 4 points to A, keeping 2 for self, then on the next opportunity A 
gave 3 points to S, keeping 3 for self. S's total points for that exchange were [2 + (3 x 3)] 
= 11, and A's total points were [3 + (4 x 3)] = 15. At the midpoint of the 1 to 6 point 
range, the exchange ratio was 10:14, as in the other conditions. 


6. Because we found no systematic relation between form and procedural fairness, and 
because the relation between form and exchange frequency is nonlinear, we do not test 
our causal model on these two dependent variables. 


7. We also tested the interaction terms between exchange form and each of the mediating 
variables for the relevant models; adding the interaction terms did not significantly increase 
R? for any of the models. In addition, we replicated the regression analyses in tables 3-5 
with the two reciprocal conditions with variable value (and either indirect or direct cost) 
combined as a single point on the continuum; the pattern of results was identical. 


References 
Adams, J. Stacy. 1965. “Inequity in Social Exchange.” Advances in Experimental Social Psychology 2:267-99. 
Berger, Joseph, Morris Zelditch, Jr., Bo Anderson and Bernard P Cohen. 1972. “Structural Aspects of 


Distributive Justice: A Status Value Formulation.” Pp. 119-46. Sociological Theories in Progress. 
Joseph Berger, Morris Zelditch, Jr. and Bo Anderson, editors. Houghton-Mitflin. 


Conflict and Fairness * 2351 
Bies, Robert S. 2001. “Interactional (In)Justice: The Sacred and the Profane.” Pp. 89-118. Advances in 
Organizational Justice. Jerald Greenberg and Russell Cropanzano, editors. Stanford University Press. 


Blau, Peter M. 1964. Exchange and Power in Social Life. Wiley. 


Cohen, Ronald L. 1982. “Perceiving Justice: An Attributional Perspective.” Pp. 119-60. Equity and Justice in 
Social Behavior. Jerald Greenberg and Ronald L. Cohen, editors. Academic. 


Cook, Karen S., and Richard M. Emerson. 1978. “Power, Equity and Commitment in Exchange Networks.” 
American Sociological Review 43:721-39. 


Cropanzano, Russell, Zinta S. Byrne, D. Ramona Bobocel and Deborah E. Rupp. 2001. “Moral Virtues, 
Fairness Heuristics, Social Entities, and Other Denizens of Organizational Justice.” Journal of 
Vocational Behavior 58:164-209. 


Deutsch, Morton. 1983. “Current Social Psychological Perspectives on Justice.” European Journal of Social 
Psychology 13:305-19. 


. 2000. “Justice and Conflict.” Pp. 41-64. The Handbook of Conflict Resolution: Theory and Practice. 
Morton Deutsch and Peter T. Coleman, editors. Jossey-Bass. 


Emerson, Richard M. 1972. “Exchange Theory, Part Il: Exchange Relations and Networks.” Pp. 58-87. 
Sociological Theories in Progress. Vol. 2. Joseph Berger, Morris Zelditch, Jr. and Bo Anderson, 
editors. Houghton-Mifflin. 


. 1981. “Social Exchange Theory.” Pp. 30-65. Social Psychology: Sociological Perspectives. Morris 
Rosenberg and Ralph Turner, editors. Basic Books. 


Granovetter, Mark. 1985. “Economic Action and Social Structure: The Problem of Embeddedness.” 
American Journal of Sociology 91:481-510. 


Hegtvedt, Karen A., and Caitlin Killian. 1999. “"Fairness and Emotions: Reactions to the Process and 
Outcomes of Negotiations.” Socia/ Forces 78:269-302. 


Homans, George C. 1974. Social Behavior: Its Elementary Forms. Harcourt Brace Jovanovich. 
Jasso, Guillermina. 1980. “A New Theory of Distributive Justice.” American Sociological Review 45:3-32. 
Lawler, Edward J. 2001. “An Affect Theory of Social Exchange.” American Journal of Sociology 107:321-52. 


Lawler, Edward J., and Jeongkoo Yoon. 1993. “Power and the Emergence of Commitment Behavior in 
Negotiated Exchange.” American Sociological Review 58:465-81. 


1996. “Commitment in Exchange Relations: Test of a Theory of Relational Cohesion.” American 


Sociological Review 61:89-108. 


Leventhal, Gerald S., Jurgis Karuza, Jr. and William Rick Fry. 1980. “Beyond Fairness: A Theory of Allocation 
Preferences.” Pp. 167-218. Justice and Social Interaction. Gerald Mikula, editor. Springer-Verlag. 


Lind, E. Allan, Carol T. Kulik, Maureen Ambrose and Maria V. de Vera Park. 1993. “Individuals and Corporate 
Dispute Resolution: Using Procedural Justice as a Decision Heuristic.” Administrative Science 
Quarterly 38:224-51. 


Lind, E. Allan, and Tom R. Tyler. 1988. The Social Psychology of Procedural Justice. Plenum. 


Markovsky, Barry, David Willer and Travis Patton. 1988. “Power Relations in Exchange Networks.” American 
Sociological Review 53:220-36. 


Molm, Linda D., and Karen S. Cook. 1995. “Social Exchange and Exchange Networks.” Pp. 209-35. 
Sociological Perspectives on Social Psychology. Karen S. Cook, Gary Alan Fine and James S. House, 
editors. Allyn and Bacon. 


2352 ¢ Social Forces Volume 84, Number 4 ¢ June 2006 we 
Molm, Linda D., Gretchen Peterson and Nobuyuki Takahashi. 1999. “Power in Negotiated and Reciprocal 
Exchange.” American Sociological Review 64:876-90. 


Molm, Linda D., Nobuyuki Takahashi and Gretchen Peterson. 2000. “Risk and Trust in Social Exchange: An 
Experimental Test of a Classical Proposition.” American Journal of Sociology 105:1396-1427. 


. 2003. “In the Eye of the Beholder: Procedural Justice in Social Exchange.” American Sociological 
Review 68:128-52. 


Nisbett, Richard E., and Lee Ross. 1980. Human Inference: Strategies and Shortcomings of Social Judgment. 
Prentice-Hall. 


Scher, Steven J., and David R. Heise. 1998. “Affect and the Perception of Injustice.” Pp. 223-52. Advances in Group 
Processes. Vol. 10. Edward J. Lawler, Barry Markovsky, Karen Heimer and Jody O'Brien, editors. JAI. 


Thibaut, John W., and Laurens Walker. 1975. Procedural Justice: A Psychological Analysis. Erlbaum. 


Thompson, Leigh, and George Lowenstein. 1992. “Egocentric Interpretations of Fairness and Interpersonal 
Conflict.” Organizational Behavior and Human Decision Processes 51:176-97. 


Tyler, Tom R., and E. Alan Lind. 1992. “A Relational Model of Authority in Groups.” Advances in Experimental 
Social Psychology 25:115-91. 


Utne, Mary Kristine, and Robert F Kidd. 1980. “Equity and Attribution.” Pp. 63-93. Justice and Social 
Interaction. Gerald Mikula, editor. Springer-Verlag. 


Book Reviews 





Militants and Citizens: The Politics of Participatory Democracy in Porto Alegre 
By Gianpaolo Baiocchi 


Stanford University Press, 2005. xiv +224 pages. $70 (cloth), $22.95 (paper). 


Say “Porto Alegre” and many people think of a democratic utopia, a city where ordinary 
citizens, many of whom are poor and illiterate, make the decisions that affect their lives in a 
kind of never-ending deliberative workshop. Yet the romance surrounding Brazil's Porto Alegre 
and its participatory management has discouraged scholars from explaining why and when 
the process actually works. In Militants and Citizens, University of Massachusetts sociologist 
Gianpaolo Baoicchi sets himself that task. His conclusions are surprising, and have broader 
implications for our understanding of the bases of civic engagement. 

To social theorists who locate the wellsprings of democratic innovation only in the union 
halls, churches, literary circles and other “free spaces” of civil society, Baoicchi shows that 
municipal governments can implement genuine mechanisms for citizen input. In the 
budgeting forums that Baoicchi observed, residents made decisions that mattered. They 
debated, negotiated and sometimes revised their original preferences, exhibiting a firm 
commitment to the public good. This was not always the case. Truly deliberative forums 
depended on the existence of activists and activist networks. This was true in two senses. It 
was activists who, in the late 1980s, pushed the governing party to expand participatory 
mechanisms in a way that made them effective. More importantly, and with implications that 
extend well beyond Porto Alegre, activists played a critical role in facilitating constructive 
political dialogue within the forums after they were established. Long experienced in 
negotiating among diverse interests and possessing strong oppositional credentials with 
residents, activists kept discussion focused and temperate while often pushing past the 
bounds of the immediate to promote political learning. They recruited participants, negotiated 
among parties before, after and during meetings, and secured information that residents 
needed to deliberate effectively. 

There are two important implications for democratic theorists. One is that democratic 
deliberation depends on what happens before, after and in the margins of meetings as much 
as during the formal period. The other is that activism is not the enemy of deliberation. As 
many democratic theorists have seen it, deliberation is civil, reasoned and oriented to the 
common good. Activism, by contrast, is palpably uncivil, impassioned and single-minded. 
Baoicchi rejects that view by drawing on an ingenious comparison of three municipal districts 
with very different civic configurations, that is, different relationships among neighborhood 
activists and officials. The district with longstanding activist networks and a popular council 
that functioned independently of the government-sponsored forum produced high levels of 
citizen participation and trust and few occasions in which the deliberative process broke 
down. By comparison, forums in a district lacking opportunities for civic interaction outside 
the forum were popular, but tended to become platforms for participants to attack and defend 
each other's personal reputations. This is not to say that activists and officials always worked 
together harmoniously. In a third district, activists were cohesive and mobilized but they 
consistently opposed the administration and sought to derail the budgeting forums. The 
result was that residents distrusted both the budgeting process and even their own 
delegates, and breakdowns in deliberation were frequent. The punch line is that government- 
sponsored participatory democracy depends on the existence of civil society associations, 
but ones that are unafraid to work within the system as well as outside it. 
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Baoicchi draws on a variety of methods to reach that conclusion, including ethnographic 
observation of citizen forums and activist meetings, interviews with officials and activists, and 
surveys of residents. His comparative strategy not only illuminates the structural conditions 
for effective citizen engagement, but offers a new way to think about the composition of civil 
society, long an ill-defined term. By combining survey data on residents’ levels of trust in their 
institutions with ethnographic observation of public encounters, Baoicchi gets at the 
processes by which trust is established or undermined. |In short, Militants and Citizens should 
have a wide audience among democratic theorists, political sociologists, scholars of civil 
society and sociologists of culture. 

Does the book give us a new way to think about whether Porto Alegre can be reproduced 
elsewhere? For many readers, this will be a pressing issue and | wish that Baoicchi had 
devoted more space to answering basic questions such as why people participated in the 
budget forums and how much power they actually had, as well as to-considering more fully 
the distinctiveness of the case. With respect to the first, a John Gaventa-style analysis of the 
issues that were kept off the table would have been valuable. With respect to the second, 
Porto Alegrean activists were a special breed: simultaneously community organizers and 
national movement activists, neighborhood-based and party-connected. | wonder if activists 
who are unused to performing on so many stages simultaneously could do all the things that 
Baoicchi expects of them. One final concern has to do with Baoicchi's conception of success. 
| referred earlier to the district in which activists’ oppositional style was responsible for an 
atmosphere of contention and distrust. But Baoicchi mentions, without reflecting much on 
the fact, that this was also the district that received the most funding in the budget process. 
So, while the presence of oppositional activists may not foster smooth deliberation, it may get 
more public goods for residents. That dilemma of participatory democracy merits more 
discussion. 

These concerns aside, Militants and Citizens offers a persuasive analysis of one of the 
most famous experiments in participatory democracy and a model for how to study live 
democracies in all their local messiness. 


Reviewer: Francesca Polletta, University of California, Irvine 





Democracy’s Voices: Social Ties and the Quality of Public Life in Spain 
By Robert M. Fishman 
Cornell University Press. 2004. 194 pages. $29.95 (paper). 


Robert M. Fishman in his book, Democracy’s Voices, Social Ties and the Quality of Public 
Life in Spain, traces an interesting picture of labor politics in Spain as he explores the 
relationship between social network ties and the quality of democratic life. Fishman 
focuses attention squarely on his primary independent variable — the ties between workers 
and intellectuals — wherein intellectuals are identified as “authors who write for an educated 
or general audience.” His primary claim is that worker-intellectual ties shape the quality of 
democratic life because they point leaders toward one of two distinct styles of labor politics 
- defensive localism or global discursive horizons. Defensive localism implies that leaders 
focus on local concerns by expressing decidedly local objectives, while leaders adopting 
global discursive horizons attempt to connect specific, local concerns to broader and more 
general, extra local objectives. Fishman compellingly elaborates on the differences 
between these two political styles with a rich comparison of the struggles of labor in 
numerous communities. 
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Fishman’s research strategy involves analyses of survey data from a snowball sample of 
leaders in 49 industrial, working-class communities in Spain, supplemented with qualitative 
interviews. Fishman treats readers to a fascinating discussion of the character of work-related 
politics in Spain, as well as an analysis of the Socialist and post-communist subcultures 
through which the influence of worker-intellectual ties operate. His empirical analyses also 
highlight the enduring effects for those participating in the opposition movement during the 
Franco period — a finding that resonates with McAdams work on Freedom Summer 
participants. Overall, Fishman assembles a persuasive case for the important influence that 
worker-intellectual ties have on the democratic process in Spain. 

Of course, any attempt to link social ties to democratic outcomes will eventually lead 
social scientists to thoughts of social capital. Fishman tackles social capital with surprising 
and convincing ferocity in chapter four. He argues that the concept of social capital is 
problematic because it is multidimensional, it masks important distinctions between crucial 
indicators, and its usage implies the misguided assumption that social relations represent 
instrumental investments. While his argument will no doubt ruffle a few feathers, his 
forceful discussion underscores the importance of considering carefully concepts that 
become vogue. 

The multivariate analyses presented in chapter six were a disappointment. The author 
uses logistic regression to predict the dichotomous outcome of whether local leaders 
articulate globalizing discursive horizons or defensive localism. In addition to including 
variables that were considered in bivariate analyses in previous chapters, the author 
introduces a series of control variables. Unfortunately, these control variables produce 
confusion for readers because their measurement is not carefully discussed, In particular, the 
variables that indicate three different institutional positions of the respondents in leadership 
roles initially appear to be a set of dichotomous dummy variables. Including a set of dummy 
variables in a logistic regression model is a straightforward procedure, with the interpretation 
of the dummy coefficients involving a comparison to an omitted category. In contrast to the 
typical presentation of a single model with a focus on comparisons to the omitted category, 
the author reports two models, each with a different omitted institutional position variable, “In 
order to assess the causal impact of all three institutional locations.” (p.134) The coefficients 
presented in these models, as well as the author's interpretation of these estimates, led me 
to believe that these institutional variables do not constitute a set of dummy variables as 
typically understood. In one table, the reported N differs between the two models. A more 
detailed description of the institutional position variables, as well as the other control 
variables, would be of great interest to readers. Additionally, while the bivariate analyses 
highlight the usefulness of measuring the strength of ties as a three-category variable (no ties, 
simple ties and multiple ties), the multivariate models appear to treat tie strength as a 
continuous variable. | would like to see an explicit examination of how each level of tie 
strength influences the likelihood of adopting globalizing discursive horizons in the 
multivariate models. 

Democracy’s Voices will interest scholars of social movements, the sociology of work, 
political sociology and social networks. The focus on Spanish labor politics is interesting and 
accessible to readers without prior knowledge of labor relations in Spain. Fishman does a 
masterful job of predicting, and in most cases, convincingly responding to readers’ reactions; 
most questions that | jotted down while reading were subsequently addressed by Fishman in 
the next sections of the book. Such foresight and organization are commendable. Finally, the 
critical discussion in chapter four should be required reading for anyone with an interest in 


social capital. 


Reviewer: Thomas Rotolo, Washington State University 
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Resisting Dictatorship: Repression and Protest in Southeast Asia 
By Vincent Boudreau 
Cambridge University Press, 2004. 290 pages. $75 (cloth). 


In this book, Vincent Boudreau offers a formidable contribution to the literature on 
democratization and resistance to authoritarian rule in Southeast Asia. It is framed around 
important theoretical questions, argued quite persuasively and written in a manner that is 
consistently and searchingly comparative. 

The book is on three Southeast Asian countries — Burma, Indonesia and the Philippines. Two 
of these countries, Indonesia and the Philippines, are now widely regarded as democracies, albeit 
flawed ones. Their respective fallen dictators — Hadji Muhammad Soeharto (deposed in 1998) and 
Ferdinand Marcos (deposed in 1986) — were confronted by large-scale protests after years of 
successfully engineering a mechanics of politics that sustained their grip on power. Burma, on the 
other hand, is still ruled by an autocratic military junta, which for years was dominated by the 
shadowy figure of the late Ne Win. Rule by military junta remains stubbornly in place. 

The main theoretical issue that Boudreau deals with is how to understand the way modes 
of protest in authoritarian settings are established in relationship to styles of state repression 
(p.1). He rightly argues against the kind of analysis that is mostly concerned with the 
quantitative dimension of mass organization as a measure of state repression as well as the 
existence of protest. His more qualitative concerns allow Boudreau to provide a picture that is 
quite nuanced, as he delves into the complexities of organized resistance, its many forms of 
expression, and its relationship to the different strategies employed by authoritarian regimes to 
contain the development of viable mass movements. He suggests that these state strategies 
form the broad parameters of what is possible for oppositional activists, and the kinds of 
strategies and forms of organization and resistance they will employ. Though he speaks of a 
“political calculus” (p.83) of activist responses, it is one that emerges from such things as fear, 
historical memory, as well as concrete experience, and is very different from the way in which 
social actors’ behavior is understood and predicted in crude forms of rational choice. 

In a nutshell, Boudreau — best known as a Philippines specialist — is interested in “the 
influences that shaped protest,” and to “describe more general patterns of state attack” as well 
as “movement response.” (p.16) While this makes great sense, in doing so he opens himself to 
criticism of underplaying the possibility of the reverse: do not the nature, objectives (e.g., 
moderate reform or violent revolution), and organizational capacities of opposition movements 
also conceivably help to shape the authoritarian state’s style and strategy of repression? Of 
course the answer in specific cases has to be dealt with empirically and not just investigated 
theoretically. 

But Boudreau’s work, in spite of some necessary broad sweeps, is clearly grounded in 
empirical and historical investigation. He analyses the legacies of colonialism in the three 
societies in question to understand the way in which the processes of state formation occurred 
and the way in which the relationship between states and movements of various kinds (e.g., 
communist, separatist, religious or elite/mnass based, geographically concentrated/dispersed) 
was experienced. He then compares the three post-colonial authoritarianisms. In the Philippines, 
Marcos’s repression — which notably did not involve the sort of killings seen in Burma and 
especially in a mass scale in Indonesia in the 1960s — produced an “organized, moderate and 
increasingly hardy fraction” of the anti-dictatorship movement, in spite of the existence of more 
radical streams. In Burma, pro-democracy elites had only weak links with mass society — partly 
a lingering legacy of colonial politics — but also little connection with regime members as the 
result of a kind of repression that allowed no room for any dissent. In Indonesia, an elite and 
moderate pro-democracy movement existed during Soeharto’s rule, but it was poorly organized 
and fearful of mass unrest (p. 243-44). 
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As always, there are objections that can be raised. Among these is that Boudreau 
ultimately reveals that he is indeed a Philippines specialist — some of the discussion of 
Indonesia is marred by the repeated misspelling of names and words. Also, the Indonesian 
scholar-activist Arief Budiman was not detained in 1990 and labor leader Muchtar Pakpahan’s 
serious brushes with the law only began in 1994, not 1990 (see p.128). There is also more that 
could have been done to systematically deal with the different socio-economic contexts 
provided by either fast-growing or moribund capitalism (Indonesia and the Philippines, until 
1997 and 1998, respectively) or isolation from the world economy (Burma). Boudreau does 
discuss “democratic transitions,” but there is no theoretical treatment of the concept. Still, 
these are relatively minor objections to an otherwise interesting and engaging contribution. 


Reviewer: Vedi R. Hadiz, National University of Singapore 





Civilization and its Contents 
By Bruce Mazlish 
Stanford University Press. 2004. 188 pages. $48 (cloth), $17.95 (paper). 


Until Sept. 11, 2001 the concept of “civilization” was in a gentle dormant state as far as the 
social sciences were concerned. The multicultural debates of the preceding decade regarding 
the hegemony of “Western civilization” as canon in teaching university students standards of 
humanistic good taste and peaks of knowledge did not arguably disturb the peace of 
sociology, economics or political science. But the past five years have changed the world, 
much stemming from challenges to Western civilization: a very violent political challenge from 
radical Islam, and increasingly strong economic challenges from two civilizations that Weber 
thought had fatal flaws in the quest for modernity: China and India. Add to this that in the past 
two decades the social sciences have become increasingly aware of the dynamics of change 
of the transformations of the world order from an order of nation-states to something called 
“globalization”, forcefully real but lacking a center, and one has all the ingredients necessary 
to seek a new integrative conceptual framework, a new cognitive vision or road map that 
might put in play “civilization” and “globalization” into a paradigm for the 21* century. 

It is for this playing field, | imagine, that Bruce Mazlish, an intellectual historian noted for 
his earlier interpretive study of the birth of sociology, takes up the challenge of 
“deconstructing” civilization as discourse, its origins, vision and interconnections from its 18" 
century Enlightenment context to the present. The result, somewhat like Russian matryoshka 
nesting dolls, is an engaging, polished study which reads easily with minimal jargon. (He can 
be forgiven for trying to advance as a concept “accivilization.”) 

In reaching for the origins and later modern development of the concept, the range is 
broad, from its initial coinage by the fiery orator Mirabeau, whose Enlightenment vision of 
civilization was in universalistic terms as a collective achievement of mankind, to the more 
restrictive and even racialist exclusionist perspective in the 19" century used as justification 
for imperial conquest and domination (sometimes idealized as a “mission civilisatrice"). The 
notion got new analytical and historical terrain in the 20th century by two persons Mazlish 
thinks highly of: Freud, whose shadow from his Civilization and its Discontents hovers over 
Mazlish’s title, and Elias, with the latter's secularist treatment of civilization as an ongoing 
historical process having a paramount function of curbing violence for state and elite 
domination. 

It is in the first five chapters that Mazlish is at his best; the subsequent chapters on “other 
civilizations” and “dialogue of civilizations in a global epoch” offer little insight or substance. 
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It is nice to hear “if the notion of civilization continues to have meaning, it will be in terms of 
a global civilization and local cultures,” but the trail stops cold after the statement “There is 
also the possibility of empirical research into their actual interactions."(p. 160) The author's 
ambivalence to “civilization” brings him to conclude that the concept’s usefulness is at an end 
and that we need to expand our notion of what it means to be “civilized.” 

Chapter Five opens up a needed dialogue between Western/European and “other 
civilizations,” but no mention is made of the fruitful pioneer work of Benjamin Nelson on 
“intercivilizational encounters” (in Nelson's posthumous On the Roads to Modernity, 1981). 
That might have come in handy in extending Mazlish’s focus on Egyptian civilization and 
its attraction for the West, if he had shown awareness of the powerful attraction of late 
18" century speculative Freemasonry for Egypt as a cradle of masonry (indicated in the 
reverse of the Great Seal of the United States). 

After Egypt, Mazlish proceeds to the encounter of Japan with the West, and the very 
successful adaptation (“accivilization”) of Japan to become “part of European civilization,” 
though the historical record was not one of contributing “a separate and equal form of the 
civilizing process.” (p. 106) A broader discussion of civilizational interchange should have 
included the great importance of Japanese aesthetics and art forms coming into Europe 
in the second half the 19" century and their influence on Western art perspective, while 
in the 20", the import of Zen Buddhism also has been marked. Perhaps most glaring is the 
absence of any reference to S.N. Eisenstadt’s masterful study of Japanese civilization 
(1996) and indeed to all of the voluminous works of Eisenstadt and his associates on the 
comparative study of civilizations within the paradigm of “multiple modernities.” 

For an advocate of dialogue, Mazlish's neglect of contemporary sociological theorizing 
on “civilization,” “globalization” and their linkage with modernity — for example, in such 
varied perspectives as those of Robertson (who launched the concept “globalization” 20 
years ago), Ritzer, and those in the recent collective volume of S. Arjomand and E. 
Tiryakian (Rethinking Civilizational Analysis 2004) — is jarring. But it is equally unfortunate 
that not even some of the salient earlier sociological contributors to the analysis and 
conceptualization of “civilization” enter into the contents of this study: neither Max Weber, 
nor Mauss, nor Sorokin! 

Sins of omission aside, this study has heuristic merit in drawing sociological attention 
to the potential of “civilization” and “civilizational processes” as conceptual tools for our 
new global period. While “civilization” may still be bandied about by zealot political and 
religious leaders to justify mobilization against “the other,” it may have far greater cognitive 
payoff for viewing dynamic cross-regional cultural networks because international 
migration and processes of globalization make more fluid the territorial boundaries of 
civilizations and render moot the question of supremacy or superiority of any one 
civilization. 

What of the original Enlightenment vision of “civilization” and the “civilizing process”— 
does it retain any normative vitality for conceptualizing “global civilization”? If we view the 
curbing of wanton or ritual violence and enhancing the universal dignity of humans as a 
complementary te/os of modernity, then it might be realized that the civilizing process is 
only at an early stage: not until on a global basis will violence, material and symbolic, end 
on women of all ages can there be a new phase of modernity. For nearly half of the world, 
“civilization” has been, de facto, ideology. How to make it a reality for the other half is an 
eminent challenge for sociology. 


Reviewer: Edward A. Tiryakian, Duke University 
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Class Practices: How Parents Help Their Children Get Good Jobs 
By Fiona Devine 
Cambridge University Press. 2004. 285 pages. $26.99 (paper). 


In Class Practices, Fiona Devine engages the empirical and theoretical work of John 
Goldthorpe to delve into the processes underlying patterns of social mobility and social 
reproduction in the United States and Great Britain. Devine wants to understand precisely 
how the middle class arrived at its status and how it attempts to maintain that status in the 
next generation. In particular, she is concerned that Goldthorpe has adopted a perspective 
that is too economistic to understand all the ways that families invest economic, cultural and 
social resources in their mobility projects. She also argues that Goldthorpe and Bourdieu, 
whose theory she also examines, are too deterministic in their claims about class mobility and 
reproduction. To study this issue, she samples doctors and teachers in the Boston and 
Manchester areas and, through life history interviews with 86 parents of 116 children, 
attempts to reconstruct the processes by which these professionals became middle class 
and how they are attempting to maintain or increase their children’s social positions. 

Concerned that Goldthorpe has focused too much on economic resources and neglected 
the role of cultural and social resources in determining class mobility and stability, Devine 
presents three pairs of chapters about how her respondents have used each of these types 
of resources. The first chapter of each pair discusses how the interviewees’ parents invested 
in a given resource to produce the interviewees’ social positions; the second recounts how her 
interviewees have invested that resource in their children’s attainments. Though this is a clever 
analytical strategy, the potential benefits of Devine’s rich qualitative data are undermined by the 
mode of presentation. Instead of presenting in a nuanced, textured way how families balance 
their varied resources in employing strategies of class mobility and reproduction, we get a pair 
of chapters on how the respondents used their economic resources, another pair on their use 
of cultural resources, and another pair on social resources. Subsequently, readers will fail to 
understand how families struggle with their decisions about balancing the investment of the 
different types of resources at their disposal. This tactic also doesn't allow the leverage gained 
from the comparative approach to highlight the U.S.-English differences. Though Devine is 
quite intrigued by Bourdieu’'s theory and wants to adopt, or at least adapt, parts of it, she 
misses the opportunity to construct, in a “thickly described” manner, her respondents’ habitus 
— their perceptions of the opportunity structure, of what's in the realm of the possible — and to 
explain how, in the context of that perception, families invest in their economic, cultural and 
social capital in different ways. In the endnotes, Devine provides intriguing glimpses of amore 
holistic account of a given family's investments, one that not only conveys the subtleties of 
family mobility strategies but makes clear that Goldthorpe's theory misses some key elements 
of the process. These should have been in the text itself. 

In the end, Devine makes two key points. The first is that family resources do not convert 
in any simple way to specific inter-generational patterns: there is much complexity and 
indeterminacy. The second point - this is more of an assertion — is that economic, cultural 
and social resources are equally important in reproducing privilege and power (pp. 178, 186). 
Along the way, there are many useful nuggets that provide depth to these broad emphases. 
Much of her work on social resources, for example, speaks to “the class character of social 
networks.” (p. 144) In particular, she gives a number of concrete examples of how one’s 
friends and one’s school, over and above one’s family, affect one’s perceptions of future 
possibilities. Throughout the book, Devine scatters empirical instances of specific 
connections among economic, cultural and social resources. 

| believe Devine probably got as much information as possible from her two-hour 
interviews and she seems to have squeezed from them enough analytical material for her tri- 
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partite resource story. At the same time, from the perspective of a U.S. reader, there were a 
number of reports from her Boston interviews that simply did not ring true. | would deem it 
quite rare, for example, for the son of a painting contractor to say that he “worked parttime 
whilst | was going to school...” (p. 29, emphasis added) | was quite surprised as well to read 
a quote from a mother that her daughter “didn’t apply to lvy League schools because they 
don't give money bursaries..." (p. 53, emphasis added) Perhaps such oddities are of no great 
consequence; perhaps, they simply reveal the difficulties of doing comparative research 
(even with the “same” language); they do, however, give one pause to wonder about what 
else might have been missed. 

Devine’s problem — how do we understand the processes of social reproduction and 
mobility on the ground, across generations, under different political and economic conditions — 
is an important one. Though | had questions about her theory, her data and her mode of 
presentation, | did learn that the answers are complicated and the findings are difficult to convey. 


Reviewer: David Karen, Bryn Mawr College 





American Juvenile Justice 
By Franklin E. Zimring 
Oxford University Press, 2005. 246 pages. $49.95 (cloth), $19.95 (paper). 


Franklin Zimring provides a collection of essays dealing with various policy issues in 
contemporary juvenile justice. In the introductory section, he admits the chapters in this book 
originally were written by him as individual essays and articles on various topics, then later 
organized and published as a book-length collection of readings. The work contains four parts 
— Adolescence: Social Facts and Legal Theory, A Rationale for American Juvenile Justice, The 
Adolescent Offender and Policy Problems in Modern Juvenile Justice. Each of these parts 
contains chapters, a total of 13, dealing with a specific issue or perspective. 

Zimring displays a keen ability to mesh deeply philosophical principles with empirical 
findings. The early chapters present philosophical discussions on the historical and personal 
significance of adolescence including the inherent immaturity of teenagers, an argument that 
modern adolescence is a “learners permit” (p. 17) for adult responsibilities, the relationship of 
individual variation in adolescent development to penal proportionality and diminished 
responsibility for offending behavior, as well as the possibilities for rehabilitation and the 
particular needs for punishment of juvenile offenders. He also spends significant effort 
defending the original treatment ideology of the juvenile justice system arguing this 
underlying philosophy and approach still are relevant in contemporary American society. 
Subsequent chapters examine specific policy issues including peer influence on juvenile 
offending and the culpability of kids who simply accompany other offenders, evidence 
supporting the notion that most youths age out of offending behavior, the role of juvenile 
policy regarding teen pregnancy, practices involved with waivers of juveniles to the adult 
criminal court, approaches to dealing with juvenile access to guns, and the complicated 
matters involved with handling young homicide offenders, among other topics. 

| found two chapters particularly interesting. The author provides a deconstruction of 
policy issues concerning the juvenile “superpreditor” (p. 105) scare of the mid-1990s, and 
another chapter presents thoughts on policy initiatives for combating minority 
overrepresentation in the juvenile justice system. Zimring uses strong empirical data 
demonstrating the prediction by various scholars and policy-makers of a violent future “blood 
bath” (p. 122), caused by a growing number of increasingly violent kids, had no evidence base 
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and, in fact, is not occurring. In another chapter he questions the utility of policies designed to 
decrease minority overrepresentation through bringing more non-minority youth into the 
system. He makes a convincing argument that the overall harm to all kids increases as a result 
of this approach. These sections of the book are extremely insightful, convincing, and 
demonstrate the author's impressive data analysis skill and theoretical insight. 

The book has a few minor shortcomings. First, there is very little substantive discussion of 
actual recent juvenile policy changes in specific jurisdictions. For example, even though the 
author extensively discusses increasing waivers to adult court along with the policy of 
extending the range of sentences available to the juvenile courts, more specific examples of 
this legislation, along with a brief overview of the social and political history of the contexts 
that give rise to these changes, would provide an excellent substantive grounding. Changes 
in juvenile policy are jurisdiction-specific and not sweeping at a national level. Second, the 
author admits the book is a collection of essays, but fails to provide a summary of common 
themes or thoughts. Given Professor Zimring's extensive research, it would be interesting to 
read his overall impressions when drawing conclusions from the collection of readings. Third, 
this book is written at a very sophisticated level making it a difficult fit for most undergraduate 
courses in juvenile justice. This text does not provide basic information about the juvenile 
system, but clearly assumes a fundamental familiarity with the stages, processes, 
terminology and history of the system. Zimring’s writing style often is complex and the 
discussion is beyond the reading ability of most undergraduates. However, the nature of the 
material certainly is appropriate for graduate students interested in the philosophical 
underpinnings, empirical research and policy implications of juvenile justice. 

It is clear these shortcomings are minor given the overall contribution this book offers 
when providing an excellent collection of thoughts and insights on contemporary juvenile 
justice issues. The intention of the book, despite its title, should not be assumed to be a 
general, introductory overview for lower-division undergraduates. In fact, a more accurate title 
might be Rethinking Contemporary Issues in Juvenile Justice or even Deconstructing Current 
Trends in Juvenile Justice Policy. However, Zimring’s insight, along with his expertise in the 
philosophical underpinnings, empirical data, and policy implications of contemporary issues 
and trends, makes this collection of essays an intriguing perspective for scholars, 
practitioners, and policy-makers in this important and influential realm of the justice system. 


Reviewer: Karl R. Kunkel, Missouri State University 





Rape Work: Victims, Gender, and Emotions in Organization and Community 
Context 

By Patricia Yancey Martin 

Routledge, Taylor & Francis Group, 2005. 280 pages. $85 (cloth), $24.95 (paper). 


Pathways to justice for rape victims have not been the easiest trails. Victims choosing to 
pursue rape charges have confronted multiple obstacles, not the least of which is the 
“second rape.” Rape Work is about the victim's pursuit of justice and those who could help or 
hinder her pursuit. Rape workers are those who can make the search for justice difficult at 
best, make the victim feel degraded, humiliated and victimized once again. Martin points out 
that rape workers do their work in the confines of organizations that are seldom placing the 
victim center stage in what is, without doubt, the biggest drama of her life. Martin draws on 
two primary conceptual tools, frames and networks, to highlight the plight of rape victims and 
workers. She defines the concept of frames as “cognitive schemata through which individuals 
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interpret and give meaning to concrete events, and activities are practices that are appropriate 
to given situations.” (40) Organizational structures provide goals and guidelines that members 
try to achieve, while simultaneously generating conduct norms that constrain members. Thus 
in the end it becomes important just how those frames are defined, operationalized and 
ultimately put into practice by rape workers. 

Patricia Martin elaborates on how the various individuals practice their profession within that 
set of frames (goals, policies, scripts, vocabularies, etc.), which may be counterintuitive to the 
interests of victims. Her conceptual model of organizational frames and networks has much to 
offer. Rather than seeing rape crimes entering and working their way through a system with an 
imagined set of integrated parts seeking a common goal of legal truths, or as sifting through a 
criminal justice sieve with ever decreasing holes, eliminating cases until only those with the 
proper characteristics are left to face the whims of juries, she envisions distinct groups 
networked together with distinct primary goals and often no strong central tie. The reality is that 
only one organization, the Rape Crisis Center (RCC), has at the center of its mission the interests 
of the rape victim. Perhaps because members tend to be activists they expect to be involved in 
highly emotional streams, and are prepared to work for and support the victim in every arena . 
It helps that RCCs are not tied in any formal sense to the legal system, leaving their frames 
completely unadulterated. In all the other organizations, rape and rape victims are only a small 
part of what agents confront and consequently they have little concern for the victim as the 
primary hub of their activities. Martin talks about this as “owning rape.” 

The key organizations involved with rape victims are emergency rooms, law enforcement, 
prosecutors, judges (courts) and defense attorneys. RCCs, although the latest group to be 
involved, have been around long enough to access their impact and as it turns out might just 
be the missing link that can bring all of these organizations into a network that is optimal for 
the victim. It is a major task and seems to require ramping down some of the more overt 
political tones of its activist roots. 

As mentioned, each organization comes with its set of frames that are not necessarily 
victim friendly. Hospital ERs are dominated by male physicians and typically female nurses. 
Authority clearly resides with the physicians. Many workers in this setting do not see the rape 
victim as a medical emergency placing other trauma victims before her. Performing duties 
more related to evidence collection and becoming unwitting participants in legal cases is 
outside their frame; they resent it, and to a degree, resist it. Although many ERs are now 
approaching rape victims in a more positive way, it is interesting to note that alternatives such 
as SANE programs seem to work better as a viable alternative simply because they have 
victims as the center of their attention. Law enforcement's frame stresses social control and 
order maintenance and focuses on investigating crimes and collecting evidence. Again, 
primarily a male-dominated profession; females are apt to be given the task of emotional 
support for the victim while males do fact-gathering tasks. Negative reactions by police at this 
point can lead to unfounding of rape cases. Victim statements and demeanor are scrutinized 
within the framework of credibility of evidence and with an eye on bringing order back to the 
community. For prosecutors (most lead prosecutors are male) working within the confines of 
heavy caseloads and limited budgets, victims are transformed into witnesses for the state. 
After all, crimes are by definition committed against the state. The legal system in the United 
States is an adversarial process pitting the great and mighty Oz (the state) against the 
accused. There exists a body of constitutional and case law designed to safeguard the 
defendant against potential abuses by the state. Judges oversee the proceeding and if it 
looks like they are on the side of the accused, often they need to be to keep the state from 
overstepping its bounds. The accused has to have a fair trial. There is no fair trial mandate for 
victims even where victim rights exist. It goes without saying that defense attorneys are 
dedicated to protecting their clients, predominantly males in a system where the accuser is 
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female. Part of their job is to take the star witness (the victim) apart at every level without 
losing the respect of the jury. 

Martin suggests that RCCs provide impetus for changing frames in traditional 
organizations, improve training and altering the construction of victims. The author carefully 
weaves through the connections, and the recommendations she offers at the end of the book 
— to make the whole process more victim centered — will create some discussion. 
Implications for changing the legal process to be more pro-victim may very well have 
ramifications for the entire legal process in other kinds of cases. This needs to be thought 
through very carefully. The fact is, what the victim needs may not be what the legal justice 
system needs. Rape Work is a book well worth the read particularly to those interested in how 
activism can have positive community. 


Reviewer: John Hamlin, University of Minnesota, Duluth 





Handbook of the Life Course 
Edited by Jeylan T. Mortimer and Michael J. Shanahan 
Kluver Academic/ Plenum Publishers, 2003. 750 pages. $295 (cloth). 


‘All the king’s horses and all the king’s men couldn’t put Humpty 
Dumpty together again.” 
— Mother Goose 


A handbook on life course research is long overdue. The earlier overviews by Sorensen, 
Weinert and Sherrod (1986) and Riley, Huber and Hess (1988) charted the potential of a new 
and exciting interdisciplinary research program wherein social history, demography, sociology, 
life-span psychology and microeconomics would converge in the study of human lives. Such 
a program would mean new longitudinal data collections, new methods and models of 
dynamic analysis as well as exponentially increased computing power could open up new 
opportunities for empirical analysis. After more than two decades of a mounting wave of 
research it was time to take stock of what has been achieved and which new developments 
are shaping this field. 

Jeylan Mortimer and Michael Shanahan have set for themselves the very ambitious goal 
of representing the current frontiers of theoretical, empirical and methodological work in a 
manner that captures the multifold perspectives across: life domains, life stages, disciplines, 
historical and national contexts and the levels of individual action and behavior and social 
organization. The powerful intellectual influence of Glen Elder is not only reflected in his and 
his co-author’s opening piece on “The Emergence and Development of Life Course Theory,” 
but very systematically throughout the Handbook. Most of the authors take all or parts of 
Elder’s heuristics of life course theory as the structuring guide for their chapters. 

The main rubrics and a number of subsections reflect the programmatic attempt of the 
editors to represent the quite divergent undercurrents and traditions of life course studies: 
human development and normative age differentiation, historical and cultural construction, 
social structure and demographic accounting, institutional influences and patterning, 
biography and life experience, functional aging and health. Under the rubric of “Historical and 
Cross-National Variability of the Life Course,” we find a general overview chapter on 
“Generations, Cohorts and Social Change” (Alwin and McCammon) and a comparative case 
study on the United States and Japan on “Stratified Incentives and Life Course Behavior” 
(Kariya and Rosenbaum). The rubric “Normative Structuring of the Life Course” is covered by 
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one partly systematic, partly empirical chapter by Settersten on “Age Structuring and the 
Rhythm of the Life Course.” 

The largest rubric “Movement Through the Life Course” contains five chapters on the 
“Institutional Structuring of Life Course Trajectories” focusing on newlyweds (Tallman), family 
context and individual well-being (Uhlenberg and Mueller), intergenerational relations (Putney 
and Bengtson), educational transitions (Pallas), an essay on contingent work lives (Heinz) and 
an excellent theoretical piece on governmental influences on the life course (Leisering). Three 
chapters under the subheading of “Transitions” deal with first-grade school entry (Entwisle, 
Alexander and Olson), the trajectory from school and training to work in a comparative 
perspective (Kerckhoff) and a “new” life stage called midcourse around retirement (Moen). 
The third subheading of “Turning Points” covers two partly overlapping contributions on 
desistance from crime (Sampson/Laub and Uggen/Massaglia) and an analytical overview of 
migration research with an empirical case study on the health of immigrants (Jasso). 

The theme of “Life Course Construction” is divided into two subsections on “Agency” and 
“Connections Between Early and Subsequent Life Phases,” the former being represented by an 
essay on self agency in which the life course connections are only marginally touched (Gecas) 
and the latter by highly informative reviews on the connections between childhood and adulthood 
(McLeod/Almazan) and on developmental continuity and discontinuity in adolescent substance 
use (Schulenberg/Maggs/O'Malley) and an empirical research report on adolescent work and its 
consequences for later careers (Mortimer, Staff and Oesterle). “Methods” are covered by five 
chapters: a less-than-up-to-date piece on cohort analysis (Glenn), event history models (Wu), 
panel and growth models (Halaby), latent structures (Macmillan and Eliason) and life story 
narratives (Cohler and Hostelter). Among the very best contributions to the volume are the three 
chapters on “Interdisciplinary Collaborations”: personality trait development in adulthood (Roberts 
and co-authors), biological models of behavior in the life course (Shanahan/Hofer and Shanahan), 
and SES and health over the life course (Frytak, Harley and Finch). 

Finally, six quite different kinds of papers are assembled under the rubric “The Future of the 
Life Course.” Dannefer connects “global geography” and “late modernity;” Furstenberg reflects 
about potential life course developments; George writes a kind of companion piece to Elder's 
and his co-author’s introduction; Hogan and Goldschneider assess the revolution in 
demography brought about by life course micro modeling; O'Rand introduces “life course risks” 
as a new analytical concept; and Weyman bridges the wide gap between Jean-Jacques 
Rousseau and Samuel Huntington in his erudite theoretical essay on the future of the life course. 

The contributions to Handbook demonstrate a very wide range of genres, from somewhat 
loose essayistic reflections (e.g., Heinz, Gecas) and empirical research papers (Mortimer et al., 
Kariya/Rosenbaum) to highly informative and highly organized summaries of extant analytical 
models and empirical research (e.g., Roberts et al., Entwisle et al.), from novel and innovative 
pieces (e.g., Leisering and Shanahan) to reproductions of well-known earlier work (e.g., Glenn, 
Sampson/Laub, Halaby). 

Despite these partly intentional and successful, partly unfortunate disparities in genre, 
Handbook succeeds admirably in informing about a large variety of topics, styles of theoretical 
thinking, modes of analysis and salient empirical research. For those users new to the field, 
Handbook provides efficient and informative introductions. But also practitioners well versed in 
life course research can learn a lot from this volume, especially from the chapters on 
neighboring disciplines. There will probably be few who read Handbook from beginning to 
end, but for the reviewer, this exercise has been highly instructive and often enjoyable. 

Obviously, even such a comprehensive Handbook cannot possibly cover all aspects and 
areas of the field. Still, there are a few conspicuous omissions. Aging is hardly touched on, 
although the subject is at the forefront of future policy concerns. The newly emerging subfield 
of cross-national institutional and policy impacts on life courses does not get the attention it 
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deserves. Also conspicuously absent are perspectives and research from economics. | also 
missed a piece about major longitudinal data collections. 

The publication of Handbook provides ample material on which to reflect about the state 
of the art. Even the editors and one of the contributing authors seem puzzled about what kind 
of field the life course actually is. George sees two primary forms of life course research. One 
focuses on the life course itself, its historical and cross-national patterns as well as its diversity 
between gender, class and race. Studies in this tradition have mainly served as important 
exemplars demonstrating the enormous fruitfulness and indispensability of the heuristics of 
a longitudinal and life course perspective. By now they have accomplished this service and 
future life course research will not be a specific area, but will be part and parcel of other 
substantive areas. 

| tend to agree with George that the attempt has failed (so far) to constitute life course 
research as a unitary field. Handbook provides ample evidence of that. Life courses are studied 
in fairly different manners as: developmental processes, culturally constructed life stages, 
biographical meanings, outcomes of institutional regulation and policies, demographic 
accounts or mere empirical connectivity across the life course. There is limited usefulness in 
putting all of this in one great bag. However, | disagree with George (and the editors of 
Handbook) that the principles and different modes of life course analysis are already 
sufficiently spelled out. On the one hand, we need to sharpen the analytical distinctions in 
order to make their systematic interrelations more productive. For example, there are vast 
research opportunities in studying institutional conditions as developmental tasks (see the 
chapters by Rosenbaum, Kariya and Kerckhoff) or the interplays between personality 
development and institutional breakdowns. We have also barely scratched the surface in 
unraveling the institutional effects and social policies of various political economies on life 
courses. On the other hand, there is still a long way to go in developing life course theory. In her 
chapter, O'Rand makes important proposals in regard to the conceptualization of life courses 
risks which could be very fruitfully combined with the somewhat different concepts of risk 
exposure and risk compensation by DiPrete (AUS 1999). Also, the intersection between life 
course analysis, stratification and class theory is still not sufficiently mapped out and applied 
empirically (e.g., in regard to collective class life histories and cumulative advantage). And, not 
least, it remains a Sisyphus task to overcome the constant tendency of fragmentation into life 
domains (like family formation and working lives), life stages and disciplines. 


Reviewer: Karl Ulrich Mayer, Yale University 








From Adolescence to Adulthood in the Vietnam Era 
By Timothy Owens 
_ Springer, 2005. 195 pages. $69.95 (cloth). 


The life course perspective provides a general framework for studying the temporal organization 
of lives. Lives are constituted of sequentially contingent changes in statuses and roles over the 
life span that are shaped by historical context, social relations and the biographical accumulation 
of experiences across contexts that influence individual perceptions and behaviors. Arguably, 
the transition to adulthood is the most pivotal — and perhaps fateful — of status changes because 
it represents the emergence of the adult self, who carries the influences of the past into the 
future and selects among new contexts for the further development of self. 

Timothy Owens’ study of the transition from adolescence to adulthood in the Vietnam era 
adopts this perspective. He follows boys from the 10" grade in 1966 to 1974 using the Youth in 
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Transition study (YIT), a five-wave longitudinal study. He is interested in how the movement of 
these boys from high school to three alternative statuses — military service, fulltime work, and 
fulltime college attendance — affects their subsequent self-esteem in young adulthood. These 
three “choice contexts” are described in light of their historical circumstances, i.e. the mid- 
1960s to early 1970s when military service was associated with the Vietnam War, full-time work 
occurred in the recent historical environment of a thriving economy (soon to begin changing for 
the worse in 1974), and college attendance was the highest in the history of the United States. 
He argues that self-esteem should be influenced by these “occupational choices” in ways 
shaped by their historical context, although he makes no specific predictions drawn from 
theory for alternative outcomes. 

The linkage of self-esteem to social contexts is putatively framed by reference to three 
perspectives: Dannefer’s sociogenic thesis, Glenn's aging-stability hypothesis and Bronfenbrenner’s 
ecology of human development. These perspectives are summarized, but their integration into a 
coherent framework with a specific set of hypotheses is never fully developed. In effect, Owens 
argues that individual development unfolds across social contexts such as family of origin and post- 
high school choice contexts (c.f., Dannefer and Bronfenbrenner) in ways that reinforce relatively 
stable value orientations (c.f., Glenn), but again, without specific hypotheses. He defers to Elder's 
work on the principles of the life course perspective, but here again he does not integrate this with 
the other literatures. He also refers to theories of vocational personality and social networks that 
are left unintegrated into a coherent conceptual framework. Why he omits some middle-range 
theories, such as self-efficacy theory is not clear, although he later refers to them while interpreting 
his results. All of this produces an introductory chapter that offers little direction regarding the 
analyses to follow and fails to anticipate the results. 

The analysis confronts and attempts to deal with all the usual challenges of longitudinal 
research: sample attrition, missing data and omitted variables. Sample selection methods are 
applied and described in considerable detail. Attrition and missing values are handled in a 
straightforward way. Also, he retains those “other” cases (39 percent of the total sample) in the 
final models that do not fit into his three preferred normative choice contexts (military, work and 
college) as controls for sample selection. The “other” category consists of boys who moved 
between work, non-work and schooling too frequently to be classified in these categories over 
the period. 

However, one can argue that this decision itself introduces a kind of selective analytical 
bias that perhaps has more significant consequences for understanding the findings than 
conventional sample selection effects (which in the final structural equation models appear to 
have no effect on self-esteem anyway). Owens finds that his three normative contexts have 
weak negative (military, work) or no effects (college attendance) on self-esteem. But, he does 
find that “time-in-context” has the strongest positive effect on self-esteem. He argues that 
this finding demonstrates that “the longer an individual participates in a specific social 
context... the more competent and efficacious, and self confident one should feel.” (p. 146) 
Why, then, ignore the 39 percent of respondents who seem perhaps the most at risk of lower 
self-esteem. Why not look inside that modal category in the data for some answers to the life- 
course finding related to “time-in-context”? And, finally, in light of the growing awareness in 
life course research that such heterogeneity in patterns of transition are prevalent and 
probably consequential, why limit the analysis to normative categories? 

Other analytical decisions also raise questions: the exclusion of blacks and the treatment 
of educational aspirations and attitudes towards war as exogenous variables are two such 
decisions. Also, non-significant (college attendance) and unexpected (no SES effects on self- 
esteem) results raise more questions. 

This study provided a strategic opportunity to apply the life course framework to a cohort’s 
transition to adulthood that occurred in a formative historical setting — one that was volatile 
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and disruptive. Many technical challenges of longitudinal analysis were confronted. However, 
theoretical and conceptual innovation did not result. 


Reviewer: Angela M. O’Rand, Duke University 





Discrimination at Work: The Psychological and Organizational Bases 
Edited by Robert L. Dipboye and Adriene Colella 
Lawrence Erlbaum Associates, 2005. 508 pages. $99.95 (cloth). 


In one of the most comprehensive collections on workplace discrimination, the contributing 
authors make a strong case that workplace discrimination is alive and well, but workplaces 
are not beyond repair. As the title suggests, this collection of 18 chapters draws mainly from 
psychology to frame a discussion of workplace discrimination although some authors 
incorporate perspectives from management, economics and sociology into their discussions. 
In addition to synthesizing research on workplace discrimination, this volume also establishes 
a new framework for studying discrimination and new directions for future research. 

The chapters are grouped into three sections with overlapping themes. The first section 
provides a comprehensive explanation of the individual, group, organization and extra- 
organizational causes of discrimination at work. While each chapter in this section covers one 
source of discrimination, the authors recognize that all levels contribute to and shape how 
people exhibit, experience and respond to discrimination. One chapter in this section, Chapter 
3, critiques relational demography’s ability to explain discrimination. The authors of this chapter 
suggest four ways relational demography can improve our understanding of discrimination. 
Their suggestions bridge the gap between psychology and demography in ways that easily 
translate into practice. Consistent with the section's focus, Chapter 4 discusses the role of group 
membership and demographic composition on workplace discrimination. This chapter offers 
tangible ways for organizations to avoid group-based discrimination: construct new identities for 
members or allow members’ multiple identities to coexist. | anticipated greater specificity in the 
authors’ solutions, but understand that space limitations and complexity of their solutions made 
that difficult. In this section’s last chapter, the authors chart the environmental inputs, 
organizational throughputs and behaviors/processes, and multi-level outputs of discrimination. 
This framework will be extremely useful for scholars studying the organizational and 
environmental causes and consequences of discrimination. We learn from the chapters in this 
section that discrimination stems from a set of complex, inter-related processes and systems 
and that the elimination of discrimination requires equally complex solutions. 

The second section considers discrimination on the basis of group membership. Chapter 6, 
which addresses organizational race composition, clearly demonstrates the need for a multi- 
disciplinary approach to the study of workplace discrimination as well as a major limitation of a// 

- disciplines — the failure to study race beyond a black-white dichotomy. Less a criticism than a 
caveat, Chapter 7, on gender discrimination, concisely summarizes a substantial body of work, 
but does so at the expense of identifying the specific ways employers discriminate on the basis 
of sex. This section's remaining chapters discuss research on salient yet understudied 
dimensions of discrimination: sexual orientation, age, disability, personality and physical 
appearance. Together, these chapters demonstrate a crucial point about workplace discrimination 
—the underlying mechanisms linking different group memberships to disparate outcomes are 
varied. As a result, the theories that apply to one type of discrimination (e.g., race discrimination) 
do not necessarily apply to other forms of discrimination (e.g., age discrimination). For example, 
paternalism and pity on the part of a discriminator may set in motion disability discrimination. An 
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entirely different mechanism, homophobia, may play a role in the discrimination that lesbians and 
gays face at work. That said, the chapters in this section offer researchers a theoretical framework 
for investigating multiple types of discrimination and point out just how little we know about 
discrimination outside of race and sex discrimination. 

The volume’s final section consists of six chapters that focus on the practice, policy and 
legal implications for the research summarized here. With the exception of Chapter 16 (a 
summary of the causes and consequences of employment discrimination outside of the 
United States) and Chapter 17 (a description of the process of studying Wyoming's sex wage 
gap), the chapters in this section offer researchers and practitioners advice on what to do 
about workplace discrimination. Readers should read Chapters 13 and 15 together. The 
former identifies human resource practices that can diversify organizations; the latter 
highlights the unintended (negative) consequences of some human resource practices. In 
what is an impressive summary of the law as it pertains to workplace discrimination, Chapter 
14 clarifies for the reader what constitutes illegal discrimination and discusses how current 
law, with its simplistic assumptions about what motivates discriminatory behavior, may be 
unable to eliminate a lion's share of workplace discrimination. In the volume’s last chapter, the 
editors draw attention to the chapters’ common themes and lay out an impressive agenda for 
future workplace discrimination research. 

Because of its comprehensive summaries of current research — especially understudied 
forms of discrimination — and innovative ideas for future research, this volume is certain to 
become indispensable for working scholars. This volume will also appeal to practitioners 
charged with reducing workplace discrimination and to anyone who wants to better understand 
social interactions at work. | would recommend it for a graduate seminar on work/labor markets, 
stratification or social psychology. 


Reviewer: Julie A. Kmec, Washington State University 





The Time Divide: Work, Family and Gender Inequality 
By Jerry A. Jacobs and Kathleen Gerson 
Harvard University Press, 2004. $45 (cloth), $19.95 (paper). 


In The Time Divide, Jacobs and Gerson argue that a time crunch exists in America that cannot 
be overcome by individual efforts but rather requires changes in social policy. According to 
the authors: 


“Working parents in dual-earner and single parent households have 
always faced a time bind, and the principal change over the last thirty 
years has been a marked growth in the number of people living in 
these family situations.” P 39 


The issues of this book, to a great extent, reflect not only changes in women’s roles and 
expectations but also the emergence of a knowledge economy in which a minority of workers 
with higher levels of education and income work longer hours than those with lower levels. 
The authors examine hours of work and find evidence of this division. 

They reject the conclusions of Robinson and Godbey (1999) that free time has actually 
increased. Part of the reason for this rejection is the authors’ conclusions that self-reported 
hours of workers, used to measure working time by the Federal government, is an accurate 
measure of time spent on the job. “Across a range of groups, there is no evidence to support 
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the claim that some groups of workers are especially prone to exaggerate their devotion to — 
or time spent at — their jobs.” (p. 17) In spite of this claim, there is evidence that workers’ time 
estimates do exaggerate actual hours of work when compared with their own time diaries, 
and the longer the hours of actual work recorded in time diaries the more they overestimate 
hours of work in self-reports (Robinson and Godbey, 1999). (At my own university, faculty 
members estimate they work an average of 55 hours a week, which would be 8 a.m. to 5 p.m. 
with an hour for lunch six days a week and 8 a.m. to 4 p.m. on Sunday!) 

Jacobs and Gerson find not only that hours of work for the average worker have remained 
constant, but also that the short vacations of Americans are nothing new. What is new is a 
higher percentage of people in the labor force, with women's participation increasing since 
1970 from 43 percent to 60 percent and men’s declining somewhat from 80 percent to 76 
percent. Parents are also spending more, rather than less, time with children. 

The book maintains a politically correct tone in which “gender inequality” is automatically 
assumed to disadvantage females. Never mind that nine out of ten job-related deaths are 
males (Farrell, 2005) or that married males rarely have the option of working parttime or 
staying home. Never mind that it is not only married females who spend about twice as much 
time doing housework as their husbands. Women who live alone also spend twice as much 
time doing housework as men who live alone (Robinson and Godbey, 1999). There is also little 
attention given to the decline in time spent in housework by women. This decline during the 
past few decades, of roughly 12 hours a week, almost offsets the average additional time 
spent in paid work by females (Robinson and Godbey, 1999). The unspoken assumption of 
this book is that it would be desirable to have a society in which both responsibilities and time 
expenditures of males and females were equal in every category. It happens, however. All 
known societies, both past and present, have a division of labor by gender. As any 
anthropologist will tell you, males and females differ in terms of reproductive strategies and 
demands, so such a division is surely genetically derived. 

The Time Divide convincingly argues that many institutions of society, from schools to 
employers, still operate as though U.S. households were composed of one-worker families. It 
also demonstrates how misleading aggregated statistics about time use can be. What could 
have been addressed more is whether or not two-worker households are increasingly 
composed of individuals with unrealistically high expectations. 

The authors review the case for expanding consumerism at the heart of the time squeeze, 
but reject it as an essential explanation. Economists Hamermesh and Lee (2003) played down 
the seriousness of concerns over a sped-up lifestyle, characterizing it in large part as “Yuppie 
Kvetch.” Analyzing time stress data from five Western countries, they conclude that “.... 
complaints about busy lifestyles and lack of time are generally a feature of well-off couples 
who have a lot of income and not enough time to spend it.” 


Reviewer: Geoffrey Godbey, Penn State University 
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Fighting for Time: Shifting Boundaries of Work and Social Life 
Edited by Cynthia Fuchs Epstein and Arne L. Kalleberg 
Russell Sage Foundation, 2004. 354 pages. $39.95 (cloth). 


In the late 1990s, American workers surpassed Japanese workers in average number of hours 
of paid employment per year. This shift was accompanied by other changes in the social 
organization of work, many of which have explicit temporal references: the rapid diffusion of 
“temporary” work, the corporate press for “just-in-time” capability, the growing “time bind” 
faced by dual-career families, and so forth. It is thus fitting that some of the brightest minds 
in sociology have begun to take time seriously as a social and sociological construct. One 
such set of efforts is represented by Fighting for Time, an edited volume whose contributing 
authors rely on a range of methodologies, from large-scale survey analysis to ethnographies, 
to demonstrate how sophisticated conceptualizations of time use and time norms can enrich 
our understanding of work and, to a lesser extent, family life. 

The first of three main sections in Fighting for Time describes trends in (paid) work hours 
and explores how work hours affect the emotional and physical well being of workers and 
their families. Jacobs and Gerson’s chapter, an extension of their influential 7he Time Divide 
(2004, Harvard University Press), emphasizes that the “time squeeze” is most salient for 
salaried professionals. Presser demonstrates how qualitative distinctions in work schedules — 
i.e., whether one works a standard or nonstandard schedule — affect interfamilial relationships 
and negotiations. More specifically, she finds that while evening and graveyard shifts can lead 
to more equitable gender distributions in household labor, they are also associated with 
higher probabilities of divorce and stress-related health problems. Fenwick and Tausig argue 
that the negative effects of shift work are largely dependent on the degree of autonomy that 
employees have over their schedules. 

The second section examines how employers use conceptions of time to structure and 
control the workplace. Stewart's study of bicycle messengers, Sharone’s study of engineers, 
and Collinson and Collinson’s study of middle managers reveal how employers create 
environments in which workers are “intrinsically” motivated to work longer, often more stress- 
filled, hours. By hiring too many bike riders, evaluating engineers on a curve or intensifying the 
workload of middle managers, employers embed social control mechanisms in the 
organization of work, mechanisms that continually push performance expectations higher. 
(Burawoy looms large here.) Bluedorn and Ferris offer the only chapter to deal explicitly with 
organizational, rather than individual, outcomes. They find that while long-term, forward and 
backward-looking management strategies tend to be lucrative for younger companies, such 
“temporal depth” does not appear to benefit older companies. 

The last section explores the gendered dimension of work time. Epstein argues that 
gendered norms about time use limit men and women’s ability to put egalitarian ideologies 
into practice. Men, for example, may forego paternity leave because of anticipated social 
sanctions from employers, colleagues and family members, while women may avoid jobs 
that require working during “family time.” Levin's fieldwork on commodity traders highlights 
the temporal dependency of gendered interactions at work. When trading is busy, gender is 
expressed through “gender-neutral” evaluations of competence, but when trading slows, 
sexualized difference becomes salient. In an oddly placed chapter, Blair-Loy shows that long 
hours are not always vilified by the employees who work them; indeed, about half of the 
female finance executives in her study retained their faith in a “work-devotion schema” and 
were rewarded for their long hours with not just high pay, but social status, independence, 
excitement and collegiality. 

Fighting for Time's main contribution lies in its thoughtful development of time as an 
analytical tool for future research on work, gender and organizations. This is, obviously, an 
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exceedingly ambitious task, and the editors should be praised for the diversity of approaches 
they included rather than faulted for not covering all the bases. There are, nonetheless, two 
sets of issues about the social organization of time whose absence from the volume we 
found curious. First, the volume lacks any mention of the temporal dimensions of housework 
or leisure, both of which are intrinsically linked to time use and time norms in paid work. 
Second, the volume is decidedly United States-centric, even though a truly comprehensive 
treatment of time as a social construct would require that researchers attend to the legal and 
institutional contexts in which expectations about work time and organizational strategies are 
developed. For example, in legal environments in which there are restrictions on the number 
of weekly hours that salaried employees can work (as in some European countries), 
productivity per hour, rather than number of hours, may be the metric by which employers 
assess employees and around which norms about time use develop. 

These gaps notwithstanding, Fighting for Time is a strong contribution to the literatures on 
time, work, gender and organizational dynamics. With the exception of the rare digression 
into sociological jargon (e.g., a “spatial-emotional dialectic” of bike courier work), the volume 
is very accessible. Although readers embedded in the work-family literatures will find some 
overlap with extant publications by a few of the chapters’ authors, they will also find enough 
fresh material here to keep them engaged for quite some time. 


Reviewers: Sarah Thébaud and Kim A. Weeden, Cornell University 
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